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207 Human Secreted Proteins 

Field of the Invention 

This invention relates to newly identified polynucleotides and the polypeptides 
encoded by these polynucleotides, uses of such polynucleotides and polypeptides, and 
5 their production. 

Background of the Invention 

Unlike bacterium, which exist as a single compartment surrounded by a 
membrane, human cells and other eucaryotes are subdivided by membranes into many 
functionally distinct compartments. Each membrane-bounded compartment, or 

10 organelle, contains different proteins essential for the function of the organelle. The cell 
uses ^'sorting signals," which are amino acid motifs located within the protein, to target 
proteins to particular cellular organelles. 

One type of sorting signal, called a signal sequence, a signal peptide, or a leader 
sequence, directs a class of proteins to an organelle called the endoplasmic reticulum 

15 (ER). The ER separates the membrane-bounded proteins from all other types of 

proteins. Once localized to the ER, both groups of proteins can be further directed to 
another organelle called the Golgi apparatus. Here, the Golgi distributes the proteins to 
Vesicles, including secretoiy vesicles, the cell membrane, lysosomes, and the other 
organelles. 

20 Proteins targeted to the ER by a signal sequence can be released into the 

extracellular space as a secreted protein. For example, vesicles containing secreted 
proteins can fuse with the cell membrane and release their contents into the extracellular 
space - a process called exocytosis. Exocytosis can occur constitutively or after receipt 
of a triggering signal. In the latter case, the proteins are stored in secretory vesicles (or 

25 secretory granules) until exocytosis is triggered. Similarly, proteins residing on the cell 
membrane can also be secreted into the extracellular space by proteolytic cleavage of a 
"linker" holding the protein to the membrane. 

Despite the great progress made in recent years, only a small number of genes 
encoding human secreted proteins have been identified. These secreted proteins include 

30 the commercially valuable human insulin, interferon. Factor VIII, human growth 
hormone, tissue plasminogen activator, and erythropoeitin. Thus, m light of the 
pervasive role of secreted proteins in human physiology, a need exists for identifying 
and characterizmg novel human secreted proteins and the genes that encode them. This 
knowledge will allow one to detect, to treat, and to prevent medical disorders by using 

35 secreted proteins or the genes that encode them. 
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Summary of the Invention 

The present invention relates to novel polynucleotides and the encoded 
polypeptides. Moreover, the present invention relates to vectors, host cells, antibodies, 
5 and recombinant methods for producing the polypeptides and polynucleotides. Also 
provided are diagnostic methods for detecting disorders related to the polypeptides, and 
therapeutic methods for treating such disorders. The invention fiirther relates to * 
screening methods for identifying binding partners of the polypeptides. 

10 Detailed Description 

Definitions 

The following definitions are provided to facilitate understanding of certain 
terms used throughout this specification. 

In the present invention, "isolated" refers to material removed from its original 

15 environment (e.g., the natural environment if it is naturally occurring), and thus is 
altered "by the hand of man" from its natural state. For example, an isolated 
polynucleotide could be part of a vector or a composition of matter, or could be 
contained within a cell, and still be "isolated" because that vector, composition of 
matter, or particular cell is not the original environment of the polynucleotide. 

20 In the present invention, a "secreted" protein refers to those proteins capable of 

being directed to the ER, secretory vesicles, or the extracellular space as a result of a 
signal sequence, as well as those proteins released into the extracellular space without 
necessarily containing a signal sequence. If the secreted protein is released into the 
extracellular space, the secreted protem can undergo extracellular processing to produce 

25 a "mature" protein. Release into the extracellular space can occur by many 
mechanisms, including exocytosis and proteolytic cleavage. 

As used herein , a "polynucleotide" refers to a molecule having a nucleic acid 
sequence contained in SEQ ID NO:X or the cDNA contained within the clone deposited 
with the ATCC. For example, the polynucleotide can contain the nucleotide sequence 

30 of the full length cDN A sequence, including the 5' and 3' untranslated sequences, the 
coding region, with or without the signal sequence, the secreted protein coding region, 
as well as fragments, epitopes, domains, and variants of the nucleic acid sequence. 
Moreover, as used herein, a "polypeptide" refers to a molecule having the translated 
amino acid sequence generated from the polynucleotide as broadly defined. 

35 In the present invention, die full length sequence identified as SEQ ID NO:X 

was often generated by overlapping sequences contained in multiple clones (contig 
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analysis). A representative clone containing all or most of the sequence for SEQ ID 
NO:X was deposited with the American Type Culture Collection ("ATCC"). As 
shown in Table 1, each clone is identified by a cDNA Clone ID (Identifier) and the 
ATCC Deposit Number. The ATCC is located at 10801 University Boulevard, 
5 Manassas, Virginia 201 10-2209, USA. The ATCC deposit was made pursuant to the 
terms of the Budapest Treaty on the intemational recognition of the deposit of 
microorganisms for purposes of patent procedure. 

A "polynucleotide*' of the present invention also includes those polynucleotides 
capable of hybridizing, under stringent hybridization conditions, to sequences contained 

10 in SEQ ID NO:X, the complement thereof, or the cDNA within the clone deposited widi 
the ATCC. "Stringent hybridization conditions" refers to an overnight incubation at 42° 
C in a solution comprising 50% formamide, 5x SSC (750 mM NaCl, 75 mM sodium 
citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 10% dextran 
sulfate, and 20 ^ig/ml denatured, sheared salmon sperm DNA, followed by washing the 

15 filters in O.lx SSC at about 65*^0 

Also contemplated are nucleic acid molecules that hybridize to the 
polynucleotides of the present invention at lower stringency hybridization conditions. 
Changes in the stringency of hybridization and signal detection are primarily 
accomplished through the manipulation of formamide concentration (lower percentages 
20 of formamide result m lowered stringency); salt conditions, or temperature. For 
example, lower stringency conditions include an overnight incubation at 37''C in a 
solution comprising 6X SSPE (20X SSPE = 3M NaCl; 0.2M NaH2P04; 0.02M EDTA, 
pH 7.4), 0.5% SDS, 30% formamide, 100 ug/ml sahnon sperm blockmg DNA; 
followed by washes at 50°C with IXSSPE, 0.1% SDS. In addition, to achieve even 

25 lower stringency, washes performed following stringent hybridization can be done at 
higher salt concentrations (e.g. 5X SSC). 

Note that variations in the above conditions may be accomplished through the 
inclusion and/or substitution of alternate blocking reagents used to suppress 
background in hybridization experiments. Typical blockmg reagents include 
30 Denhardt's reagent, BLOTTO, heparin, denatured salmon sperm DNA, and 

conmiercially available proprietary formulations. The mclusion of specific blocking 
^ reagents may require modification of tiie hybridization conditions described above, due 
to problems with compatibility. 

Of course, a polynucleotide which hybridizes only to polyA+ sequences (such 
35 as any 3* terminal polyA+ tract of a cDNA shown in die sequence listing), or to a 
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complementary stretch of T (or U) residues, would not be included in the definition of 
"polynucleotide." since such a polynucleotide would hybridize to any nucleic acid 
molecule containing a poly (A) stretch or the complement thereof (e.g., practically any 
double-stranded cDNA clone). 
5 The polynucleotide of the present invention can be composed of any 

polyribonucleotide or polydeoxribonucleotide, which may be unmodified RNA or DNA 
or modified RNA or DNA. For example, polynucleotides can be composed of single- 
and double-stranded DNA, DNA that is a mixture of single- and double-stranded 
regions, single- and double-stranded RNA, and RNA that is mixture of single- and 

10 double-stranded regions, hybrid molecules comprising DNA and RNA that may be 
single-stranded or, more typically, double-stranded or a mixture of single- and double- 
stranded regions. In addition, the polynucleotide can be composed of triple-stranded 
regions comprising RNA or DNA or both RNA and DNA. A polynucleotide may also 
contain one or more modified bases or DNA or RNA backbones modified for stability 

15 or for other reasons. "Modified" bases include, for example, tritylated bases and 
unusual bases such as inosine. A variety of modifications can be made to DNA and 
RNA; thus, "polynucleotide" embraces chemically, enzymatically, or metabolically 
modified forms. 

The polypeptide of the present invention can be composed of amino acids joined 

20 to each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres, and 
may contain amino acids other than the 20 gene-encoded amino acids. The 
polypeptides may be modified by either natural processes, such as posttranslational 
processing, or by chemical modification techniques which are well known in the art. 
Such modifications are well described in basic texts and in more detailed monographs, 

25 as well as in a voluminous research literature. Modifications can occur anywhere in a 
polypeptide, including die peptide backbone, the amino acid side-chains and the amino 
or carboxyl termini. It will be appreciated that the same type of modification may be 
present in the same or varying degrees at several sites in a given polypeptide. Also, a 
given polypeptide may contain many types of modifications. Polypeptides may be 

30 branched , for example, as a result of ubiquitination, and tiiey may be cyclic, with or 
without branchmg. Cyclic, branched, and branched cyclic polypeptides may result 
from posttranslation natural processes or may be made by synthetic methods. 
Modifications include acetylation, acylation, ADP-ribosylation, amidation, covalent 
attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a 

35 nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, 
covalent attachment of phosphotidylinositol, cross-linking, cyclization, disulfide bond 
formation, demethylation, formation of covalent cross-links, formation of cysteine. 
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formation of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI 
anchor formation, hydroxylation, iodination, methylation, myristoylation, oxidation, 
pegylation, proteolytic processing, phosphorylation, prenylation, racemization, 
selenoylation, sulfation, transfer-RNA mediated addition of amino acids to proteins 
5 such as arginylation, and ubiquitination. (See, for instance, PROTEINS - 

STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton,W. 
H. Freeman and Company, New York (1993); POSTTRANSLATIONAL 
COVALENT MODIFICATION OF PROTEINS, B. C. Johnson, Ed, Academic 
Press, New York, pgs. 1-12 (1983); Seifter et al., Meth Enzymol 182:626-646 (1990); 
10 Rattan et aL, Ann NY Acad Sci 663:48-62 (1992).) 

"SEQ ID NO:X" refers to a polynucleotide sequence while ■ 'SEQ ID NO: Y" 
refers to a polypeptide sequence, both sequences identified by an integer specified in 
Table 1. 

"A polypeptide having biological activity" refers to polypeptides exhibiting 
1 5 activity sunilar, but not necessarily identical to, an activity of a polypeptide of the 

present invention, including mature forms, as measured in a particular biological assay, 
with or without dose dependency. In the case where dose dependency does exist, it 
need not be identical to that of the polypeptide, but rather substantially sunilar to the 
dose-dependence in a given activity as compared to the polypeptide of the present 
20 invention (i.e., the candidate polypeptide will exhibit greater activity or not more than 
about 25-fold less and, preferably, not more than about tenfold less activity, and most 
preferably, not more than about three-fold less activity relative to the polypeptide of the 
present invention.) 

25 Polynucleo tides and Polypeptides of the Invention 

FEATURES OF PROTEIN ENCODED BY GENE NO: 1 

This gene is expressed primarily in melanocytes and, to a lesser extent, in 
testes, ovary, kidney and other tissues. 

30 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, cancer, disorders of neural crest derived cells including pigmentation, 
defects, melanoma, reproductive organ defects, and defects of the kidney. Similarly, 

35 polypeptides and antibodies directed to these polypeptides are useful in providing 

immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the skin. 
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reproductive, and renal systems, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
5 the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treating disorders that arise from alterations in 
the number or fate of neural crest derived cells including cancers such as melanoma and 
10 defects of the developing reproductive system. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 2 

This gene is expressed primarily in infant brain and fetal lung. 
Therefore, polynucleotides and polypeptides of the invention are useful as 

15 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, developmental disorders of the brain or lung. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 

20 of the above tissues or cells, particularly of the central nervous and pulmonary systems, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 

25 the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treating or diagnosing disorders associated 
with abnormal proliferation of cells in the Central nervous system and developing lung. 

30 

FEATURES OF PROTEIN ENCODED BY GENE NO: 3 

This gene is expressed primarily in breast lymph node and to a lesser extent in 

ovarian cancer and chondrosarcoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

35 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, immune responses such as inflanunation or immune surveillance for 



wo 98/54963 



PCT/US98/11422 



7 

tumors. This gene may be important for inflammatory responses associated with 
tumors. Similarly, polypeptides and antibodies dkected to these polypeptides are useful 
in providing inmiunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
5 immune system, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
10 individual not having the disorder. Preferred epitopes include those comprising a 

sequence shown in SEQ ID NO: 236 as residues: Lys-45 to Val-50, Lys-69 to Arg-76. 

The tissue distribution indicates that polynucleotides and polypeptides 
correspondmg to this gene are useful for treatment or diagnosis of immune responses 
including those associated witii tumor-induced inflammation. 

15 

FEATURES OF PROTEIN ENCODED BY GENE NO: 4 

This gene is expressed primarily in T-cells and T-cell lymphomas. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

20 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, immunilogical diseases involving T-cells such as inflammation, 
autoimmunity, and cancers including T-cell lymphomas. Similarly, polypeptides and 
antibodies directed to tiiese polypeptides are usefiil in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 

25 of the above tissues or cells, particularly of T-cells and other cells of die unmune 

system, expression of tiiis gene at significantly higher or lower levels may be routinely 
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 

30 expression level, i.e., the expression level in healtiiy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene arc useful for diagnosing and treating T-cell based disorders 
such as inflanmiatory diseases, autoinunune disease and tumors including T-cell 

35 lymphomas. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 5 
This gene is expressed primarily in activated monocytes. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
5 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, inflammation, autoinununity, infection, or disorders involving activation 
of monocytes. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing inmiunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 

10 the immune system, expression of this gene at significantly higher or lower levels may 
be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 

15 individual not having the disorder. Prefened epitopes include those comprising a 
sequence shown in SEQ ID NO: 238 as residues: Asp-19 to Arg-3 1. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosing or treating diseases that result in 
activation of monocytes including infections, inflammatory responses or autoimmune 

20 diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 6 

The translation product of this gene shares sequence homology with terminal 
deoxynucleotidyltransferase which is thought to be important in catalyzing the 

25 elongation of oligo- or polydeoxynucleotide chains. 

This gene is expressed primarily in activated human neutrophils. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 

30 not limited to, cancer, particularly those of tiie blood such as leukemia and deficiencies 
in neutrophils such as neutropenia. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the cardiovascular system, expression of this gene at 

35 significandy higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
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such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to terminal deoxynucleotidyltransferase 
indicates that polynucleotides and polypeptides corresponding to this gene are useful for 
treatment and differential diagnosis of acute leukemia's. Alternatively, this gene may 
function in the proliferation of neutrophils and be useful as a treatment for neutropenia, 
for example, following neutropenia as a result of chemotherapy. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 7 

The contig exhibits a reasonable homology to the human chorionic gonadotropic 
(HCG) analogue-GT beta-subunit as disclosed in U.S. Patent No. 5,508,261 and PCT 
Publication No. WO 92/22568. There is a high degree of conservation of the 
structurally important cysteine residues in these identities. 

This gene is expressed primarily in IL-1 and LPS induced neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, diseases of die inunune system, including inflanunatory diseases and 
allergies. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune system, expression of tiiis gene at significantly higher or lower levels may 
be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to die standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not havmg die disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treattnent/diagnosis of diseases of the immune 
system since expression is primarily in neutrophils, and may be useful as a growth 
factor for the differentiation or proliferation of neutrophils for the treatment of 
neutropenia following chemotherapy. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 8 

This gene is expressed primarily in IL-1- and LPS-induced neutrophils. 
Therefore, polynucleotides and polypeptides of die invention are useful as 

reagents for differential identification of die tissue(s) or cell type(s) present in a 
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biological sample and for diagnosis of diseases and conditions, which include,, but are 
not limited to, diseases of the immune system, including inflanunatory diseases and 
allergies. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 

5 or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune system, expression of this gene at significantly higher or lower levels may 
be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 

10 gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. Preferred epitopes include those comprising a 
sequence shown in SEQ ID NO: 241 as residues: Ser-14 to Pro-22, Leu-43 to Val-53. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment and diagnosis of diseases of the 

15 immune system since expression is primarily in neutrophils, and may be useful as a 
growth factor for the differentiation or proliferation of neutrophils for the treatment of 
neutropenia following chemotherapy. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 9 

20 This gene is expressed primarily in IL- 1 and LPS induced neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, diseases of the immune system, including inflanmiatory diseases and 

25 allergies. Similarly, polypeptides and antibodies directed to these polypeptides are 

useftil in providing inmiunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the inmiune system, expression of this gene at significantiy higher or lower levels may 
be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 

30 fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken fkom an mdividual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. Preferred epitopes include those comprising a 
sequence shown in SEQ ID NO: 242 as residues: Tyr-22 to His-35. 

35 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for treatment/diagnosis of diseases of the immune 
system since expression is primarily in neutrophils, and may be useful as a growth 
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factor for the differentiation or proliferation of neutrophils for the treatment of 
neutropenia following chemotherapy. 

FEATURES. OF PROTEIN ENCODED BY GENE NO: 10 
5 This gene is expressed primarily in activated T-cells and to a lesser extent in 

endothelial cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

10 not limited to, immune dysfunctions including cancer of the T lymphocytes and 
autoimmune disorders and inflammation. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the immune system, expression of this gene at 

1 5 significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

20 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for tteatment and diagnosis of immune disorders 
particularly of T-cell origin and may act as a growth factor for particular subsets of T- 
cells such as CD4 positive cells which would make this a useful therapeutic for the 
treatment of HTV and other inmiune compromising illnesses. 

25 

FEATURES OF PROTEIN ENCODED BY GENE NO: 11 
This gene is expressed primarily in fetal tissue. 

Therefore, polynucleotides and polypeptides of the invention arc useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

30 biological sample and for diagnosis of many developmental abnormalities. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the developing fetus, 
expression of this gene at significantly higher or lower levels may be routinely detected 

35 in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken fi-om 
an individual having such a disorder, relative to the standard gene expression level, i.e.. 
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the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful as a growth factor or differentiation factor for 
5 particular cell types in the developing fetus and may be useful in replacement or other 
types of therapy in cases where the gene is expressed aberrantly. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 12 

This gene is expressed primarily in T-cells and to a lesser extent in tumor tissue 

10 including glioblastoma, meningioma, and Wilm's tumor. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, diseases of the immune system including autoimmune conditions such as 

1 5 rheumatoid arthritis, inflanmiatory disorders and cancer. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the immune, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 

20 cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred 
epitopes include those comprising a sequence shown in SEQ ID NO: 245 as residues; 

25 Thr-9toSer-14. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis/ modulation of immune function 
disorders, including rheumatoid arthritis and inflammatory responses. 

30 FEATURES OF PROTEIN ENCODED BY GENE NO: 13 

This gene is expressed primarily in placenta and to a lesser extent in fetal liver 
and bone marrow. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
leagents for differential identification of the tissue(s) or cell type(s) present in a 
35 biological sample and for diagnosis of hematological disorders. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing inmiunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
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disorders of the above tissues or cells, particularly of the hematological and immune 
systems, expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful as a growth factor for hematapoietic stem cells or 
progenitor cells in the treatment of chemotherapy patients or kidney disease. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 14 
This gene is expressed primarily in stromal cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of hematapoietic disorders including cancer, 
neutropenia, anemia, and thrombocytopenia. Sunilarly, polypeptides and antibodies 
duTccted to these polypeptides are useful in providing inmiunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the hematapoietic and inunune, expression of 
this gene at significantly higher or lower levels may be routinely detected in certain 
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
coixesponding to this gene are useful as a growth factor for hematapoietic stem cells or 
progenitor cells, in particular following chemotherapy treatment. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 15 

The translation product of this gene shares sequence homology, with epsilon- 
COP from Bos taurus which is thought to be important as a component of coatomer, a 
complex of seven proteins, that is the major component of the non-clathrin membrane 
coat. Preferred polypeptides encoded by this gene comprise the following amino acid 

sequences: 

MAPPAPGPASGGSGEVDELFDVKNAFYIGSYQQCINEAXXVKLSSPERDVERD 
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VFLYRAYLAQRKFGVV1J)EIKPSSAPELQAVRMFADYLAHES 
MSRSXDVrmTFUJ^AASI^ 

RLD1JU^LKRMQDU)EDATLTQIATAWVS1ATGGE^^ 
PTLLLLNGQAACHMAQGRWEAAEGLLQEALDKDSGYPETLVNLIV^ 
5 PEVTNRYLSQLKDAHRSHPHKEYQAKENDFDRLVLQYAPSAEAGPELSGP 
(SEQ ID NO:458); or RDVERDVFLYRAYLAQRKFGVVLDEIKPSSAPELQAVRMF 
ADYLAHESRRDSIVAELDREMSRSXDVTNTTFLLMAASIY^ 
QGDSLECrAMTVQIU.KLDRLDLARKELKRMQDLDEDATLTQLA^^ 
GEKLQDAYYIFQEMADKCSPTLIJXNGQAACHMAQGRWEAAEGLLQEA^^ 

10 SGYPETLVNLIVLSQHLGKPPEVTNRYLSQLKDAHRSHPFIKEYQAKENDFDRL 
VLQYAPS A (SEQ ID NO:459). 

This gene is expressed primarily in activated monocytes and T-cells, and to a 
lesser extent in multiple other tissues. 

Therefore; polynucleotides and polypeptides of the invention are useful as 

15 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, inununomodulation, specifically relating to transport problems in these 
cells. Similarly, polypeptides and antibodies directed to these polypeptides are usefiil in 
providing immunological probes for differential identification of the tissue(s) or cell 

20 type(s). For a number of disorders of the above tissues or cells, particularly of the 

inmiune, expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 

25 expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to epsilon-COP indicates that 
polynucleotides and polypeptides corresponding to this gene are usefiil for treating 
/diagnosing problems with the cellular transport of proteins that may result in 

30 immunologic dysfunction. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 16 

The translation product of this gene shares sequence homology with an RNA 
helicase which is thought to be important m polynucleotide metabolism. The translation 
35 product of this contig exhibits good homology to the LbeIF4A antigen of Leishmania 
braziliensis. The LbeIF4A antigen, or immunogenic portions of it, can be used to 
induce protective inununity against leishmaniasis, specifically L. donovani, L. chagasi, 
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L. infantum, L. major, L. braziliensis, L. panamensis, L. tropica and L. guyanensis. It 
can also be used diaghostically to detect Leishmania infection or to stimulate a cellular 
and/or humoral immune response or to stimulate the production of interleukin-12. 

This gene is expressed primarily in colon cancer and to a lesser extent in 
pituitary. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of cancers particularly of the colon. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the gastrointestinal 
system, expression of this gene at significandy higher or lower levels may be routinely 
detected in certain tissues (e.g., cancerous and. wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. Preferred epitopes include those comprising a 
sequence shown in SEQ ID NO: 249 as residues: Glu-93 to Ala-98. Gln-150 to Leu- 
156, Leu-220 to Leu-231, Leu-268 to Arg-273, Val-324 to Pro-341, Arg-372 to Asn- 
380, Ser-405 to Gly-410, Phe-426 to Ala-433. Glu-458 to Asp.470, Arg-506 to Ser- 
547. 

The tissue distribution and homology to RNA helicase indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for development 
of diagnostic tests for colon cancer. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 17 

The translation product of this contig has sequence homology to a cytoplasmic 
protein that binds specifically to JNK designated the JNK interacting protein- 1 or JIP-1 
in mice. JIP-1 caused cytoplasmic retention of JNK and inhibition of JNK-regulated 
gene expression. 

This gene is expressed primarily in brain including pituitary cerebellum frontal 
cortex, fetal brain and to a lesser extent in the kidney cortex. 

Therefore, polynucleotides and polypeptides of the invention are usefiil as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of the central nervous system disorders including 
ischemia, epilepsy, Parkinson's disease, and schizophrenia. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providmg immunological 
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probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the central nervous system, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
5 plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Furthermore, the translation product of this contig may suppress the effects of 
the JNK signaling pathway on cellular proliferation, including transformation by the 
10 Bcr-Abl oncogene. Preferred epitopes include those comprising a sequence shown in 
SEQ ID NO: 250 as residues: Pro-6 to Ser-26, Ala-30 to Asp-41, Gly-55 to Ser-61, 
Gly-74 to Thr-80, Tyr-117 to Ala-123, Tyr-167 to Asp-172, Ala-212 to Cys-223, Pro- 
239toTyr-244. 

The tissue distribution indicates that polynucleotides and polypeptides 
15 corresponding to this gene are useful for enhanced survival and/or differentiation of 
neurons as a treatment for neurodegenerative disease. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 18 

The translation product of tfiis gene shares sequence homology with a liver 

20 stage antigen from a protozoan parasite. 

This gene is expressed primarily in fetal tissue and to a lesser extent in activated 
T-cells and other immune cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

25 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, developmental abnormalities and diseases of immune function. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune system, 

30 expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual havmg such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 

35 disorder. 
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The tissue distribution and homology to a protozoan antigen indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for 
treatment/immune modulation of parasitic infections. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 19 

Preferred polypeptide encoded by this gene comprise the following polypeptide 
sequences: 

MKAIGIEPSLATYHHIIRLFDQPGDPLKRSSFIIYDIMNELMGKRFSPKD 

PDDDKFFQSAMSICSSLRDLELAYQVHGLLKTGDNWKHGPDQHRNFYYSKFF 

DLI(XMEQroVTLKWYEDLIPSAYFPHSQTMmLLQALDVAN^ 

(SEQ ID NO:460); and/or KDSKEYGHTFRSDLREEILMLMARDKHPPELQVAF 

ADCAADIKSAYESQPIRQTAQDWPATSLNCIAILFLRAGRTQEAWKMLGLFRKH 

NKIPRSELLNELMDSAKVSNSPSQAIEVVELASAFSLPICEGLTQRVMSDFAINQ 

EQKEALSNLTALTSDSDTDSSSDSDSDTSEGK (SEQ ID NO:461). Polynucleotides 

encoding such polypeptides are also provided. 

This gene is expressed prin^rily in stromal and CD34 depleted bone marrow 
cells and to a lesser extent in tissues of embryonic origin. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, diseases of hematologic origin including cancers and immune 
dysfunction. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of tiie tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
die hematapoietic and immune, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include those 
comprising a sequence shown in SEQ ID NO: 252 as residues: Ser-28 to Gln-34. 

The tissue distribution indicates tiiat polynucleotides and polypeptides 
corresponding to tiiis gene are useful as a growth factor for hematopoietic stem cells or 
progenitor cells which may be useful in the treatment of chemotherapy patients 
suffering from neutropenia. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 20 

Preferred polypeptide fragments can be found in an alternative open reading 
frame. These preferred polypeptides comprise the amino acid sequence: 
MSSDNESDEDEDLKl^RRLRDKHLmQDLQSRQKHEIESLYTKLGKVPPAVI 
5 IPPAAPLSGRRRRPTKSKGSKSSRSSSLGNKSPQLSGNLSGQSAASVLHPQQTL 
HPPGNIPESGQNQLLQPLKPSPSSDNLYSAFTSDGAISVPSLSAPGQGTSSTNTV 
GATVNSQAAQAQPPAMTSSRKGTFTDDLHKLVDNWARDAMNLSGRRGSKGH 
MNYEGPGMARKFSAPGQLCISMTSNLGGSAPISAASATSLGHFTKSMCPPQQY 
GFPATPFGAQWSGTGGPAPQPLGQFQPVGTASLQNFNISNLQKSISNPPGSNL 

10 RTF (SEQ ID NO:462); IQDLQSRQKHEIESLYTKLGKVPPAVIPPAAPLSGRRRR 
PTKSKGSKSSRSSSLGNKSPQLSGNLSGQSAASVLHPQQTLHPPGNIPESGQN 
QLLQPLKPSPSSDNLYSAFTSDGAISVPSLSAPGQGTSST (SEQ ID NO:463); 
TSIXjAISVPSLSAPGQGTSSTNTVGATVNSQAAQAQPPAMTSSRKGTFTDDLH 
(SEQ ID NO:464); KGHMNYEGPGMARKFSAPGQLCISMTSNLGGSAPIS AAS 

15 ATSLGHFTK (SEQ ID NO:465); QPLKPSPSSDNLYSAFTSDGAISVPSLSAPG 
(SEQ ID NO:466). Also preferred are polynucleotide fragments encoding these 
polypeptide fragments. 

This gene is expressed in fetal liver and tissues associated with the CNS. 
Therefore, polynucleotides and polypeptides of the invention are useful as 

20 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, liver and CNS diseases. Similarly, polypeptides and antibodies directed 
to these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 

25 tissues or cells, particularly of the liver and CNS, expression of this gene at 

significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fiuids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 

30 in healthy tissue or bodily fluid from an individual not having the disorder. Preferred 
epitopes include those comprising a sequence shown in SEQ ID NO: 253 as residues: 
Gln-26 to Lys-34. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment for liver diseases such 

35 as hepatocellular carcinomas and diseases of the CNS. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 21 

In an alternative reading frame, this gene shows sequence homology to two 
recently cloned genes, kaiyopherin beta 3 and Ran^GTP binding protein 5. (See 
Accession Nos. gil2 102696 and gnllPIDIe32873 1 .) The Ran_GTP binding protein is 
5 related to importin-beta, the key mediator of nuclear localization signal (NLS)- 

dependent nuclear transport. Based on homology, it is likely that this gene may activity 
similar to the RAN^GTP binding protein. Preferred polypeptide fragments comprise the 
amino acid sequence: WVAAAESMXLLLECAXVRGPEYLTQMWHFMCDALIKA 
IGTEPDSDVLSEIMHSFAK (SEQ ID NO:467). Also preferred are polynucleotide 
10 fragments encoding these polypeptide fragments. 

This gene is expressed in thymus tissue. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

15 not limited to, immune disorders. Similarly, polypeptides and antibodies directed to 
tiiese polypeptides are usefiil in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels may be routmely detected in certain tissues (e.g., 

20 cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to tiie standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

25 corresponding to this gene are useful for diagnosis and treatment for immune disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 22 

This gene is expressed primarily in prostate and osteoclastoma tissues. 

Preferred polypeptide fragments also comprise the amino acid sequence: 
30 MEINNQN(:HVroLVRTVMENGVEGLLffGAFU>ESWUGVRCSSEPPKA^ 

AHSQKRRLDGWSHRHLRVHYCVSLTIHFS (SEQ ID NO:468). Also prefen^ are 

polynucleotide sequences encoding this polypeptide fragment. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
35 biological sample and for diagnosis of diseases and conditions, which include, but are 

not limited to, bone and prostate diseases, and cancers, particularly of die bone and 

prostate. Similarly, polypeptides and antibodies directed to these polypeptides are 
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useful in providing inununological probes for differential identification of the.tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the bone and prostate systems, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include those 
comprising a sequence shown in SEQ ID NO: 255 as residues: Met-1 to Ser-1 1. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment for bone and prostate 
disorders, especially cancers of those systems. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 23 

This gene shares sequence homology with the FK506-binding protein (FKBP- 
13) family, a known cytosolic receptor for the immunosuppressants. Recently, another 
group has cloned a very similar gene, recognizing the homology to FK506-binding 
protein family, calling their gene FKBP23. (See Accession No. 2827255.) 
This gene is expressed primarily in lymphoid tissues. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample, especially for those susceptible to immune suppressant therapies and 
for diagnosis of diseases and conditions, which include, but are not limited to, immune 
suppressant disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing inununological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the inunune system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include those 
comprising a sequence shown in SEQ ID NO: 256 as residues: Ala-19 to Val-31, Arg- 
38 to GIy-49, Ala-61 to Lys-66, Tyr-68 to Pro-78, Gly-1 16 to Ala-121, Asp-154 to 
Ser-162, Glu-173 to Ghi-186, Phe-194 to Gly-203, Pro-207 to Val-212. 
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The tissue distribution and homology to FKBP- 1 2 and - 1 3 indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis 
and treatment for inunune suppressant disorders. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 24 

This gene is expressed primarily in the brain and in the retina. This gene maps 
to chromosome 8, and therefore can be used in linkage analysis as a marker for 
chromosome 8. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

10 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, neurological and ocular associated disease states. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing inununological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 

1 5 disorders of the above tissues or cells, particularly of tiie disorders of the central 

nervous system, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to die standard 

20 gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. Preferred epitopes include those comprising a 
sequence shown in SEQ ID NO: 257 as residues: Cys-34 to Asp-40. 

The tissue distribution in retina indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the treatment and/or detection of eye disorders 

25 including blindness, color blindness, impaired vision, short and long sightedness, 
retinitis pigmentosa, retinitis proliferans, and retinoblastoma. Expression in die brain 
indicates a role in the is useful for the detection/treatment of neurodegenerative disease 
states and behavioral disorders such as Alzheimer's Disease, Parkinson's Disease, 
Huntington's Disease, schizophrenia, mania, dementia, paranoia, obsessive compulsive 

30 disorder and panic disorder. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 25 

This gene shows sequence homology to a newly identified class of proteins 
expressed in die nervous system, called stathmin family. (See Accession No. 2585991; 
35 see also Eur. J. Biochem. 248 (3), 794-806 (1997).) The stathmin family appears to be 
an ubiquitous phosphoprotein involved as a relay integrating various intracellular 
signaling pathways. These pathways affect cell proliferation and differentiation. 
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Preferred polypeptide fragments comprise the amino acid sequence: 
QDKHAEEVRKNKELKEEASR (SEQ ID NO:469); QQDLSPWAAPVGCPLXXASX 
TCHXLPLSGCLRRQSXSLPVVAXLCFWFSCPLASLFVPGQPCVTCPFPSLPFQD 
KHAEEVRKNKELKEEASR (SEQ ID NO:470). Also prefeired are the 
5 polynucleotide fragments encoding these polypeptide fragments. 
This gene is expressed primarily in brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

10 not limited to, neurological disorders. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the central nervous system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 

15 cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 
The tissue distribution indicates that polynucleotides and polypeptides 

20 corresponding to this gene are useful for the detection/treatment of neurodegenerative 
disease states and behavioral disorders such as Alzheimer's Disease, Parkinson's 
Disease, Huntintons Disease, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder and panic disorder. 

25 FEATURES OF PROTEIN ENCODED BY GENE NO: 26 

The polynucleotide sequence of this gene contains a domain similar to a Flt3 
ligand peptide. Preferred polypeptide fragments comprise the amino acid sequence: 
PTRCCTTQPCRSSARRPCWVPMVPSPEGREXQPTCPS (SEQ ID NO:471). Thus, 
this gene may have activity as binding to Flt3 receptors, a process known to promote 
30 angiogenesis and/or lymphangiogenesis. 

This gene is expressed in human tonsil, and to a lesser extent in 
tefatocarcinoma, placenta, colon carcinoma, and fetal kidney. 

Therefore, polynucleotides and polypeptides of the invention are useful as - 
reagents for identification of the tissue(s) or cell type(s) present in a biological sample 
35 and for diagnosis of diseases and conditions, which include, but are not limited to, 
diseases of the tonsil, as well as cancers, such as colon, reproductive, and kidney 
cancers. Similarly, polypeptides and antibodies directed to these polypeptides are useful 
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in providing immunological probes for differenUal identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
tonsils, colon, reproductive organs, and kidneys, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred 
epitopes include those comprising a sequence shown in SEQ ID NO: 259 as residues: 
Pro-22 to Glu-33. 

The tissue distribution in tonsil and several cancers and fetal tissues indicates 
that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis and treatment of diseases of the tonsil or colon, such as tonsillitis, 
inflammatory diseases involving nose and paranasal sinuses, especially during the 
infection of influenza, adenoviruses, pariainfluenza, rhinoviruses. The gene may also be 
useful in the diagnosis and treatment of neoplasms of nasopharynx or colon origins. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 27 

In an alternative reading frame exists a large open reading frame that encodes a 
preferred polypeptide. Preferred polypeptide fragments comprise the amino acid 
sequence: 

MKRSLNENSARSTAGCLPVPLFNQKKRNRQPLTSNPLKDDSGISTPSDNYDFP 

PLPTDWAWEAVNPEXAPVMKTVDTGQIPHSVSRPLRSQDSVFNSIQSNTGRSQ 

GGWSYRDGNKNTSLKTWXKNDFKPQCKRTNLVANDGKNSCPMSSGAQQQK 

QLRTPEPPNLSRNKETELLRQTHSSKISGCTMRGLDKNSALQTLKPNFQQNQY 

KXQMLDDIPEDNTLKETSLYQLQFKEKASSLRnSAVIESMKYWREHAQKTVLL 

FEVl^VUDSAVTPGPYYSKTFLMRDGKNTLPCVFi^mRELPR^ 

NYDQKKNIFCJCVSVRPAS VSEQKTFQAFVKIAD\^MQ^^ (SEQ ID 

NO:472); SQDS\TFNSIQSNTGRSQGGWSYRDGNKNTSLKTWXKNDFKPQCKR 

(SEQ ID NO:473); NKETELLRQTHSSKISGCTMRGLDKNSALQTLKPNF (SEQ ID 

NO:474);SSLRnSAVIESMKYWREHAQKTVllJEVlAV^ 

(SEQ ID NO:475); and PRLmCRVHRCVGNYDQKKNIFCJCVSVRPASVSEQKT 

FQAFV (SEQ ID NO:476). 

This gene is expressed primarily in human testes. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
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not limited to, male reproductive disorders, including cancer. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the male reproductive system, 
5 expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
10 disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful as a hormone with reproductive or other systemic 
functions; contraceptive development; male infertility of testicular causes, such as 
Kleinfeltens syndrome, varicocele, orchitis; male sexual dysfunctions; testicular 
1 5 neoplasms; and inflammatory disorders such as epididymitis. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 28 
This gene is expressed primarily in apoptotic T-cell. 
Therefore, polynucleotides and polypeptides of the invention are useful as 

20 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, diseases relating to T cells, as well as cancer in general. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
inununological probes for differential identification of the tissue(s) or cell type(s). For 

25 a number of disorders of the above tissues or cells, particularly of the disorders of the 
immune system, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 

30 gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for inunune disorders. Moreover, since the gene 
was isolated from an apoptotic cell and based on the understanding of the relationship 

35 of appptosis and cancer, it is likely that this gene may play a role in the genesis of 
cancer. 



wo 98/54963 



PCT/US98/11422 



25. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 29 
This gene is expressed primarily in human tonsils. 
Therefore, polynucleotides and polypeptides of the invention are. useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, gastrointestinal disorders. Similarly, polypeptides and antibodies directed 
to these polypeptides are useful in providing inmiunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the gastrointestinal system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatment of gastromtestinal 
diseases. 

20 FEATURES OF PROTEIN ENCODED BY GENE NO: 30 

The translation product of this gene shares sequence homology with C44C1.2 
gene product of Caenorhabditis elegans with unknown function. Preferred polypeptide 
fragments comprise the amino acid sequence: 

GVFRPCVCGRPASLTCSPLDPEVGPYCDTPTMRTLFNLLWLALACSPVHrn^ 
25 SDAKKAASKTIXEKSQFSDKPVQDRGLWTDLKAESVVLEHRSYCSAKARD 
FAGDVLGYVTPWSHGYDWKWGSKFTQISPVWLQLKRRGRElVff^ 
VDQGWMRA\TyCHAKGLfflWRIJJrEDWTYDDFRN\^ 
KNQHFDGFVVEVWNQII^QKRVGLIHMLTHLAEALHQARI^ 
DQLGMFraKEFEQLAPVUXjFSLMTYDYS 
. 30 KXKWRTKSSWGSTSNIXWTXRXPXDARXPVVGXRXIQXLKDHXP 

PQ (SEQ ID NO:477); TCSPLDPEVGPYCDTPTMRTIJNLLWLALACSPVHT^ 
(SEQ ID NO:478); LVVTDLKAES\^EHRS YCSAKARDRHFAGDVLG^^^ 
NSHGYDVTKVFGSKF (SEQ ID NO:479); REMFEVTGLHDVDQGWMRAVRK 
HAKGLHIVPRLOTDWTYDDFRNVLDSEDE (SEQ ID NO:480); HFDGFVVEVW 
35 NQLI^QKRVGLIHMLTHLAEALHQARLLALLVIPPAITPG^^ (SEQ ID 
NO:481);DGFSLMTYDYSTAHQPGPNAPLSWVRACVQVLDPKXKWRTKSSW 
GST (SEQ ID NO:482). Also preferred are polynucleotide fragments encodmg these 



10 
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polypeptide fragments. This gene maps to human chromosome 1 1 , and therefore is 
useful in linkage analysis as a marker for chromosome 11. 

This gene is expressed primarily in human T cells and to a lesser extent in 
human colon carcinoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, immune disorders and cancer. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the immune and gastrointestinal systems, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID ' 
NO: 263 as residues: Leu-2l to Ala-30, Ser-38 to Asp-47, Pro-87 to Asp-94, Leu-197 
to Thr-204, Pro-256 to Ser-262, Thr-277 to Arg-282, Thr-293 to Trp-303. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatment of immune 
disorders and gastrointestinal diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 31 

The translation product of this gene shares sequence homology with Ribosomal 
protein LU of Caenorhabditis elegans. (See Accession No. 156201.) Preferred 
polypeptide fragments comprise the amino acid sequence: 

ERGVSINQFCKEFNERTKDIKEGIPLPTKILVKPDRTFEIKIGQPTVSYFLK^ 
lEKGARQTGKEVAGLVTLKHVYEIARIKAQDEAFALQDWLSS 
GIRWKDLSSEELAAF QKERAIFLAAQKEADLAAQEEAAKK (SEQ ID NO:483). 
Also preferred are polynucleotide fragments encoding these polypeptide fragments. 

This gene is expressed in human embryo tissue and to a lesser extent in human 
epithelioid sarcoma and other tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for identification of the tissue(s) or cell type(s) present in a biological sample 
and for diagnosis of diseases and conditions, which include, but are not limited to, 
development disorders and epithelial cell cancer. Similarly, polypeptides and antibodies 
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directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the embryonic and epithelial cell systems, 
expression of this gene at significantly higher or lower levels may be routinely detected 
5 in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
10 NO: 264 as residues: Lys-34 to Gly-40. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatment of developmental 
disorders and epithelial cancer. 

15 FEATURES OF PROTEIN ENCODED BY GENE NO: 32 
This gene is expressed primarily in resting T cells. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

20 not limited to, inflammatory and general immune disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing inmiunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the immune system, expression of this gene 
at significantly higher or lower levels may be routinely detected in certain tissues (e.g., 

25 cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 
The tissue distribution indicates that polynucleotides and polypeptides 

30 corresponding to this gene are useful for the diagnosis and treatment of disorders of 
immune system.. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 33 

This gene is believed to reside on chromosome 1 . Accordingly, polynucleotides 
35 derived from this gene are useful in linkage analysis as chromosome 1 markers. 

This gene is expressed primarily in prostate and to a lesser extent in soarcs adult 

brain, human umbilical vein endothelial cells, and amniotic cells. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis oif diseases and conditions, which include, but are 
not limited to, prostate-related disorders. Similarly, polypeptides and antibodies 

5 directed to these polypeptides are useful in providing immunological probes for 

differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the urinary system and nervous system 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 

10 plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that the protein products of this gene are useful 
15 for the diagnosis and treatment of disorders of the urinary and nervous systems. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 34 

This gene shares sequence homology with R05G6.4 gene product. (See Accession No. 

gill 326338.) This gene also shares sequence homology with the cyclophilin-like protein 

20 CyP-60. (See Accession No. 1 199598, see also Biochem. J. 314 (1), 313-319 
(1996).) Preferred polypeptide fragments comprise the amino acid sequence: 
AVYTYHEKKKDTAASGYGTQNIRLSRDAVKDFIX:C(XSLQPCHDPV^ 
YEREAE.EYILHQKKEIARQMKAYEKQRGTRREEQKELQRAASQDHVRGFLEKE 
SAIVSRP LNPFTAKALSGTSPDDVQPGPSVGPPSKDKDKVLPSFWIPSLTPEAK 

25 ATKLEKPSRTVTCPMSGKPIJRMSDLTPVHFTPLDSSVDRVGLITRSERY 
RDSl^NATPCAVLRPSGAVVTI^CVEKLIRKDMVDPWGDKLTDRDI^ 
(SEQ ID NO:484); YLYEREAILEYILHQKKEIARQMKAYEKQRGTRREEQKELQ 
RAASQDHVRGFLE (SEQ ID NO:485); and FTAKALSGTSPDDVQPGPS VGPP 
SKDKDKVLPSFWSLTPEAKATKLEKPSRTVTCPMSGKPL (SEQ ID NO:486). 

30 Also preferred are polynucleotide fragments that encode these polypeptide fragments. 

This gene is expressed primarily in human testis and to a lesser extent in other 

tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
35 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, male reproductive disorders and in particular testicular cancer. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 



wo 98/54963 



PCT/US98/11422 



immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune system. 
Expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
5 plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not-having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
10 corresponding to this gene are useful for diagnosis and treatment of disorders of the 
male reproductive system and in particular of testicular cancer. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 35 

The translation product of this gene shares sequence homology with Lpe5p of 

1 5 Saccharomyces cerevisiae which is thought to be important m the .metabolism of 
phospholipids. 

This gene is expressed primarily in liver and brain. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

20 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, metabohc disorders. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the metabolic and nervous systems expression of this 

25 gene at significantiy higher or lower levels may be routinely detected in certain tissues 
(e.g.. cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., die 
expression level in healthy tissue or bodily fluid fix>m an individual not having the 

30 disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 268 as residues: Pro-14 to Leu-20, Lys-28 to Asn-38, Arg-109 to Arg-1 14, Lys- 
1 19 to Asn-124, Glu-152 to Leu-157, Pro-172 to Val-180. 

The tissue distribution and homology to LpeSp of Saccharomyces cerevisiae 
indicates that polynucleotides and polypeptides corresponding to this gene are usefid for 

35 the diagnosis and treatment of metabolic and nervous disorders. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 36 
This gene shares sequence homology with the nuclear ribonucleoprotein U (HNRNP 
U), encoded by C. elegans (See Accession gill703576.) Preferred polypeptide 
fragments comprise the amino acid sequence: 
5 MDTSENRPENDVPEPPMPIADQVSNDDRPEGSVEDEEKKESSLPKSFKRKISVV 
SATKGVPAGNSDTEGGQPGRKRRWGASTATTQKKPSISnTESLKSLIPDIKPL 
AGQEAVVDLHADDSRISEDEIERNGDDGTHDKGLKICRTVTQVVPAEGQENGQ 
REEEEEEKEPEAEPPVPPQVSVEVALPPPAEHEVKKVTLGDTLTRRSISQQKSGV 
SITIDDPVRTAQVPSPPRGKISNIVHISNLVRPFTLGQLKELLGRTGTLVEEAFWI 

10 DKIKSHCFVTYST\^AVATRTALHGVKWPQSNPKFLCADYAEQDELDYHRGL 
LVDRPSETKTEEQGIPRPLHPPPPPPVQPPQHPRAEQREQERAVREQWAERERE 
MERRERTRSEREWDRDKVREGPRSRSRSRXRRRKERAKSKEKKSEKKEKAQE 
EPPAKliDDIJTlKTKAAPCIYWLPLTDSQIVQKEAERAERAKER^ 
EQKEREKEAERERNRQLEREKRREHSRERDRERERERERDRGDRDRDRERDRE 

15 RGRERDRRDTKRHSRSRSRSTPVRDRGGR (SEQ ID NO:488). Also preferred are 
the polynucleotide fragments encoding this polypeptide fragments. 
This gene is expressed primarily in epididymus. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

20 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, diseases of the male reproductive system. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the male reproductive system, expression of 

25 this gene at significandy higher or lower levels may be routinely detected in certain 
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spuial fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 

30 disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are usefril for the diagnosis and treatment of male 
reproductive disorders. 

35 FEATURES OF PROTEIN ENCODED BY GENE NO: 37 
This gene is expressed primarily in amygdala. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, inflammatory diseases and reproductive disorders. Similarly, 
5 polypeptides and antibodies directed to these polypeptides are useful in providing 

immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the amygdala, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
10 plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
15 corresponding to this gene are useful for diagnosis and treatment of inflammatory 
diseases and reproductive disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 38 

This gene shares sequence homology with human opsonin protein P35 

20 fragment. (See Accession No. R94 181.) The opsonin protein activates the phagocytosis 
of pathogenic microbes by phagocytic cells. Preferred polypeptide fragments comprise 
the amino acid sequence: GCDSCPPHLPREAFAQDTQAEGECSSRAERADMCPDAP 
PSQEVPEGPGAAP (SEQ ID NO:489). Also preferred are polynucleotide fragments 
encoding these polypeptide fragments, 

25 This gene is expressed in immune-related tissues such as thymus, macrophage, 

T cells and to a lesser extent in many other tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

30 not limited to, inunune disorders and infectious disease. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing inmiunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the inmiune system and infectious disease, 
expression of this gene at significanfly higher or lower levels may be routinely detected 

35 in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e.. 
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the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 27 1 as residues: Lys-9 to Arg- 14, Mct-38 to Asp-5 1 . 

The tissue distribution indicates that polynucleotides and polypeptides 
5 corresponding to this gene are useful for diagnosis and treatment of imriiune disorders, 
as well as the treatment and/or diagnosis of infectious disease. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 39 

The translation product of this gene shares sequence homology with alpha-2 
10 type I collagen which is thought to be important in tissue repair. (See, e.g., 21 1607.) 
Preferred polypeptide fragments comprise the amino acid sequence: PQLPSCGRPW 
PGTASVFQSHTQGPREDPDPCRAQGSAGTHCPISLSPPRQ (SEQ ID NO:490). 
Also preferred are the polynucleotide sequences encoding these polypeptide sequences. 
This gene is expressed primarily in the brain and to a lesser extent in the kidney 
IS and thymus 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, brain, kidney, and immune disorders. Similarly, polypeptides and 

20 antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the brain, kidney, and immune disorders, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 

25 plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution and homology to alpha-2 type I collagen indicates that 
30 polynucleotides and polypeptides corresponding to this gene are useful for diagnosis 
and treatment of tissue repair, and brain, kidney, inrunune disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 40 

The translation product of this gene shares sequence homology with mini- 
35 collagen which is thought to be important in tissue repair tumor metastasis. (See 
Accession No. gnllPIDId 1006976.) Preferred polypeptide fragments comprise the 
amino acid sequence: PGFRGPSGSLGCSFFPRSLGRVLPPGCQRPGAHAD 
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SSPPPTP (SEQ ID NO:491). Also preferred are polynucleotides encoding this 
polypeptide fragment. 

This gene is expressed in ovarian cancer and to a lesser extent in dedritic cells 
and smooth muscle, 

5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not Umited to, tumor metastasis and tissue repair. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 

10 for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the tumor metastasis and tissue repair, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or ceU sample taken from 

15 an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level m healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 273 as residues: Asn-2 to His-1 1. 

The tissue distribution and homology to mini-coUegen gene indicates that 

20 polynucleotides and polypeptides corresponding to this gene are useful for diagnosis 
and treatment of tumor metastasis and tissue repair. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 41 

This gene shares sequence homology with the HIV TAT protein. (See 

25 Accession No. 328416.) Preferred polypeptide fragments comprise the amino acid 
sequence: EDLKKPDPASLRAASCGEGKKRKACKNCTCGLAEELEKEK 
SREQMSSQPKSACGNCYUjDAFRCASCPYD3MPAFKPGEKVIiS (SEQ ID 
NO:492); EDLKKPDPASLRAASCGEGKKRKACKNCTCGLAEELEKEK 
SREQMSSQPKSACGNCYLGDAFRCASCPYLGMPAFKPGEKVLlJSDSNUro 

30 (SEQ ID NO:493); CGNCYLGDAFRCASCPYLGMPAFKPGEKVLLSDS 

(SEQ ID NO:494); SCGEGKKRKACKNCTCGLAEELEKE (SEQ ID NO:495); 
SQPKSAC GNCYLGDAFRCASC (SEQ ID NO:496); and REAGQNSERQYVS 
LSRD (SEQ ID NO:497). Also preferred are polynucleotide fragments encoding these 
polypeptide fragments. 

35 This gene is expressed primarily in the infant brain and to a lesser extent in the 

breast and testes. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, brain, testes and breast disorders. Similarly, polypeptides and antibodies 
5 directed to these polypeptides are useful in providing inununological probes for 

differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the brain, testes and breast disorders, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 

10 plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 274 as residues: Pro-7 to Val- 15. 

1 5 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for diagnosis and treatment of brain, testes and 
breast, and other related disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 42 

20 This gene is expressed primarily in the infant brain, human cerebellum, and to a 

lesser extent in medulloblastoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

25 not limited to, brain related disorders and medulloblastoma and other brain cancers. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing inununological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
brain related disorders and brain cancers, including medulloblastoma, expression of this 

30 gene at significantly higher or lower levels may be routinely detected in certain tissues 
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue or bodily fluid firam an individual not having the 

35 disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 275 as residues: Thr-41 to Glu-47, 
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The tissue distribution indicates that polynucleotides and polypeptides . 
corresponding to this gene are useful for diagnosis and treatment of human brain related 
disorders, brain cancers, and meduUoblastoma. 



5 FEATURES OF PROTEIN ENCODED BY GENE NO: 43 

The translation product of this gene shares sequence homology with a 
phosphotyrosine-independent ligand for the Ick SH2 domain which is thought to be 
important in signal transduction related to phosphotyrosine-independent ligand for the 
Ick SH2 domain. (See Accession No. gill 184951.) Preferred polypeptide fragments 

10 comprise the amino acid sequence: ESSGQARTLADPGPGWPRQQGMCFGSLT 

GI^TTPHGFLTVSAEADPRLIESLSQMl^MGFSDEGGWLTRLLQTKNYDIGAAL 
DTIQYSKH (SEQ ID NO:498). Also preferred are polynucleotide fragments encoding 
this polypeptide fragment. It is likely that this gene is a new member of a family of 
phosphotyrosine-independent ligands for the Ick SH2 domains. 

1 5 This gene is expressed primarily in the placenta and to a lesser extent in 

endothelial cells and neutrophil. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

20 not Hmited to. reproductive, cardiovascular, immune, and infectious diseases. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
cardiovascular, reproductive, and immune system, and infectious diseases, expression 

25 of this gene at significantly higher or lower levels may be routinely detected in certain 
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinsil fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 

30 disorder. 

The tissue distribution and homology to a phosphotyrosine-mdependent ligand 
for the Ick SH2 domam indicates that polynucleotides and polypeptides correspondmg 
to this gene are useful for diagnosis and treatment of cardiovascular, reproductive, and 
immune system diseases, as well as infectious diseases. 



35 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 44 

This gene is expressed primarily in the fetal brain, cerebellum and to a lesser 
extent in the placenta. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
5 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, neuronal cell related disorders. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 

10 the above tissues or cells, particularly of the neuronal cell related disorders, expression 
of this gene at significantly higher or lower levels may be routinely detected in certain 
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 

15 the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 277 as residues: Thr-20 to Gly-28. 

The tissue distribution and homology to proline-rich protein genes indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis 

20 and treatment of neuronal cell related disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 45 

The translation product of this gene shares sequence homology with 
precerebeilin of human, which is thought to be important in synaptic physiology. (See 

25 Accession No. gil 1 8025 1 .) It has been observed that cerebellin-like immunoreacti vity is 
associated with Purkinje cell postsynaptic stmctures. Thus, it is likely that this gene 
also have synaptic activity. Preferred polypeptide fragments comprise the amino acid 
sequence: QEGSEPVLLEGECLWCEPGRAAAGGPGGAALGEAPPGRVAFXAV 
RSHHHEPAGETGNGTSGAIYFDQVLVNEGGGFDRASGSFVAPVRGVYSFRFH 

30 VVKVYNRQTVQVSLMLNTWPVISAFANDPDVTREAA^^ 

LRRGXSTGW (SEQ ID NO:499). Also preferred are polynucleotide fragments 
encodmg these polypeptide fragments. 

This gene is expressed primarily in cerebellum and infant brain. By Northern 
analysis, a single transcript of 2.4 kb was observed in brain tissues. 

35 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
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not limited to, neuronal cell signal transduction and synaptic physiology. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the neuronal cell 
5 signal transduction and synaptic physiology expression of this gene at significantly 
higher or lower levels may be routinely detected in certain tissues (e.g., cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
10 or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to gene or gene family indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis 
and treatment of neuronal cell related disorders. 

15 FEATURES OF PROTEIN ENCODED BY GENE NO: 46 

This gene is expressed in fetal liver and spleen, and to a lesser extent in bone 
marrow, umbilical vein, and T cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

20 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, disorders of the immune system, particularly hematopoiesis. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the hematopoiesis 

25 and immune disorders, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 

30 fluid from an individual not having the disorder. Preferred epitopes include those 
comprising a sequence shown in SEQ ID NO: 279 as residues: AspOO to Glu-57. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of hematopieotic and 
immune disorders. 



35 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 47 

The translation product of this gene shares sequence homology with a 12 kD 
nucleic acid binding protein of Feline calcivirus which is thought to be important in viral 
replication. (See Accession No. 59264) 
5 This gene is expressed primarily in human cardiomyopathy and to a lesser 

extent in T helper cells, fetal brain and synovial sarcoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

10 not limited to, cardiomyopathy as well as viral infection. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of tiie tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the cardiovascular system, expression of 
this gene at significantiy higher or lower levels may be routinely detected in certain 

15 tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or anottier tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e.. 
the expression level in healtiiy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 

20 NO: 280 as residues: Trp-20 to Cys-26. 

The tissue distribution in cardiomyopathy and homology to viral 12 kD nucleic 
acid binding protein indicates tiiat polynucleotides and polypeptides corresponding to 
this gene are useful for diagnosis and intervention of cardiomyopathy, including those 
caused by ischemic, hypertensive, congenital, valvular, or pericardial abnormalities. 

25 The gene expression pattern may be the consequence or die cause for tiiese conditions. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 48 

The translation product of tius gene shares sequence homology with tumor 
necrosis factor related gene product which is thought to be important in tumor necrosis, 
30 bacterial and viral infection, inunune diseases and inmiunoreactions. 

This gene is expressed primarily in colon and to a lesser extent in ovarian and 
N breast cancers. 

Therefore, polynucleotides and polypeptides of tiie invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
35 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, tumors of colon, ovary or breast origins. Similarly, polypeptides and 
antibodies directed to tiiese polypeptides are useful in providing immunological probes 
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for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the colon, ovary and breast, expression of ' 
this gene at significantly higher or lower levels may be routinely detected in certain 
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution and homology to Tumor necrosis factors indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for intervention 
of cancers of colon, ovary and breast origins, because TNF fanfiily members are known 
to be involved in the tumor development. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 49 

The translation product of this gene shares sequence homology with mucins, 
such as epithelial mucin, which is thought to be irriportant in exuracellular matrix 
functions such as protection, lubrication and cell adhesion (See for example Accession 
No. R68002). Preferred polypeptide fragments comprise the following amino acid 
sequence: PRSRPALRPGRQRPPSHSATSGVLRPRKKPDP (SEQ ID NO:500). 
Also preferred are polynucleotide fragments encodmg these polypeptide fragments. 
Moreover, this gene maps to chromosome 22ql L2-qter, and therefore, can be used as 
a marker in linkage analysis for chromosome 22. 

This gene is expressed primarily in corpus colosum. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, tumors, especially of corpus colosum, as well as metastatic lesions. 
Siniilarly, polypeptides and antibodies duected to these polypeptides are useful in 
providing inmiunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
corpus colosum and other solid tissues, expression of this gene at significantly higher 
or lower levels may be routinely detected in certain tissues (e.g., cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 
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The tissue distribution and homology to mucins indicates that polynucleotides 
and polypeptides corresponding to this gene are useful for serum tumor markers or 
immunotherapy targets because tumor cells have greatly elevated level of mucin - 
expression and shed the molecules into the epithelial tissues. 

5 

FEATURES OF PROTEIN ENCODED BY GENE NO: 50 

This gene is expressed primarily in CD34 depleted buffy coat coid blood and 
primary dendritic cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

10 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, hematopoietic disorders and immunological disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
inmiunological probes for differential identification of the tissue(s) or cell type(s). For 

1 5 a number of disorders of the above tissues or cells, particularly of the hematopoietic and 
inunune systems, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 

20 gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution in CD34 depleted buffy coat cord blood and primary 
dendritic cells indicates that polynucleotides and polypeptides corresponding to this 
gene are useful for diagnosis and treatment of hematopoietic and inmiune, disorders. 

25 Secreted or cell surface proteins in the above tissue distribution often are involved in 
cell activation (e.g. cytokines) or molecules involved in cell surface activation. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 51 

The translation product of this gene shares sequence homology with Interferon 

30 induced 1-8 gene encoded polypeptide which is thought to be important in binding to 
retroviral rev responsive element. Preferred polypeptide fragment comprise the 
following amino acid sequences: MTLITPSXKLTFXKGNKSWSSRACSSTLVDP 
(SEQ ID NO:501). Also preferred are polynucleotide fragments encoding these 
polypeptide fragments. 

35 ' This gene is expressed primarily in CD34 positive cells and neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
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biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, retroviral infection, such as AIDS, and other immune disorders. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
5 type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune system, expression of this gene at significantly higher or lower levels may be 
, routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 

10 gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. Preferred epitopes include those comprising a 
sequence shown in SEQ ID NO: 284 as residues: Gln-51 to Trp-62. 

The tissue distribution and homology to interferon induced gene 1-8 indicates 
that polynucleotides and polypeptides corresponding to this gene are useful for 

15 intervention of retroviral infection including HTV. The factor may be involved in viral 
stability or viral entry into the cells. Alternatively, the virus/factor complex may elicit 
the cellular immune reaction. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 52 

20 This gene shares sequence homology to immunoglobulin lambda chain (See 

Accession No. 2865484). Therefore it is likely that this gene has activity similar to an 
immunoglobulin lambda chain. Preferred polypeptide fragments comprise the following 
amino acid sequence: GHPSPALSIAPSDGSQLPCDEVPYGEAHVTRYCKKPLTNS 
HLETEAQSSSL (SEQ ID NO:502). Also prefeired are polynucleotide fragments 

25 encoding these polypeptide fragments. 

This gene is expressed primarily in Hodgkin's lymphoma. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

30 not limited to, Hodgkin's lymphoma and other immune disorders. Similarly, 

polypeptides and antibodies directed to these polypeptides arc useful in providing 
inununological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune system, 
expression of this gene at significantly higher or lower levels may be routinely detected 

35 in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e,g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e.. 
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the expression level in healthy tissue or bodily fluid from an individual not haying the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO; 285 as residues: Pro-27 to Thr-32. 

The tissue distribution in Hodgkin's lymphoma and the sequence homology 
indicates that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis of Hodgkin's lymphoma, since the elevated expression and secretion by the 
tumor mass may be indicative of tumors of this type. Additionally the gene product may 
be used as a target in the immunotherapy of the cancer.Because the gene is expressed in 
cells of lymphoid origin, the natural gene product may be involved in immune 
functions. Therefore it may be also used as an agent for immunological disorders 
including arthritis, asthma, immune deficiency diseases such as AIDS, and leukemia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 53 

This gene has extensive homology to cDNA for Homo sapiens mRNA for the 
ISLR gene(See Accession No. AB003 184). This protein is considered to be a new 
member of the Ig superfamily and contains a leucine-rich repeat (LRR) with conserved 
flanking sequences and a C2-type immunoglobulin (Ig)-like domain. These domains are 
important for protein-protein interaction or cell adhesion, and therefore it is possible that 
the novel protein ISLR may also interact with other proteins or cells. The ISLR gene 
was mapped on human chromosome 15q23-q24 by fluorescence in situ hybridization 
(See Medline Article No. 97468140). Homology to the ISLR gene has been confirmed 
by another independent group as well (See Accession No. Hs. 102 171) 

This gene is expressed in a number of tissues including human retina, hearty 
skeletal muscle, prostate, ovary, small intestine, thyroid, adrenal cortex, testis, 
stomach, spinal cord, fetal lung and fetal kidney tissues, colon, tonsil and stomach 
cancer, and to a lesser extent in endometrial stromal cells treated with estradiol, breast 
tissue, synovium, lymphoma, and number of other tumors. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, tumors of colon, ovary and breast origins. However, due to the wide 
range of expression in various tissues, protein may play a vital role m the development 
of cancer in other tissues as well, not just those mentioned above. Similarly, 
polypeptides and antibodies directed to these polypeptides are usefiil in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the colon, ovary and 
breast, expression of this gene at significantly higher or lower levels may be routinely 
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detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
5 individual not having the disorder. Additionally, this gene maps to chromosome 15q23- 
q24, and therefore, can be used as a marker in linkage analysis for chromosome 15. 

The tissue distribution in tumors of colon, ovary, and breast origins indicates 
that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis and intervention of these tumors, in addition to other tumors where 
10 expression has been indicated. Protein, as well as, antibodies directed against the 

protein may show utility as a tumor marker and/or inmiunotherapy targets for the above 
listed tumors and tissues. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 54 

15 Gene has homology to multidrug resistance gene 1 (See Accession No. 

P06795). Preferred polynucleotide fragments comprise the following sequence: 
GCTTCGTGTCCAACCCTCITGCCCTTCGCCTGTGTGCCTGGAGCCA 
CCACGCTCGCGTTTCCTCCTGTAGTGCTCACAGGTCCCAGCACCGATGGCA 
TTCCCTTTGCCCTGAGTCTGCAGCGGGTCCCTTTTGTGCrrCC^ 

20 GGTAGCCTCTCrCCCCCTGGGCCACTCCCGGGGGTGAGGGGGTTACCCCTT 
CCCAGTGTTTTTTATTCCTGTGGGGCTCACCCCAAAGTATT>^^ 
GTAA (SEQ ID NO:503). Also preferred are polypeptide fragments encoded by these 
polynucleotide fragments. 

This gene is expressed primarily in lung, esophagus, leukemia (Jurkat cells) and 

25 breast cancers and to a lesser extent in macrophages treated with GM-CSF fetal tissues 
and wide range of tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of die tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 

30 not limited to, cancer of wide range of origms. Similarly, polypeptides and antibodies 
dkected to these polypeptides are usefiil in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the solid tumors, lung and leukemia, 
expression of diis gene at significantly higher or lower levels may be routinely detected 

35 in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
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the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Furthermore, due to the high expression level in lung tissue and the proposed 
function of the multidrug resistance protein 1 gene as the efflux pump responsible for 
low-drug accumulation in multidrug-resistant cells, protein as well mutants thereof, 
5 may also be beneficial as a target for gene therapy, particularly for the chronic patient. 
Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 287 as 
residues: Met-1 to Lys-16. 

The tissue disuibution in wide range of cancers and fetal tissues indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for detection of 
10 cells in active proliferation, such as cancers. The gene products may be used for cancer 
markers or inmiunotherapy target. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 55 
This gene maps to the X chromosome. 

15 This gene is expressed primarily in the brain and to a lesser extent in the 

developing embryo. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

20 not limited to, neurodegenerative disease states and developmental disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders, including sex-linked disorders, of the above tissues or cells, 
particularly of the neurological, developmental systems, and cardiovascular system, 

25 expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 

30 disorder. Moreover, this gene maps to the X chromosome, and therefore, may be used 
as a marker in linkage analysis for this chromosome. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the detection/treatment of neurodegenerative 
disease states and behavioral disorders such as Alzheimer's Disease, Parkinson's 

35 Disease, Huntington's Disease, Klinefelter's, schizophrenia, mania, dementia, 

paranoia, obsessive compulsive disorder and panic disorder. In addition, the gene or 
gene product may also play a role in die treatment and/or detection of developmental 
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disorders associated with the developing embryo, sexually-linked disorders, or 
disorders of the cardiovascular system. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 56 
5 The translation product of this gene shares sequence homology with paxillin 

which is thought to be important in mediating signal transduction from growth factor 
receptors to the cytoskeleton. Preferred polynucleotide fragments comprise the 
following sequence: TGGCTCACTGTCTTACAATCACTGCTGTGGAATCATGA 
TACCACTTITAGCTCriTGCATCrrCOTCAGTGTATTriT 

10 AAGTAGATTITAACrGGACAACriTGAGTACTGACATCATrGATAAATAAACT 
GGCTTGTGGTTTCAA (SEQ ID NO:506). Also preferred are polypeptide fragments 
encoded by these polynucleotide fragments. More preferably, polypeptide fragments 
comprise the amino acid sequence: LDELMAHLTEMQAKVAVRAD 
AGKKHLPDKQDHKASLDSMLGGLEQELQDLGIATVPKGHCASCQKPIAGKVI 

15 HALGQSWHPEHFVCTHCKEEIGSSPFFERSGLXYCPNDYHQLFSPRCAYCAAP 
ILDKVLTAMNQTWHPEHFFCSHCGEVFGAEGFHEKDKKPYCRKDFLA 
CGGCNRPVLENYLSAMDTVWHPECFVCGDCFTSFSTGSFFELDGRPFCELHYH 
HRRGTLCHGCGQPITGRCISAMGYKFHPEHFVCAFCLTQLSKGIFREQNDKTY 
CQPCFNKLF (SEQ ID NO:507); KASLDSMLGGLEQELQDLGIATVPKGHC 

20 ASCQKPIAGKVIHAL (SEQ ID NO:508); CPNDYHQLFSPRCAYCAAPILDKVL 
TAMNQTWHPEHFFCSHCGEVFGAEG (SEQ ID NO:509); DKKPYCRKDFLAM 
FSPKCGGCNRPVLENYLSAMDTVWHPECFVCGDCFTSFSTGSFFELDGRPFCE 
L (SEQ ID NO:510); CGQPITGRCISAMGYKFHPEHFVCAFCLTQLSKGIFRE 
QNDKTYCQ (SEQ ID NO:5 1 1 ). Polynucleotide fragments encoding these preferred 

25 polypeptide fragments are also contemplated. 

This gene is expressed primarily in brain, and to a lesser extent in the 
developing embryo. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

30 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to,.neurological disease states and developmental abnormalities. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune and 

35 nervous systems, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
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cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. Moreover, smce this gene shares homology with a 
gene that maps to chromosome 11, (See Accession No.T87404), gene as well as its 
5 translated product may be used for linkage analysis on chromosome 1 1 . 

The tissue distribution and homology to paxillin indicates that polynucleotides 
and polypeptides corresponding to this gene are useful for the treatment and or detection 
of disease states associated \yith abnormal signal transduction in brain and/or the 
developing embryo. This would include treatment or detection of neurodegenerative 
10 disease states and behavioral disorders such as Alzheimer's Disease, Parkinson's 
Disease, Huntingtons Disease, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder and panic disorder and also in the treatment and or detection of 
embryonic development defects. 

15 FEATURES OF PROTEIN ENCODED BY GENE NO: 57 

This gene is expressed primarily in fetal spleen, brain, and to a lesser extent in 
six week old embryo. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

20 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, inmiune disorders, neurological disorders, and developmental 
abnormalities. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing inmiunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 

25 the immune and developmental systems, expression of this gene at significantly higher 
or lower levels may be routinely detected in certain tissues (e.g., cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 

30 or bodily fluid from an individual not having the disorder. Preferred epitopes include 
those comprising a sequence shown in SEQ ID NO: 290 as residues: Arg-28 to iGly-34. 

The expression of tiiis gene in fetal spleen indicates that polynucleotides and 
polypeptides corresponding to tiiis gene are useful for treatment/detection of inmiune 
disorders such as arthritis, astiima, immune deficiency diseases such as AIDS, and 

35 leukemia. In addition the expression of tiiis gene in tiie early embryo, indicates a key 
role in embryo development and hence the gene or gene product could be used in the 
treatment and or detection of embryonic development defects. This would include 
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treatment or detection of neurodegenerative disease states and behavioral disorders such 
as Alzheimer's Disease, Parkinson's Disease, Huntintons Disease, schizophrenia, 
mania, dementia, paranoia, obsessive compulsive disorder and panic disorder and also 
in the treatment and or detection of embryonic development defects. 

5 

FEATURES OF PROTEIN ENCODED BY GENE NO: 58 
The translation product of this gene shares sequence homology with the gene disrupted 
in the neurodegenerative disease dentatorubal-pallidoluysian atrophy. Moreover a long 
open reading fame exists in an alternative frame. Preferred polypeptide fragments 
10 comprise the following: 

MGSSQSVEIPGGGTEGYHVLRVQENSPGHRAGLEPFFDnVSINGSRLNKDND 
TLKDLLKXNVEKPVKMLIYSSKTLELRETSVTPSNLWGGQGLLGVSIRFCSFD 
GANENVWHVLEVESNSPAALAGLRPHSDVnGADTVMNESEDLFSLIETHEAKP 
LKLYVYNTDTDNCREVirrPNSAWGGEGSLGCGIGYGYLHRIPTRPFEEGKKIS 

1 5 LPGQMAGTPITPLKDGFTEVQLSSVNPPSLSPPGTTGIEQSLTGLSISSTPPA VSS 
VIJSTGVPTVPLIJPQVNQSLTSVPPMNPATTLPGLMPLPAGLPNLPNLNLNLPA 
PHIMPGVGLPELVNPGLPPLPSMPPRNLPGIAPLPLPSEFLPSFPLVPESSSAASS 
GELLSSLPPTSNAPSDPATTTAKADAASSLTVDVTPPTAKAPTTVEDRVGDSTPV 
SEKPVSAAVDANASESP (SEQ ID NO:512); SVEIPGGGTEGYHVLRVQENSPGH 

20 RAGLEPFFDFIVSINGSRLNKDNDTLKDLLKXNVEKPVKMLIYSSKTLELRErS 
VTPSNLWGGQGLLGVSIRFCSFDGANENVWH (SEQ ID NO:513); ESNSPAA 
LAGLRPHSDYnGADTVMNESEDLFSLIETHEAKPLKLYVYNTDTDNCREVnTP 
NSAWGGEGSLGCGIGYGYLHRIPTRPFEEGKKISLPGQMAGTPITPLKDGFTEV 
QLSSVNPPSLSPPGTTGIEQSLTG LSISS (SEQ ID NO:514); RIPTRPFEEGKKI 

25 SLPGQMAGTPrrPLKDGFTEVQLSSVNPPSLSPPGTTGIEQSLTGLSISSTPPAVS 
SVLSTGVPTVPLLPPQVNQSLTSVPPMNPATTLPGLMPLPAGLPNLPNLNLNLP 
APHIMPGVGLPELVNPGLPPLPSMPPRN (SEQ ID NO:516); PGLPPLPSMPPRN 
LPGIAPLPLPSEFLPSFPLVPESSSAASSGELLSSLPPTSNAPSDPATTTAKADAA 
SSLTVDVTPPTAKAPTTVEDRVGDSTPVSEKPVSAAVDAN (SEQ ID NO:517). 

30 This gene is expressed primarily in prostate cancer, and to a lesser extent in the 

pineal glands and in fetal lung. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

35 not limited to, neurological conditions and pulmonary disorders. Similarly, 

polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
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a number of disorders of the above tissues or cells, particularly of the nervous^ 
pulmonary, and endocrine systems, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
5 another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include those 
comprising a sequence shown in SEQ ED NO: 291 as residues: Asn-9 to Leu-14. 
The abundance of this gene in the pineal gland and its homology to a gene 

10 disrupted in the neurodegenerative disease state Dentatorubral-pallidoluysian atrophy 
indicates that this gene may be useful in the U-eatment and/or detection of other 
neurodegenerative disease states and behavioral disorders such as Alzheimer's Disease, 
Parkinson's Disease, Huntingtons Disease, schizophrenia, mania, dementia, paranoia, 
obsessive compulsive disorder and panic disorder. The abundance of this gene in fetal 

1 5 lung would suggest that misregulation of the expression of this protein product in the 
adult could Jead to lymphoma or sarcoma formation, particularly in the lung; that it may 
also be involved in predisposition to certain pulmonary defects such as pulmonary 
edema and embolism, bronchitis and cystic fibrosis; and thus tiie gen or the gene 
protein encoded by the gene could be used in the detection and/or treatment of these 

20 pulmonary disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 59 
This gene is expressed primarily in the developing embryo. 
Therefore, polynucleotides and polypeptides of the invention are useful as 

25 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, developmental abnormalities. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing inununological probes for 
differential identification of die tissue(s) or cell type(s). For a number of disorders of 

30 the above tissues or cells, particularly of the developmental system, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 

35 expression level in healthy tissue or bodily fluid firom an individual not having the 
disorder. 
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The expression of this gene primarily in the embryo, indicates the gene plays a 
key role in embryo development and that the gene or the protein encoded by the gene 
could be used in the treatment and or detection of developmental defects hi the embryo 
or in infants. 

5 

FEATURES OF PROTEIN ENCODED BY GENE NO: 60 

This gene displays homology to nestin, an intermediate filament protein, the 
expression of which correlates with the proliferation of Central Nervous System 
progenitor cells and that is useful in the identification of brain tumors. This gene maps 

10 to chromosome 1 , and therefore, may be used as a marker in linkage analysis for 
chromosome 1 (See Accession No. AA527348). 

This gene is expressed primarily in kidney and to a lesser extent in brain. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

15 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, renal disorders and neurodegenerative conditions. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the excretory and 

20 nervous systems, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 

25 individual not having the disorder. Preferred epitopes include those comprising a 
sequence shown in SEQ ID NO: 293 as residues: ThrTl28 to Asn-135. 

The tissue distribution and homology to nestm indicates that polynucleotides 
and polypeptides corresponding to this gene are useful for detection and/or treatment of 
neurodegenerative disease states and behavioral disorders such as Alzheimer*s Disease, 

30 Parkinson's Disease, Huntingtons Disease, schizophrenia, mania, dementia, paranoia, 
obsessive compulsive disorder and panic disorder. In addition, its abundance m kidney 
indicates that it is useful m the treatment and detection of acute renal failure and other 
disease states associated with the kidney. 



35 



FEATURES OF PROTEIN ENCODED BY GENE NO: 61 

Gene shares homology with the lau-ophilin-related protein 1 precursor as well as 
the calcium-independent alpha-latrotoxin receptor. Preferred polypeptide fragments 
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comprise the following amino acid sequence: 

lYKVFRHTAGLKPEVSCFENIRSCARXXXXXXXXXXXXWIFGVLHVVHASVV 
TAYLFTVSNAFQGMFIFLFLCVl^RKIQEEYYRLFKNW^^ (SEQ ID NO:518); 
WIFGVLHVVHASVVTAYLFTVSNAFQGMFIFLFLCVl^RKIQEEYYR^ 
C (SEQ ID NO:519). Also preferred are polynucleotide fragments encoding these 
polypeptide fragments. (See Accession No. 2213659) The translation product of this 
gene shares sequence homology with CD 97, a seven transmembrane bound receptor. 

This gene is expressed primarily in infant brain and in endothelial cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, neurological disorders and hematopoeitic disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
inununologicai probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the neurological and 
hematopoeitic systems, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another' 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include those 
comprising a sequence shown in SEQ ID NO: 294 as residues: Lys-13 to Leu-21. 

The tissue distribution of this gene suggest that it may be useful in the detection 
and/or U-eatment of neurodegenerative disease states and behavioral disorders such as 
Alzheimer's Disease, Parkinson's Disease, Huntingtons Disease, schizophrenia, mania, 
dementia, paranoia, obsessive compulsive disorder and panic disorder, while its 
expression in hematopoietic cell types indicates that the gene could be important for the 
treatment or detection of immune or hematopoietic disorders including arthritis, asthma 
and inmiunodeficiency diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 62 

This gene is expressed primarily in fetal liver and fetal spleen. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, hematological and inununologicai disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing inununologicai probes 
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for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the immune and hematopoetic systems, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 295 as residues: Ser-91 to Lys-98. 

The tissue distribution of this gene fetal liver and spleen indicates that the gene 
could be important for the treatment or detection of immune or hematopoietic disorders 
including arthritis, leukemia, asthma and immunodeficiency diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 63 

Gene shares homology with human serum amyloid protein. Preferred polypeptide 
fragments comprise the following amino acid sequence: 

ALTRIPPGDWVINVTAVSFAGKTTARFFHSSPPSLGDQARTDPGHQRRD (SEQ 
ID NO:520) (See Accession No. W13671). Also preferred are polynucleotide 
fragments encoding these polypeptide fragments This gene maps to chromosome 9, and 
therefore, may be used as a marker in linkage analysis for chromosome 9 (See 
Accession No. AA004342). 

This gene is expressed primarily in fetal liver and spleen. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, hematopoietic and inmiune disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing inununological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the hematopoietic and immune systems, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 
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The tissue distribution of this gene in fetal liver-spleen indicates that the gene 
could be important for the treatment or detection of immune or hematopoietic disorders 
including arthritis, leukemia, asthma, and immunodeficiency diseases. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 64 

This gene maps to chromosome 3, and therefore, may be used as a marker in 
linkage analysis for chromosome 3 (See Accession No. AA2 19669). 
This gene is expressed specifically in the brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
10 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, neurodegenerative disease states. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
15 the above tissues or cells, particularly of the neurological systems, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 
synovial fluid or spinal fluid) or another tissue or cell sample taken fix>m an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
20 expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the detection/treatment of neurodegenerative 
disease states and behavioral disorders such as Alzheimer's Disease, Parkinson's 
25 Disease, Huntintons Disease, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder and panic disorder. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 65 

Gene shares homology with a yeast protein. Preferred polypeptide fragments 
30 comprise the following amino acid sequence: LQEVNTTLPENSVWYERYKFDIP 

VFHL (SEQ ID NO:52 1). Also preferred are polynucleotide fragments encoding these • 
polypeptide fragments. (See Accession No. 1332638) 

This gene is expressed primarily in fetal tissue (fetus and fetal liver). 
Therefore, polynucleotides and polypeptides of the invention are useful as 
35 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, liver disorders and cancers (e.g. hepatoblastoma). Similarly, 
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polypeptides and antibodies directed to these polypeptides arc usefial in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the hepatic system, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ JD 
NO: 298 as residues: Asn-59 to Glu-64. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the detection and treatment of liver disorders 
and cancers (e.g. hepatoblastoma, jaundice, hepatitis, liver metabolic diseases and 
conditions diat are attributable to the differentiation of hepatocyte progenitor cells). In 
addition tiie expression in fetus would suggest a useful role for the protein product in 
developmental abnormalities, fetal deficiencies, pre-natal disorders and various would- 
healing models and/or tissue trauma. ' 

FEATURES OF PROTEIN ENCODED BY GENE NO: 66 

Gene has homology with a B-cell surface antigen which may indicate gene plays 
a role in the immune response, including, but not limited to disorders and infections of 
the immune system. Preferred polynucleotide fragments comprise the following 
sequence: TAGCATGTAGCCAGTCGAATAACNTATAAGGACAAAGTGGAGTC 
CACGCGTGCGGCCGTCTAGACTAGTGGATCCCCCGGCTGCAGGATTCGGC 
ACGAG (SEQ ID NO:523). Also preferred are polypeptide fragments encoded by 
these polynucleotide fragments (See Accession No.T94535). Additionally, this gene 
shares homology with an interferon-gamma receptor. Preferred polypeptide fragments 
also comprise the following amino acid sequence: MQGSGSQFRACLLCLCFSCPC 

SPGGPRWNSRQGGRRFPKTCRAISQNLVFKYKTFCPVRYMQPHRSSLCLHFTS 
YVFILSTWGSLRTYSTDLKKKKKNSRGGPVPIRPKS (SEQ ID NO:522); 

MQGSGSQFRACLLCLCFSCPCSPGGPRWNSRQGGRRFPKTCRAISQNLVFK 
(SEQ ID NO:524); PVRYMQPHRSSIXXHFTSYVFELSTWGSLRTYSTDLKKKKK 
NSRGGPVPIRPKS (SEQ ID NO:525); and GEEQRDCSLGWRGVGMRATHCQAA 
RMFVLFSLPKYAGL (SEQ ID NO:526). Also preferred are polynucleotide fragments 
encoding these polypeptide fragments 

Hiis gene is expressed primarily in T-cells and gall bladder. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, immunological disorders and conditions (immunodeficiencies, cancer, 
5 leukemia, hematopoeisis). Similarly, polypeptides, and antibodies directed to these 
polypeptides are useful in providing inmiunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune and digestive systems, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 

10 cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred 
epitopes include those comprising a sequence shown in SEQ ID NO: 299 as residues: 

15 Thr-41 toGly-52. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the treatment and diagnosis of immune 
disorders including: leukemias, lymphomas, auto-immune disorders, immuno- 
supressive (transplantation) and immunodeficiencies (e.g. AIDS), inflammation and 

20 hematopoeitic disorders. The expression of this gene in gall bladder would suggest a 
possible role for this gene product in digestive disorders, particularly of the pancreas. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 67 

This gene maps to chromosome 1 1, and therefore, may be used as a marker in 

25 linkage analysis for chromosome 1 1 (See Accession No, AAOl 1622). 

This gene is expressed primarily in a variety of fetal and developmental tissues 
(e.g. fetal spleen, infant brain). 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

30 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, developmental, immune or neurological abnormalities. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the developing 

35 immune and central nervous systems, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
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another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include those 
comprising a sequence shown in SEQ ID NO: 300 as residues: Ser-38 to Ser-43. 
5 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for developmental abnormalities or fetal 
deficiencies. The detection in infant brain would suggest a role in neurological disorders 
(both developmental and neurodegenerative conditions of the brain and nervous system, 
behavioral disorders, depression, schizophrenia, Alzheimer's disease, Parkinson's 
10 disease, Huntington's disease, mania, dementia). In addition, the detection in spleen 
would similarly suggest a role in detection and treatment of immunologically mediated 
disorders (e.g. inununodeficiency, inflammation, cancer, wound healing, tissue repair, 
hematopoeisis). 

15 FEATURES OF PROTEIN ENCODED BY GENE NO: 68 

This gene is expressed primarily in spleen, T-cells, and fetal heart. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

20 not limited to, immunological deficiencies, including AIDSand cardiovascular 

disorders. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing inmiunological probes for differential identification of the lissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune and cardiovascular systems, expression of this gene at significantly higher 

25 or lower levels may be routinely detected in certain tissues (e.g., cancerous and 

wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relativelo the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

30 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for the diagnosis and treatment of immune 
disorders including: leukemias, lymphomas, autoimmune disorders, 
inmiunodeficiencies (e.g. AIDS), immuno-suppressive conditions (transplantation) and 
hematopoeitic disorders. The expression in fetal heart indicates that polynucleotides and 

35 polypeptides corresponding to this gene are useful for the treatment and diagnosis of 
cadiovascular disorders (e.g. heart disease, restenosis, atherosclerosis, stoke, angina, 
thrombosis). 



wo 98/54963 



PCT/US98/11422 



56 



FEATURES OF PROTEIN ENCODED BY GENE NO: 69 

Gene shares homology with a human collagen protein. Preferred polypeptide 
fragments comprise the following amino acid sequence: 
5 MPRKTSKCRQLLCSGASRNADTAARQSTCSSHRPPGKIPSLGPRRXPGCXSVP 
SSRGEQSTGSPAAPRCGRRDAHRGLPGGAAMTPGDTWASFNPRAGHSKSQGE 
GQESSGASRQDRHPVSHWVERQREAWGAPRSSSAGGVKVAATTEREPEFKIK 
TGKA (SEQ ID NO:527); CSGASRNADTAARQSTCSSHRPPGKIPSLGPRRXPG 
CXSVPSSRGEQSTGSPAAPRCGRRDAHRGLPGGAAMTPGDTWASFNPRAGHS 
10 (SEQ ID NO:528); QGEGQESSGASRQDRHPVSHWVERQREAWGAPRSSSAGG 
VKVAATTEREPEFKIKTGKA (SEQ ID NO:529) (See Accession No. 124886). Also 
preferred are polynucleotide fragments encoding these polypeptide fragments 
This gene is expressed primarily in fetal heart. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

15 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, cardiovascular disorders. Similarly, polypeptides and antibodies dkected 
to these polypeptides are useful in providing inmiunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 

20 tissues or cells, particularly of the cardiovascular system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 

25 in healthy tissue or bodily fluid from an individual not having the disorder. Preferred 
epitopes include those comprising a sequence shown in SEQ ID NO: 302 as residues: 
Pro-32 to Ser-39. 

The tissue distribution mdicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the treatment and diagnosis of cadiovascular 

30 disorders (e.g. heart disease, restenosis, atherosclerosis, stroke, angina, thrombosis). 

FEATURES OF PROTEIN ENCODED BY GENE NO: 70 

The translation product of this gene shares sequence homology with a chicken 
single-strand DNA-binding protein. Preferred polypeptide fragments comprise the 
35 following amino acid sequence: 

MSPRYPGGPRPPLRIPNQALGGVPGSQPLLPSGMDPTRQQGHPNMGGPMQRM 
TPPRGMVPLGPQNYGGAMRPPLNALGGPGMPGMNMGPGGGRPWPNPTNAN 
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SIPYSSASPGNYVGPPGGGGPPGTPIMPSPADSTNSGDNMYTLMNAVPPGPNR 
PNFPMGPGSDGPMGGLGGMESHHMNGSLGSGDMDSISKNSPNNMSLSNQP 
GTPRDDGEMGGNFLNPFQSESYSPSMTMSV (SEQ ID NO:530); MSPRYPGG 
PRPPLRIPNQALGGVPGSQPLLPSGMDPTRQQGHPNMGGPMQRMTPPRGMVP 
5 LGPQNYGGAMRPPLNALGGPGMPGMNMGPGGGRPWPNPTNANSIPYSSASP 
GNY (SEQ ID. NO:531); LNALGGPGMPGMNMGPGGGRPWPNPTNANSIPYSS 
ASPGNYVGPPGGGGPPGTPIMPSPADSTNSGDNMYTLMNAVPPGPN (SEQ ID 
NO:532); GPMGGLGGMESHHMNGSLGSGDMDSISKNSPNNMSLSNQPGTPR 
DDGEMGGNFLNPFQSESYSPSMTMSV (SEQ ID NO:533); TCEHSSEAKAFHDY 

10 (SEQ ID NO:534). Also preferred are polynucleotide fragments encoding these 
polypeptide fragments. (See Accession No. 1562534) 

This gene is expressed primarily in placenta and to a lesser extent in the fetal 
heart and a variety of other tissues and cell types. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

15 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, developmental abnormalities, fetal deficiencies, and particularly of the 
cardiovascular system. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 

20 of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the reproductive system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 

25 the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the detection and treatment of developmental 
abnormalities or fetal deficiencies, ovarian and other endometrial cancers, reproductive 

30 dysfunction, cardiovascular disorders, and pre-natal disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 71 

This gene is expressed primarily in fetal liver and to a lesser extent in the breast 
and testes. 

35 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
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not limited to, liver disorders (including hepatoblastomas) and reproductive disorders. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
hepatic and reproductive systems, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healtiiy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for detection and treatment of liver disorders and 
cancers (e.g. hepatoblastoma, jaundice, hepatitis, liver metabolic diseases and 
conditions tiiat are attributable to the differentiation of hepatocyte progenitor cells). The 
expression in testes and breast indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the detection and treatment of endocrine and 
reproductive disorders (e.g. sperm maturation, milk production, testicular and breast 
cancers). 

FEATURES OF PROTEIN ENCODED BY GENE NO: 72 

This gene maps to chromosome 1, and therefore, may be used as a marker in 

linkage analysis for chromosome 1 (See Accession No. W93595). 

This gene is expressed primarily in smooth muscle and to a lesser extent in 

brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, cardiovascular and neurological disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the cardiovascular and central nervous 
systems, expression of this gene at significantiy higher or lower levels may be routinely 
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., 
semm, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid fix)m an 
individual not having the disorder. 
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The tissue distribution indicates that polynucleotides and polypeptides . 
corresponding to this gene are useful for the detection and treatment of restenosis, 
atherosclerosis, stroke, angina, thrombosis, wound healing and other conditions of 
heart disease. In addition, the expression in brain would suggest that polynucleotides 
5 and polypeptides corresponding to this gene are useful for the detection and treatment of 
developmental, degenerative and behavioral conditions of the brain and nervous system 
(e.g. schizophrenia, depression, Alzheimer's disease, Parkinson's disease, 
Huntington's disease, mania, dementia, paranoia, addictive behavior and sleep 
disorders). 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 73 

Gene shares homology with human stromalin-2. Preferred polypeptide 
fragments comprise the following amino acid sequence: 
QAFVLL^DLLUFSPQMIVGGRDFUlPLXaTPEATLQSELASF^ 

15 GSGA (SEQ ID NO:535); ACSYLLCNPEFTFFSRADFARSQLVDLLTDRFQQE 
LEELLQVG (SEQ ID NO:536),QKQLSSLRDRMVAFCELCQSCLSDVDTEIQEQV 
ST (SEQ ID NO:537); QVILPALTLVYFSILWTLTHISKSDAS (SEQ ID NO:538); 
STHDLTRWELYEPCCQLLQKAVDTGXVPHQV (SEQ ID NO:539). Also prefened 
are polynucleotide fragments encoding these polypeptide fragments (See Accession 

20 NO.R65208 ) This gene maps to chromosome 7, and therefore, may be used as a 
marker in linkage analysis for chromosome 7 (See Accession No. D52585). 

This gene is expressed primarily in the brain (infant brain, adult brain, pituitary, 
cerebellum, hippocampus, schizophrenic hypothalmus, amygdala). 

Therefore, polynucleotides and polypeptides of the invention are useful as 

25 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, developmental and neurodegenerative diseases of the brain and nervous 
system. Similarly, polypeptides and antibodies directed to these polypeptides are useful 
in providing inmiunological probes for differential identification of the tissue(s) or cell 

30 type(s). For a number of disorders of the above tissues or cells, particularly of the 
central nervous system, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 

35 standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include tiiose 



wo 98/54963 PCTAJS98/11422 

60 

comprising a sequence shown in SEQ ID NO: 306 as residues: Thr-25 to Lys^Se, Lys- 
55toSer-63. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for detection and treatment of developmental, 
5 degenerative and behavioral conditions of the brain and nervous system (e.g. 

schizophrenia, depression, Alzheimer's disease, Parkinson's disease, Huntington^s 
disease, mania, dementia, paranoia, addictive behavior and sleep disorders). 

FEATURES OF PROTEIN ENCODED BY GENE NO: 74 
10 This gene is expressed primarily in the hypothalamus of a human suffering from 

schizophrenia. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

15 not limited to, disorders of the CNS particularly schizophrenia. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the CNS, such as schizophrenia 
expression of this gene at significantly higher or lower levels may be routinely detected 

20 in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 

25 NO: 307 as residues: Gly-38 to Ala-44. 

The tissue distribution indicates that the protein products of this gene are useful 
for the study, diagnosis and treatment of schizophrenia and other disorders involving 
the CNS. 

30 FEATURES OF PROTEIN ENCODED BY GENE NO: 75 

Preferred polypeptides of the invention comprise the following amino acid 
sequence encoded by this gene: 

LAVSTSnCCADISTALPLGSSRPAPAPRHREHEHGHQARPPRLIATSU^PI^^ 
AAAQLLWTQLTPMGGRPGGRHSPPTLHTGPRALPPGPPHPSLHVAALSLLR 
35 (SEQ ID NO:540)- Polynucleotides encoding such polypeptides are also provided. 

This gene is expressed primarily in endometrial tumor and to a lesser extent in 
amniotic cells. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, reproductive and immune disorders particularly cancers of those 
5 systems. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing inununological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the reproductive and immune systems, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 

10 tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid firom an individual not having the disorder. Preferred epitopes include those 
comprising a sequence shown in SEQ ID NO: 308 as residues: Ser-3 to Arg-9. 

1 5 The tissue distribution indicates that the protein products of this gene are useful 

for study and treatment of immune and reproductive disorders particularly cancers of 
those systems. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 76 

20 This gene is expressed primarily in kidney cortex and to a lesser extent in early 

stage human brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

25 not limited to, renal disorders such as renal cancer. Similarly, polypeptides and 

antibodies directed to these polypeptides are useful in providing inununological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the kidney expression of this gene at 
significantiy higher or lower levels may be routuiely detected in certain tissues (e.g., 

30 cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken fix)m an individual having 
such a disorder, relative to die standard gene expression level, i.e., tiie expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred 
epitopes include those comprising a sequence shown m SEQ ID NO: 309 as residues: 

35 Gly-38 to Gly-45, Gly-47 to Gly-52, Pro-92 to Lys-1 10. 

The tissue distribution indicates that the protein products of this gene are useful 
for study, treatment and diagnosis of renal diseases such as cancer of die kidney. 
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FEATURES OF PROTEIN ENCODED BY GENE NO; 77 
This gene is expressed primarily in kidney medulla. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
5 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, metabolic and renal disorders. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 

10 the above tissues or cells, particularly of the metabolic and renal systems, expression of 
this gene at significantly higher or lower levels may be routinely detected in certain 
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 

15 the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that the protein products of this gene are useful 
for study, treatment and diagnosis of metabolic and renal diseases and disorders. 

20 FEATURES OF PROTEIN ENCODED BY GENE NO: 78 

This gene is expressed in chronic synovitis and nnicrovascular endothelium. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

25 not limited to, arthritis and atherosclerosis. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the vascular and skeletal systems, expression 
of this gene at significantly higher or lower levels may be routinely detected m certain 

30 tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not havmg the 
disorder. 

35 The tissue distribution indicates that the protein products of this gene are usefid 

for study, diagnosis and treatment of arthritic and other inflammatory diseases as well 
as cardiovascular diseases. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 79 

This gene is expressed in resting T-cells arid activated monocytes. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
5 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, inmiune disorders. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
10 tissues or cells, particularly of the immune system, expression of this gene at 

significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
1 5 in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that the protein products of tiiis gene are useful 
for the study and treatment of inunune diseases such as inflammatory conditions. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 80 

20 This gene is expressed in a variety of inmiune system tissues, e.g., neutrophils, 

T-cells, and TNF induced epithelial and endothelial cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

25 not limited to, infectious and immune disorders. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the immune and vascular systems, expression 
of diis gene at significanUy higher or lower levels may be routinely detected in certain 

30 tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or anotiier tissue or cell sample taken from an 
individual having such a disorder, relative to tiie standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid firom an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 

35 NO: 3 13 as residues: Met-1 to Trp-6. 

The tissue distribution indicates that die protein products of this gene are useful 
for study and treatment of infectious diseases, inunune and vascular disorders. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 81 
This gene is expressed in activated neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
5 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, inflammation and other immune conditions. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
10 of the above tissues,or cells, particularly of the immune system, expression of this gene 
at significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
1 5 in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that the protein products of this gene are useful 
for study and treatment of immune disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 82 

20 This gene is expressed in neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, inflammatory and other immune conditions. Similarly, polypeptides and 

25 antibodies dkected to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the immune system, expression of this gene 
at significantiy higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g.. serum, plasma, urine, synovial 

30 fluid or spinal fluid) or another tissue or cell sample taken fix)m an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healtiiy tissue or bodily fluid from an individual not having tiie disorder. Preferred 
epitopes include tiiose comprising a sequence shown in SEQ ID NO: 315 as residues: 
Ala-83toThr^91. 

35 The tissue distribution indicates that the protein products of this gene are useful 

for study and treatment of immune disorders. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 83 
This gene is expressed in human neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
5 reagents for differential identification of the tissue(s) or cell type(s) present m a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, inflammation and immune disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing inmiunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 

10 of the above tissues or cells, particularly of the immune and inflammatory system, 

expression of this gene.at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 

15 the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that the protein products of this gene are useful 
for diagnosis and treatment of disorders of the inflammatory and immune systems. 

20 FEATURES OF PROTEIN ENCODED BY GENE NO: 84 

This gene is expressed in human neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

25 not limited to, disorders of the inflammatory and immune systems. Similarly, 

polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the inflammatoiy and 
inmiune systems, expression of this gene at significantly higher or lower levels may be 

30 routinely detected in ccrtam tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to die standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

35 The tissue distribution indicates that the protein products of this gene are useful 

for diagnosis and treatment of disorders of the immune and inflammatoiy systems. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 85 
This gene is expressed in activated neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
5 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, inflammation and immune system diseases. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing inununological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 

10 of the above tissues or cells, particularly of the immune system and inflammatory 

system, expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues (e,g., cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken fi-om an individual having such a disorder, relative to the standard gene 

15 expression level, i.e., the expression level in healthy tissue or bodily fluid from an , 
individual not having the disorder. 

The tissue distribution indicates that the protein products of this gene are useful 
for diagnosis and treatment of diseases of the inflammatory and inunune systems. 

20 FEATURES OF PROTEIN ENCODED BY GENE NO: 86 

This gene is expressed in activated neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

25 not limited to, inflammation and immune system disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the inflammatory and inunune system, 
expression of this gene at significantly higher or lower levels may be routinely detected 

30 in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes mclude those comprising a sequence shown in SEQ ID 

35 NO: 319 as residues: Met-1 to Gly-6, Gly-32 to Pro-43, Leu-55 to Gln-60. 

The tissue distribution indicates that the protein products of this gene are useful 
for diagnosis and treatment of disorders of the inunune and inflammatory system. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 87 

In specific embodiments, polypeptides of the invention comprise the sequence: 
EQVLALLWPRFELILEMNVQSVRSTDPQRLGGLDTRPHYITRRYAEFSSALVSIN 
QTPNERTMQLLGQLQVEVENFVLRVAAEFSSRKEQLVFLINNYDMNILGVLME 
RAADDSKEVESFQQLLNARTQEFIEELLSPPFGGLVAFVKEAEALIERGQAERLR 
GEEARVTQLIRGFGSSWKSSVESLSQDVMRSFTNFRNGTSIIQG (SEQ ID 
NO:541),ALLKYRFFYQFLLGNERATAKEIRDEYVETLSKIYLSYYRSYLGRLMK 
VQYEEVAEKDDLMGVEDTAKKGFXSKPSRSRNTIFTLGTRGSVISPTELEAPILV 
PHTAQR (SEQ ID NO: 542); EQRYPFEALFRSQHYXLLDNSCREYLnCEFFWS 
GPXAHDUTIAVMGRTLSMTLKHLDSYLADCYDAIAVFLCIHIVLRFRNIAAKRD 
VPALDRYW (SEQ ID NO:543),GGLDTRPHYITRRYAEFSSALVSINQ (SEQ ID 
NO:544); SRKEQLVFLINNYDMMLGVL (SEQ ID NO: 545) and/or ALLKYRFFY 
QFIXGNERATAKEIRDEYVEn^KIYLSYYRSYLGRLMKVQYEEVAEKDDLMG 
VEDTAKKGFXSKPSLRSRNTIFTLGTRGSVISPTELEAPILVPHTAQRXEQRYPF 
EAUTlSQHYXLLDNSCREYLHCEFFWSGPXAHDLFHAVMGRTLSNm.KHLD 
SYLADCmAL\VFmHIVlJlFRNIAAKRDWAlI)RYWEQVI^^ 
NVQSWSTDPQRLGGUDTRPHYITRRYAEFSSALVSINQTIPI^RTMQUXjQ 
EVENFVUIVAAEFSSRKEQLVFLINNYDMMLGVIJ^IERAADDSKEVESFQQL^ 
ARTQEFIEELLSPPFGGLVAFVKEAEALIERGQAERLRGEEARVTQLIRGFGSSW 
KSSVESLSQDVMRSFTNFRNGTS (SEQ ID NO:546). PolynucleoUdes encoding 
these polypeptides are also encompassed by the invention. The translation product of 
this gene shares sequence homology with suppressor of actin mutation which is thought 
to be important in mutation suppression. 

This gene is expressed primarily in fetal liver and to a lesser extent in a variety 
of other tissues. 

' Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, liver and mutations. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological pr<*es for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the liver or cancer, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
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in healthy tissue or bodily fluid fiom an individual not having the disorder. Preferred 
epitopes include those comprising a sequence shown in SEQ ID NO: 320 as residues: 
Val-53 to Arg-60, Thr-88 to Thr-94, AIa-142 to Ser-150, Gly-188 to Glu-196, Gly- 
208 to Ser-214. Thr-227 to Gly-232, Lys-279 to Phe-285. 
5 The tissue distribution and homology to suppressor of actin mutation suggest 

that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis and of liver disorder or cancer. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 88 

10 This gene maps to chromosome 9, and therefore can be used in linkage analysis 

as a marker for chromosome 9. In specific embodiments, polypeptides of the invention 
comprise the sequence: 

YEGKEFDYWSroVl^GPSYKLPYNTSDDPWLTAYNFLQKNDLNPMFLDQVA 
KFIIDNTKGQMLGLGNPSFSDPFTGGGRYVPGSSGSSNTLPTADPFTGAGRYV 

15 PGSASMGTTMAGVDPFTGNSAYRSAASKTMNIYFPKKEAVTFDQANPTQILGK 
LKELNGTAPEEKKLTEDDLILLEKILSUCNSSSEKPTVQQLQILWKAINCPEDIV 
FPALDILRLSKHPSVNENFCNEKEGAQFSSHUNLLNPKGKPANQLLALRTFC 
NCFVGQAGQKLMMSQRESLMSHAIELKSGSNKNI (SEQ ID NO: 547); 
HIALAH.ALNYSVCFHKD (SEQ id NO: 548); HNIEGKAQCLSLISTILEWQ 

20 DLEATFRLLVALGTLISDDSNAVQLAKS (SEQ ID NO:549); LGVDSQIKKYSS 

VSEPAKVSECCRFILNLL (SEQ ID NO: 550); and/or YEGKEFDYVFSIDVNEGGPS 
YKLPYNTSDDPWLTAYNFLQKNDLNPMFLDQVAKFIIDNTKGQMLGLGNPSFS 
DPFTGGGRYVPGSSGSSNTLPTADPFTGAGRYVPGSASMGTTMAGVDPFTGN 
SAYRSAASKTMNrm>KKEAVTFDQANPTQILGKLKELNGTAPEEKKLTEDDLI 

25 LLEKILSLICNSSSEKPTVQQLQILWKAINCPEDIVFPALDILRLSIKHPSVNENFC 
NEKEGAQFSSHLINLLNPKGKPANQLLALRTFCNCFVGQAGQKLMMSQRESL 
MSHAIELKSGSNKNIHIALATLAIJSfYSVCFHKDHNIEGKAQCLSLISTI^ 
LEATFRLLVALGTUSDDSNAVQLAKSLGVDS(3IKKYSSVSEPAKVSECCRFILN 
LL (SEQ ID NO:551). Polynucleotides encoding these polypeptides are also 

30 encompassed by the invention. These polypeptides share significant homology with 
phospholipase A2 activating protein which is thought to be important in signal 
transduction (see, e.g., Wang et al.. Gene 161(2):237-241 (1995)). 

This gene is expressed primarily in endothelial cells, to a less extent in placenta, 
endometrial stromal cells, osteosarcoma, testis tumor, muscle, and infant brain that are 

35 likely to be rich in blood vessles. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
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biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, disorders in vascular system, aberrent angiogenesis, tumor angiogenesis. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
5 type(s). For a number of disorders of the above tissues or cells, particularly of the 
vascular system or tumors, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 

10 the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution of this gene in endothelial cells and several potential 
highly vascularized tissues and its homology to phospholipase A2 activating protein 
suggest that this gene may be involved in transducing signals for endothelial cells in 

15 angiogenesis or vasculogenesis. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 89 

In specific embodiments, polypeptides of the invention comprise the sequence: 
YPNQDGDIUIDQVLHEHIQR1^K\^ANHRALQIPEVYUI^ 

20 AYKTPRDKVQCILRMCSTIMNLl^LANEDSVPGADDFWVLVFVLIK^ 

STVQYISSFYASCLSGEESYWWMQFTAAVE (SEQ ID NO:552); YPNQDGDILR 

DQ\a.HEHIQRLSKVVTANHRALQIPEVYLREAPWPSAQSEIRTISAYKTPRDKVQ 

CILRMCSTIMNLLSLANEDSVPGADDFVPVLVFVLIKANPP^^ 

SCLSGEES YWWMQFTAAVEHKTI (SEQ ID NO:553); YPNQDGDILRDQVL (SEQ 

25 ID NO:554); EAPWPSAQSEI (SEQ ID NO:555); PVLVFVLIKANP (SEQ ID 
NO:560); SGEESYWWMQFTAAVEFIKTI (SEQ ID NO:556); ADDFVPVLVF 
VUKANPP (SEQ ID NO:557); YKTPRDKVQCIL (SEQ ID NO:558); and/or 
GADDFVPVLVFVLIK (SEQ ID NO:559). The translation product of this gene shares 
sequence homology with human ras mhibitor and yeast VPS9p which is thought to be 

30 important in golgi vacuole transport 

This gene is expressed primarily in T cells and melanocytes and to a lesser 
extent in a variety of other tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

35 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, dysfunction and disorders involving T cells and melanocytes. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
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immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune system, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e-g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
5 plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fiuid from an individual not having the 
disorder. 

The tissue distribution and homology to ras inhibitor indicates that 
10 polynucleotides and polypeptides corresponding to this gene are useful for regulating 
signal u-ansduction; diagnosis and treatment of disorders involving T cells and 
melanocytes. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 90 
1 5 This gene maps to chromosome 9 and therefore polypeptides of the invention 

can be used in linkage analysis as a marker for chromosome 9. The translation product 
of this gene shares sequence homology with neuronal olfactomedin-related ER localized 
protein which is thought to be important in influence the maintenance, growth, or 
differentiation of chemosensory cilia on the apical dendrites of olfactory neurons. In 
20 specific embodiments, polypeptides of the invention comprise the sequence: 

SARASTQPPAGQHPGPC (SEQ ID NO:561); MPGRWRWQRDMHPARKLLSLL 
FLILMGTELTQD (SEQ ID NO:562); S AAPDSLLRSSKGSTRGSL (SEQ ID 
NO:563); AAIVIWRGKSESRIAKTPGI (SEQ ID NO:564); FRGGGTLVLPPTHT 
PEWLIL (SEQ ID NO:567); PLGITLPLGAPETGGGD (SEQ ID NO:565); and/or 
25 CAAETWKGSQRAGQLCALLA (SEQ ID NO:566). 
This gene is expressed in pineal gland. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or ceU type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

30 not limited to, neurological and endocrinological disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the neurological or endocrine systems, 
expression of this gene at significantly higher or lower levels may be routinely detected 

35 in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e.. 
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the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 323 as residues: Leu-20 to Ala-26, Arg-32 to Arg-39, Thr-104 to Gly-112. 

The tissue distribution and homology to olfactomedin-related protein indicates 
that polynucleotides and polypeptides corresponding to this gene are useful for 
maintenance, growth, or differentiation of neuron cells in pineal gland, therefore, may 
be useful for diagnosis and treatment of neurological disorders in pineal gland. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 91 

This gene is expressed primarily in prostate and apoptotic T cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, prostate disease and T cell dysfunction. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the prostate cancer, expression of this gene 
at significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue disttibution indicates tiiat polynucleotides and polypeptides 
corresponding to this gene are useful for detect abnormal activity in prostate and T cells 
or probably treatment of this abnormality. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 92 

TTiis gene is expressed primarily in prostate and to a lesser extent in smooth 
muscle cells, fibroblasts, and placenta. 

Therefore, polynucleotides and polypeptides of tiie invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which mclude, but are 
not limited to, disorders in prostate or vascular system. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the prosate or vascular system, expression 
of tiiis gene at significantiy higher or lower levels may be routinely detected in certain 
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tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, pdasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
5 disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for regulating function of prostate or highly 
vascularized tissues, e.g. placenta. 

10 FEATURES OF PROTEIN ENCODED BY GENE NO: 93 

This gene is expressed primarily in embryos and fetal tissues stage human and 
to a lesser extent in a wide variety of other proliferative tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of die tissue(s) or cell type(s) present in a 
15 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, disorders in embryonic development and cell proliferation. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the embryonic tissues 
20 and proliferative cells, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
25 fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis or treatnient of abnormalities in 
developing and proliferative cells and organs. 

30 FEATURES OF PROTEIN ENCODED BY GENE NO: 94 

The translation product of this gene shares sequence homology with 
transformation related protein which is thought to be important in transformation, 

This gene is expressed primarily in female reproductive tissues, i.e., breast 
cancer cells, placenta, and ovary and to a lesser extent in fetal lung. 
35 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
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not limited to, cancer or dysfunction of reproductive tissues. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing inununological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the reproduction system, 
5 expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 

10 disorder. Preferred epitopes mclude those comprising a sequence shown in SEQ ID 
NO: 327 as residues: Ser-50 to Pro-61. 

The tissue distribution and homology to transformation related protein indicates 
that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis and treatment of conditions caused by transformation, i.e. tumorigenesis in 

15 reproductive organs, e.g. breast, placenta, and ovary. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 95 

This gene is expressed primarily in testes, rhabdomyosarcoma, infant brain and 
to a lesser extent in some tumors and highly vascularized tissues. 

20 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, tumorigenesis, abnormal angiogenesis, and/or neurological disorders. , 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 

25 providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
tumor tissues or vascular tissues, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 

30 another tissue or cell sample taken from an individual havmg such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include those 
comprising a sequence shown in SEQ ID NO: 328 as residues: Arg-46 to Trp-54, Pro- 
60 to IIe-69, Asn-1 16 to Ala-122, Arg-147 to Lys-153, Ser-158 to Glu-170, IIe-399 to 

35 Ser-405, Pro-486 to Met-499, Pro-502 to Asp-508. 

The tissue distribution indicates that polynucleotides and polypeptides 
correspondmg to this gene are useful for a range of disease states including treatment of 
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tumor or vascular disorders and the treatment of neurological disorders such as 
Alzheimer's Disease, Parkinson's Disease, Huntingtons Disease, schizophrenia, mania, 
dementia, paranoia, obsessive compulsive disorder and panic disorder. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 96 

This gene maps to chromosome 7 and therefore polynucleotides of the present 
invention can be used in linkage analysis as a marker for chromosome 7. The 
translation product of this gene is homologous to the Clostridium perfringens 
enterotoxin (CPE) receptor gene product and shares sequence homology with a human 
ORF specific to prostate and a glycoprotein specific to oligodendrocytes both of which 
are tissue specific proteins.(See e.g., Katahira et al., J Cell Biol. 136(6): 1239-1247 
(1997). PMID: 9087440; UI: 97242441. 

This gene is expressed primarily in pancreas tumor and ulcerative colitis and to a 
lesser extent in several tumors and normal tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, pancreatic disorder, ulcerative colitis, tumors and food poisoning. 
Similarly, polypeptides and antibodies duected to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
digestive system or tumorigenic system, expression of this gene at significantly higher 
or lower levels may be routinely detected in certain tissues (e.g., cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. Preferred epitopes include 
those comprising a sequence shown in SEQ ID NO: 329 as residues: Gly-147 to Met- 
152,Cys-177toLys-188. 

The tissue distribution and homology to prostate and oligodendrocyte-specific 
protein indicates that polynucleotides and polypeptides corresponding to this gene are 
useful for marker of diagnosis or treatment of disorder in pancreas, ulcerative colitis, 
and tumors. Furthermore, identity to the human receptor for Clostridium perfringenes 
entertoxin indicates that the soluble portion of this receptor could be used in the 
treatment of food poisoning associated with Clostridia perfiingens by blocking the 
activity of perfringens enterotoxin. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 97 

The translation product of this gene shares sequence homology with ATPase 
which is thought to be important in metabolism. 
5 This gene is expressed primarily in testes and several hematopoietic cells and to 

a lesser extent in other tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
10 not limited to, leukemia and hematopoietic disorders. Similarly, polypeptides and 

antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the hematopoietic system, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 
15 (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
20 NO: 330 as residues: Leu-37 to Ala-42. 

The tissue distribution and homology to ATPase indicates tiiat polynucleotides 
and polypeptides corresponding to this gene are useful for marker of diagnosis and 
treatment of leukemia and other hematopoietic disorders. 



25 FEATURES OF PROTEIN ENCODED BY GENE NO: 98 

In specific embodiments, polypeptides of the invention comprise the sequence: 
MRSARPSLGCLPSWAFSQALNI (SEQ ID NO:568); LLGLKGLAPAEIS AVCE 
KGNFN (SEQ ID NO:569); VAHGLAWS YYIGYLRLILPELQARIR (SEQ ID 
NO:570); TYNQHYNNLLRGAVSQRC (SEQ ID NO:571); ILLPLDCGVPDNLSM 

30 ADPNIRFLDKLPQQTGDRAGIKDRVYSN (SEQ ID NO:572); SIYELLENGQRAGT 
CVLEYATPLQITJAMSQYSQAGFSGEDRLEQ (SEQ ID NO:573); AKLFCRTLE 
DEADAPESQlWaU-IAYQEPADDSSFSLSQEVLRHLRQEEKEEVW 
PSTSTMSQEPELUSGMEKPLPLRTDFS (SEQ ID NO:574); and/or LLGLKGLA 
PAEIS AVCEKGNFNVAHGLAWS YYIGYLRLILPEL (SEQ ID NO:575). 

35 Polynucleotides encoding these polypeptides are also encompassed by the invention. 

This gene is expressed primarily in prostate BPH and to a lesser extent in bone 
marrow. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, benign prostatic hypertrophy or prostate cancer. Similarly, polypeptides 
5 and antibodies directed to these polypeptides are useful in providing inrununological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the male urinary system, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 

10 plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder; Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 331 as residues: Ile-60 to Asn-69, Leu-106 to Asp-1 12, Glu-130 to Gly-136, Phe- 

15 160 to Glu-167, Pro-184 to Cys-190, Glu-197 to Ser-202, Arg-215 to Glu-221, Thr- 
237 to Pro-242. . 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis or treatment of benign prostatic 
hypertrophy or prostate cancer. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 99 

This gene is expressed primarily in salivary gland. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

25 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, disorders or injuries of the salivary gland. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of glandular tissues, expression of this gene at 

30 significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken fi-om an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

35 The tissue distribution indicates that polynucleotides and polypeptides r 

corresponding to tiiis gene are useful for treatment of disorders of, or injuries to the 
salivary gland or other glandular tissue. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 100 

This gene maps to chromosome 15, accordingly, polynucleotides of the 
invention can be used in linkage analysis as a marker for chromosome 15. The 
5 translation product of this gene shares sequence homology with a Celegans gene of 
unknown function. In specific embodiments, polypeptides of the invention comprise 
the sequence: DPRVRLNSLTCKHIHSLTQ (SEQ ID NO:583); TMKLLKLRRNIV 
KLSLYRHFTN (SEQ ID NO:576); TLILAVAASIVHIWTTMKFRI (SEQ ID 
NO:577); \nrCQSDWRELWVDDAIWRLLFSMILFVI (SEQ ID NO:578); MVLWR 

10 PSANNQRFAFSPLSEEEEEDEQ (SEQ ID NO:580); KEPMLKESFEGMKMRS 

TKQEPNGNSKVNKAQEDDL (SEQ ID NO:584); and/or KWVEENVPSSVTDVALP 
ALLDSDEERMTTHFERSKME (SEQ ID NO:582). Polynucleotides encoding these 
polypeptides are also encompassed by the invention. 

This gene is expressed primarily in thyroid and to a lesser extent in 

1 5 osteoclastoma, kidney medulla, and lung. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, thyroid dysfunction or cancer. Similarly, polypeptides and antibodies 

20 directed to these polypeptides are useful in providing inununological probes for 

differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of die endocrine system, expression of this gene 
at significantiy higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 

25 fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to Uie standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred 
epitopes include those comprising a sequence shown in SEQ ID NO: 333 as residues: 
Lys-107 to Leu-124, Glu-150 to Thr-159, Pro-173 to Asp-179, Ser-192 to Ser-201. 

30 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for diagnosis and treatment of thyroid dysfunction 
or cancer. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 101 
35 This gene maps to chromosome 1 6, therefore polynucleotides of the invention 

can be used in linkage analysis as a marker for chromosome 16. In specific 
embodiments, polypeptides of the invention comprise the sequence: 



wo 98/54963 



PCTAJS98/11422 



IRHELTVLRDTRPACA (SEQ ID NO:585); and/or MDFXMAUYD (SEQ U) 
NO:586). Polynucleotides encoding these polypeptides are also encompassed by the 
invention. 

This gene is expressed primarily in kidney cortex and to a lesser extent in adult 
5 brain, corpus colosum, hippocampus, and frontal cortex. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, neurological disorders. Similarly, polypeptides and antibodies directed to 
10 these polypeptides are useftil in providing immunological probes for differential 

identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the cenu-al nervous system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
15 fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment or diagnosis of neurological 
20 disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 102 

In specific embodiments, polypeptides of the invention comprise the sequence: 
MQEMMRNQDRALSNLESIPGGYNA (SEQ ID NO:587); LRRMYTDIQEPMLSA 

25 AQEQF GGNPF (SEQ ID NO:588); ASLVSNTSSGEGSQPSRTENRDPLPNPWAP 
QT (SEQ ID NO:589); SQSSSASSGTASTVGGTTGSTASGTSGQSTTAPNLVPGV 
GASMFNTPG MQSLLQQITENPQLMQNMLSAPY (SEQ ID NO:590); 
MRSMMQSLSQNPDLAAQMMUWPIJAGNPQLQEQMRQQLPT^ (SEQ ID 
NO:591); MQNPDTl^AMSI^RAMQAIiQIQQGLQTLATEAPGUPGFTPG 

30 ALGSTGGSSGTNGSNATPSENTSPTAGT (SEQ ID NO:592); TEPGHQQH 
QQMLQAIAGVNPQLQNPEVRFQQQLEQI^AMGFU^RE^ 
DERLLGSQPS (SEQ ID NO:593); RNPAMMQEMMRNQDRALSNLESIPGGY 
NALRRMYTDIQEPMLS AA (SEQ ID NO:594); GNPFASLVSNTSS (SEQ ID 
NO:595); ENRDPLPNPWA (SEQ ID NO:595); GKILKDQDTLSQHGIHD (SEQ ID 

35 NO:597); GLTVHLVIKTQNRP (SEQ ID NO:598); SELQSQMQRQLLSNPEMM 
(SEQ ID NO:599); PEISHMLNNPDIMR (SEQ ID NO:600); and/or 
RQLIMANPQMQQUQRNP (SEQ ID NO:601). Polynucleotides encoding these 
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polypeptides are also encompassed by the invention. 
This gene is expressed primarily in breast. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
5 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, breast cancer. Similarly, polypeptides and antibodies directed to these 
_ polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of tumor systems, expression of this gene at significantly higher or lower 

10 levels may be routinely detected in certain tissues (e.g., cancerous and wounded 

tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i,e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

1 5 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for treatment and diagnosis of some types of 
breast cancer. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 103 

20 The translation product of tiiis gene shares sequence homology with secreted 

serine proteases and lysozyme C precursor, which is thought to be important in 
bacteriolytic function. In specific embodiments, polypeptides of the invention comprise 
the sequence: NLCHVDCQDLLNPNLLAGIHCAKRIVS (SEQ ID NO:602); 
LDGFEGYSLSDWLCLAFVESKFN (SEQ ID NO:603); 

25 NENADGSFDYGLFQINSHYWC:N (SEQ ID N0:6(H); and/or 

NLCHVDCQDLLNPNLLAGIHCAKRIVS (SEQ ID NO:605). Polynucleotides 
encoding these polypeptides arc also encompassed by the invention. 
This gene is expressed primarily in testes. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
30 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, infection. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful m providing inmiunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
35 particularly of the immune system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certam tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
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another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include those 
comprising a sequence shown m SEQ ID NO: 336 as residues: Ile-62 to Phe-70, Asn- 
5 78toAsn-84. 

The tissue disu-ibution and homology to lysozyme C precursor indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for boosting the 
moncyte-macrophage system and enhance the activity of immunoagents. 

10 FEATURES OF PROTEIN ENCODED BY GENE NO: 104 
This gene is expressed primarily in apoptotic T-cell. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
15 not limited to, inMnune disorders. Similarly, polypeptides and antibodies directed to 
these polypeptides are xisefiil in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above ' 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
20 cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
25 corresponding to this gene are usefiil for treatment and diagnosis of some immune 
disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO; 105 

The translation product of this gene shares sequence homology with ARI 

30 protein of Drosophila (accession 2058299; EMBL: locus DMARIADNE, accession 

X98309), which is thought to be important in axonal path-finding in the cenu-al nervous 
system. In specific embodiments, polypeptides of the invention comprise the sequence 
IREVNEVIQNPAT (SEQ ID NO:606); ITRIIJLSHFNWDKEKLMERYF 
DGNLEKLFA (SEQ ID NO:607); NTRSSAQDMPCQICYLNYPNSYF (SEQ ID 

35 NO:608); TGLECGHKFCMQCWSEYLTTKIMEEGMGQTISCPAHG (SEQ ID 
NO:614); CDILVDDNTVMRLITDSKVKLKYQHLrmSFVECNRLLKWCPAPD 
CHHWKVQYPDAKPV (SEQ ID NO:609); CDILXnDDNTVMRLITDSK 
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VKLKYQHLITNSFVECNRLLKWCPAPDCHHVVKV (SEQ ID NO:610); 
GCNHMVCRNQNCKAEFCWVCLGPWEPHGSAWYNCNRY^ 
RSRAALQRYL (SEQ ID N0:61 1); FYCNRYMNHMQSLRFEHKLYAQVKQ 
KMEEMQQHNMSWIEVQFLKKAVDVLCQCRATLMYT (SEQ ID NO: 612); 
YWAFYLKKNNQSnFENNQADLENATEVLSGYLERDISQDSL^^ 
RYCESR (SEQ ID NO:613) Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

This gene is expressed primarily in adult brain, and to a lesser extent in 
endometrial tumor, melanocytes, and infant brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, diseases or injuries involving axonal path development. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the central nervous 
system, expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to ARI protein indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for treatment of 
disease states or injuries involving axonal path development, including 
neurodegenerative diseases and nerve injury. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 106 

The translation product of this gene shares sequence homology with cytochrome 
b561 [Sus scrofa] which is thought to be an integral membrane protein of 
neuroendocrine storage vesicles of neurotransmitters and peptide hormones. 

This gene is expressed primarily in frontal cortex and to a lesser extent in 
rhabdomyosarcoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, neurological disorders. Similarly, polypeptides and antibodies directed to 
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these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the central nervous system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
5 cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred 
epitopes include those comprising a sequence shown in SEQ ID NO: 339 as residues: 
10 Ser-18 toPro-24. 

The tissue distribution and homology to cytochrome b56I [Sus scrofa] indicates 
that polynucleotides and polypeptides corresponding to this gene are useful for 
treatment and diagnosis of neurological disorders. This gene may also be important in 
regulation of some types of cancers. 

15 

FEATURES OF PROTEIN ENCODED BY GENE NO: 107 

In specific embodiments, polypeptides of the invention comprise the sequence: 
MWGYLFVDAAWNFLGCLICGW (SEQ ID NO:615); MHHSSGNVSAIRSSILLL 
RXSLSYLGNCLRVSAIFVYFLLFLLLS (SEQ ID NO:616); and/or MDQALRGSPSE 
20 GFSTDPSPPQVGRQIPSFPPWRRLVLPKASGCFLEREWWLCVFKLRTRPGAEA 
HAYNSSILGGRGKGIT (SEQ ID NO:617). Polynucleotides encoding these 
polypeptides are also encompassed by the invention. 

This gene is expressed primarily in pancreas tumor and to a lesser extent in 
cerebellum. 

25 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, pancreatic tumors. Similarly, polypeptides and antibodies directed to 
these polypeptides are usefiil in providing immunological probes for differential 

30 identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the endocrine system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual havmg 

35 such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred 
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epitopes include those comprising a sequence shown in SEQ ID NO: 340 as residues: 
Pro-22 to Phe-33. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of pancreatic tumors. 

5 

FEATURES OF PROTEIN ENCODED BY GENE NO: 108 

This gene maps to chromosome 17 and therefore polynucleotides of the 
invention can be used in linkage analysis as a marker for chromosome 17. In specific 
embodiments, polypeptides of the invention comprise the sequence: 
10 MLPALASCCHFSPPEQAARLKKLQEQEKQQKVEFRKRMEKEVSDFIQDSGQIK 
KKFQPMNKIERSILHDVVEVAGLTSFSFGEDDDCRYVMIFKKEFAPSDEELDSY 
RRGEEWDPQKAEEKRNXKELAQRQ (SEQ ID NO:618); EEEAAQQGPVW 

SPASDYKDKYSHUGKGAAKDAAHMLQANKTYGCXPVANKRDTRSIEEAM]^ 
IRAKKRLRQSGE (SEQ ID NO:619); PPRRPAQLPLTPGAGQGAGRDKAAAIRA 
15 HPGAPPLNHLLP (SEQ IDNO:620); AVPQAGGKQVFDLSPLELGYVRGMCVCV 
(SEQ ID NO:621) and/or MLPALASCCHFSPPEQAARLKKLQEQEKQQKVEFRK 

rmekevsdhqdsgqikkkfqpmnkiersilhdvvevagltsfsfgedddcryv 
mifkkefapsdeeldsyrrgeewdpqkaeekrnxkeiaqrqeeeaaqqgpw^ 
spasdykdkyshligkgaakdaahmlqanktygcxpvankrdtrsiee;^^ 

20 IRAKKRLRQSGE (SEQ ID NO:622). Polynucleotides encoding these polypeptides 
are also encompassed by the invention. The translation product of this gene shares 
sequence homology with FSA-1 which may play a role as a structural protein 
component of the acrosome. 

This gene is expressed primarily in fetal kidney and sperm. 

25 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, male reproductive disorders, especially involving acrosomal disfunction. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 

30 providing immunological probes for differential identification of the tissue(s) or cell 

type(s). For a number of disorders of the above tissues or cells, particularly of the male 
reproductive system, expression of tiiis gene at significantiy higher or lower levels may 
be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or anoflier tissue or 

35 cell sample taken from an individual having such a disorder, relative to the standard 

gene expression level, Le., the expression level in healtiiy tissue or bodily fluid from an 
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individual not having the disorder. Preferred epitopes include those comprising a 
sequence shown in SEQ ID NO: 341 as residues: Glu-8 to Asn-35. 

The tissue distribution and homology to FSA-1 indicates that polynucleotides 
and polypeptides corresponding to this gene are useful for treatment of infertility due to 
5 acrosomal disfunction of sperm. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 109 

This gene is expressed primarily in pituitary and to a lesser extent in 
epididymus. 

10 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, male reproductive disorders. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing inununological probes for 

1 5 differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the male reproductive system, expression of 
this gene at significantly higher or lower levels may be routinely detected in certain 
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 

20 individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 342 as residues: Met-1 to Trp-6. 

Because the gene is found in both pituitary and epididymus, this indicates that 

25 polynucleotides and polypeptides corresponding to this gene are useful for U:eatment 
and diagnosis of male reproductive disorders. This may involve a secreted peptide 
produced in the pituitary targeting the epididymus. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 110 
30 In specific embodimeiits, polypeptides of the invention comprise the sequence: 

LLCPVLNSGXSWNFPHPSQPEYSFHGFHSTRLWI (SEQ ID NO:623); and/or 
PSTPWFLFLLGLTCPFSTSHPRWDSIPP (SEQ ID NO:624). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. 
This gene is expressed primarily in resting T-cells. . 
35 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
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not limited to, T-cell disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
5 lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
conresponding to this gene are useful for treatment and diagnosis of certain immune 
disorders, especially those involving T-cells. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 111 

This gene is expressed primarily m cerebellum and whole brain and to a lesser 
extent in infant brain and fetal kidney. 

Therefore, polynucleotides and polypeptides of the invention arc usefiil as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, neurological disorders. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of die cenu-al nervous system, expression of this gene at 
significantiy higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard ^ne expression level, i.e., die expression level 
in healtiiy tissue or bodily fluid from an individual not having the disorder. Preferred 
epitopes include diose comprising a sequence shown in SEQ ID NO: 344 as residues: 
Asp-48 to Gly-55. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and tiieatment of neurological 
disorders. 

35 FEATURES OF PROTEIN ENCODED BY GENE NO: 112 

The translation product of this gene shares sequence homology with yeast 
mitochondrial ribosomal protein homologous to ribosomal protein sl5 of E.coli which 
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is thought to be important in the early assembly of ribosomes (See Accession No. 
M38016). This gene maps to chromosome 1, and therefore, may be used as a marker 
in linkage analysis for chromosome 1. 

This gene is expressed primarily in developmental tissues. 
5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of die tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, development of cancers and tumors in addition to healing wounds. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 

1 0 providing inmiunological probes for differential identification of the tissue(s) or ceil 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune and developmental expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 

1 5 another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution and homology to ribosomalprotein sl5 of E. coli 
indicates that polynucleotides and polypeptides corresponding to this gene are useful for 

20 diseases related to the assembly of ribosomes in the mitochondria which is important in 
the translation of RNA into protein. Therefore, this indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosis and intervention of 
multiple tumors as well as in healing wounds which are thought to be under similar 
regulation as developmental tissues. Protein, as well as, antibodies directed against the 

25 protein have utility as tumor markers, in addition to immunotherapy targets, for the 
above listed timiors and tissues. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 113 

The translation product of this gene shares sequence homology with human 

30 poliovims receptor precursors which are thought to be important m viral binding and 
uptake. Preferred polypeptide fragments comprise the following amino acid sequence: 
ELSISISNVALADEGEYTCSIFTMPVRTAKSLVTVLGIPQKPIITGYKSSL^ 
ATLNCQSSGSKPAARLTWRKGDQELHGEPTRIQEDPNGKTFTVSSSWFQVTR 
EDDGASIVCSVNHESLKGADRSTSQRIEXTLYTPTAMRPDPPHPREGQKLLLHC 

35 EGRGNPVPQQYLWEKEGSVPPLKMTQESALIFPFLNKSDSGTYGCTATSNMGS 
YKAYYTLNVND (SEQ ID NO:625). Also preferred are polynucleotide fragments 
encoding these polypeptide fragments (See Accession No. gnllPIDIdl002627). 
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This gene is expressed almost exclusively in human brain tissue. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
5 not limited to, susceptibility to viral disease and diseases of the CNS especially cancers 
of that system. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the central nervous system, expression of this gene at significantly higher or lower 

10 levels may be routinely detected in certain tissues (e.g., cancerous and wounded 

tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression leviel in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include those 

15 comprising a sequence shown in SEQ ID NO: 346 as residues: Leu-26 to Asp-37, Lys- 
53toSer-59, 

The tissue distribution and homology to poliovirus receptor precursors indicates 
that polynucleotides and polypeptides corresponding to this gene are useful for the 
treatment and prevention of diseases that involve the binding and uptake of virus 
20 particles for infection. It might also be helpful in genetic therapy where the goal is to 
insert foreign DNA into infected cells. With the help of this protein, the binding and 
uptake of this foreign DNA might be aided. In addition, it is expected that over 
expression of this gene will indicate abnormalities involving the CNS,^particularly 
cancers of that system. 

25 

FEATURES OF PROTEIN ENCODED BY GENE NO: 114 

The translation product of this gene shares sequence homology with 
Y087_CAEEL hypothetical 28.5 KD protein ZK1236.7 in chromosome HI of 
Caenoriiabditis elegans in addition to alpha- 1 collagen type III (See Accession No. 

30 gil537432). One embodiment for this gene is die polypeptide fragment(s) comprising 
the following amino acid sequence: VPELPDRVHQLHQAVQGCALGRPGFPGGPTH 
SGHHKSHPGPAGGDYNRCDRPGQVHLHNPRGTGRRGQLHPTAGPGVHRRA 
CPSQQLPHRLGPGVPCPSPSLTPVLPSWTQSWCG LPGYTSSS (SEQ ID 
NO:630). An additional embodhnent is the polynucleotide fragment(s) encoding these 

35 polypeptide fragments 

This gene is expressed primarily in brain cells and to a lesser extent in activated 
B and T cells. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not liniited to, neurodegeneration and imunological disorders. Similarly, polypeptides 
5 and antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the neural and immune systems, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 

10 plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from . 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 347 as residues: Glu-34 to Glu-39, Gly-51 to Ser-72, Ala-88 to Glu-93, Gln-100 

15 to Val-105. 

The tissue distribution and homology to Y087_CAEEL hypothetical 28.5 KD 
protein ZK1236,7 in chromosome HI of Caenorhabditis elegans as well as to a 
conserved alpha- 1 collagen type m protein indicates that pplynucleotides and 
polypeptides corresponding to this gene are useful for the detection and treatment of 

20 neurodegenerative disease states and behavioral disorders such as Alzheimer's Disease, 
Parkinson's Disease, Huntingtons' Disease, schizophrenia, mania, dementia, paranoia, 
obsessive compulsive disorder and panic disorders. Because the gene is expressed in 
cells of lymphoid origin, the natural gene product may be involved in immune 
functions. Therefore it may be also used as an agent for immunological disorders 

25 including arthritis, asthma, immune deficiency diseases such as AIDS, and leukemia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 115 

The translation product of this gene shares sequence homology with alpha 3 
type DC collagen which is thought to be important in hyaline cartilage formation via its 

30 ability to uptake inorganic sulfate by cells (See Accession No. gil975657). One 

embodiment of this gene is the polypeptide fragment comprising the following amino 
acid sequence: SLRRPRSAAXQTLTTFLSSVSSASSSALPGSREPCDPRAPPPPR 
SGSAASCCSCCCSCPRRRAPlJiSPRGSKRRIRQREVVDLYNGMCLQGPAGVPG 
RDGSPGANGIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNY 

35 GroLGKIAECTFTKMRSNSALRVIJSGSLRLKay>IACC^ 

LPEAnYLDQGSPEMNSTINIHRTSSVEGLCEGIGAGLVDVAIWVGTCSDYPKG 
• DASTGWNSVSRIIIEELPK(SEQIDNO:634). An additional embodiment are the 
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polynucleotide fragments encoding this polypeptide fragment. 

This gene is expressed primarily in smooth muscle and to a lesser extent in 
synovial tissue. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
5 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, dwarfism, spinal deformation, and specific joint abnormalities as well as 
chondrodysplasias i.e., spondyloepiphyseal dysplasia congenita, familial osteoarthritis, 
Atelosteogenesis type II, metaphyseal chondrodysplasia type Schmid and autoimmune 

10 disorders . Similarly, polypeptides and antibodies directed to these polypeptides are 
usefiil in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the skeletal system, expression of this gene at significantly higher or lower levels may 
be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 

15 fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to alpha 3 type IX collagen indicates that 

20 polynucleotides and polypeptides corresponding to this gene are useful for the treatment 
and diagnosis of diseases associated with the mutation in this gene which leads to the 
many different types of chondrodysplasias. By the use of this product, the abnormal 
growth and development of bones of the limbs and spine could be routinely detected or 
treated in utero since the protein or muteins thereof could affect epithelial cells early in 

25 development and later the chondrocytes of the developing craniofacial structure. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 116 

The translation product of this gene shares sequence homology with retro virus- 
related reverse transcriptase which is thought to be important in viral replication. One 

30 embodiment for this gene is the polypeptide fragments comprising the following amino 
acid sequence: TKKENCRPASU\4NroTKILNKIIJ^NQ(SEQ^ An 
additional embodiment is the polynucleotide fragments encoding these polypeptide 
fragments (See Accession No. piriA25313IGNHULl). 

This gene is expressed primarily in human meningima. 

35 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
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not limited to, retroviral diseases such as AIDS, and possibly certain cancers due to 
transactivation of latent cell division genes. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
5 the above tissues or cells, particularly of the inunune system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 

10 in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to retrovirus-related reverse transcriptase 
indicates that polynucleotides and polypeptides corresponding to this gene are useful for 
the detection and treatment of diseases and maladies associated with retroviral infection 
since a functional reverse transcriptase (RT) or RT-like molecule is an integral 

1 5 component of the retroviral life cycle. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 117 

The translation product of this gene shares sequence homology with an 
unknown gene from C. elegans, as well as weak homolog with mammalian metaxin, a 

20 gene contiguous to both thrombospondin 3 and glucocerebrosidase, is known to be 
required for embryonic development. Preferred polypeptide fragments comprise the 
following amino acid sequence: MCNLPIKVVCRANAEYMSPSGKVPXXHVGNQ 
' WSELGPWQFVKAKGHSLSDGl£EVQKAEMKAYMELVNNMLLTAELYLQWC 
DEATVGXITHXRYGSPYPWPLXHILAYQKQWEVKRKXKAIGWGKKTLDQVLE 

25 DVDQCCQALSQRLGTQPYFFNKQPTELDALVFGHLYTILTTQLTNDELSEKVKN 
YSNLLAFCRRI EQHYFEDRGKGRLS (SEQ ID NO:641); MCNLPIKVVCRANAE 
YMSPSGKVPXXHVGNQVVSELGPIVQFVK (SEQ ID NO:642),. Also preferred are 
polynucleotide fragments encoding these polypeptide fragments (See Accession No. 
gill326108). 

30 This gene is expressed primarily in fetal tissues and to a lesser extent in 

hematopoietic cells and tissues, including spleen, monocytes, and T cells. 

Therefore, polynucleotides and polypeptides of the invention are usefid as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

35 not limited to, cancer; lymphoproliferative disorders; inflammation; chondrosarcoma, 
and Gaucher disease. Similarly, polypeptides and antibodies directed to tiiese 
polypeptides are useful in providing immunological probes for differential identification 
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of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the hematopoietic and embryonic systems, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
5 fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatment of cancer and other 

10 proliferative disorders. Expression in embryonic tissue and other cellular sources 

marked by proliferating cells indicates that this protein may play a role in the regulation 
or cellular division. Additionally, the expression in hematopoietic cells and tissues 
indicates that this protein may play a role m the proliferation, differentiation, and 
survival of hematopoietic cell lineages. Thus, this gene may be useful in the treatment 

15 of lymphoproliferative disorders, and in the maintenance and differentiation of various 
hematopoietic lineages from early hematopoietic stem and committed progenitor cells. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 118 

The translation product of this gene shares sequence homology with reverse 
20 transcriptase which is important in the synthesis of a cDNA chain from an RNA 
molecule, and is a method whereby the infecting RNA chains of retroviruses are 
transcribed into their DNA complements. One embodiment for this gene is the 
polypeptide fragment comprising the following amino acid sequence: 

MXXXNSHITimNVNGLNAPNERHRLANWlQSQDQVCCIQETHLTGRDTHRL 
25 KIKGWRKIYQANGKQKK (SEQ ID NO:647). An additional embodiment is the 
polynucleotide fragments comprising polynucleotides encoding these polypeptide 
fragments (See Accession No. gil2072964). 

This gene is expressed primarily in skin and to a lesser extent in neutrophils. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
30 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancer, hematopoietic disorders; inflanmiation; disorders of immune 
surveillance. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing inmiunological probes for differential identification of the tissue(s) 
35 or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the epidermis and/or hematopoietic system, expression of this gene at significantly 
higher or lower levels may be routinely detected in certain tissues (e.g., cancerous and 
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wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 
5 The tissue distribution and homology to reverse transcriptase indicates that 

polynucleotides and polypeptides corresponding to this gene are useful for cancer 
therapy. Expression in the skin also indicates that this gene is useful in wound healing 
and fibrosis. Expression by neutrophils also indicates that this gene product plays a role 
in inflammation and the control of immune surveillance (i.e. recognition of viral 
10 pathogens). Reverse transcriptase family members are also useful in the detection and 
treatment of AIDS. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 119 

The translation product of this gene shares sequence homology with reverse 
1 5 transcriptase which is important in the synthesis of a cDNA copy of an RN A molecule, 
and is a method whereby a retrovirus reverse-transcribes its genome into an inheritable 
DN A copy. 

This gene is expressed primarily in the frontal cortex of brain. 
Therefore, polynucleotides and polypeptides of the invention are useful as 

20 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, cancer and neurodegenerative disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 

25 of the above tissues or cells, particularly of the CNS and peripheral nervous system, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 

30 the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution and homology to reverse transcriptase suggest that this is 
useful in the treatment of cancer and ADDS. The expression in brain indicates that it 
plays a role in neurodegenerative disorders and in neural degeneration. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 120 

One embodiment of this gene has homology to a hypothetical protein in 
Schizosaccharomyces pombe (See Accession No. 2281980). Another embodiment for 
this gene is the polypeptide fragments comprising the following amino acid sequence: 
5 lYHLHSWIFFHFKRAFCMCFITMKVIHAHCSKLRKCXNAQISVFCn^ 
(SEQ ID NO:65 1 ). An additional embodiment is the polynucleotide fragments 
encoding these polypeptide fragments. This gene maps to chromosome 18, and 
therefore, may be used as a marker in linkage analysis for chromosome 18. 

This gene is expressed primarily in adult hypothalamus and to a lesser extent in 
10 infant brain. 

. Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, neurodegenerative disorders; endocrine function; and vertigo. Similarly, 

15 polypeptides and antibodies directed to these polypeptides are useful in providing 

inmiunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the brain, CNS and 
peripheral nervous system, expression of tiiis gene at significantiy higher or lower 
levels may be routinely detected in certain tissues (e.g., cancerous and wounded 

20 tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healtiiy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

25 corresponding to this gene are useful for the U-eatment and diagnosis of 

neurodegenerative disorders; diagnosis of tiunors of a brain or neuronal origin; 
treatments involving hormonal control of die entire body and of homeostasis, 
behavioral disorders, such as Alzheimer's Disease, Parkinson's Disease, Huntington's 
Disease, schizophrenia, mania, dementia, paranoia, obsessive compulsive disorder and 

30 panic disorder. In addition, the gene or gene product may also play a role in the 

treatment and/or detection of developmental disorders associated with the developing 
embryo. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 121 

35 The translation product of this gene shares sequence homology with the human 

IRLB protein which is thought to be important in binding to a c-myc promoter element 
and thus regulating its transcription (See Accession No. gil33969). This gene maps to 
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chromosome 1, and therefore, may be used as a marker in linkage analysis for 
chromosome 1, 

This gene is expressed primarily in brain and breast and to a lesser extent in a 
variety of hematopoietic tissues and cells. 
5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, cancer of the brain and breast; lymphoproliferative disorders; 
neurodegenerative diseases. Similarly, polypeptides and antibodies directed to these 

1 0 polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the CNS, breast, and immune system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 

1 5 fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the treatment and diagnosis of cancer of the 

20 brain, breast, and hematopoietic system. In addition, it may be useful for the treatment 
of neurodegenerative disorders, as well as disorders of the hematopoietic system, 
including defects in immune competency and inflammation. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and 
immunotherapy targets for the above listed tumors and tissues. 

25 

FEATURES OF PROTEIN ENCODED BY GENE NO: 122 

The translation product of this gene shares sequence homology with an ATP 
synthase, a key component of the proton channel that is thought to be important in the 
translocation of protons across the membrane. 

30 This gene is expressed primarily in T cell lymphoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, T cell lymphoma. Similarly, polypeptides and antibodies directed to these 

35 polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
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lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 

5 fluid from an individual not having the disorder. 

The tissue disUibution and homology to ATP synthase indicates Aat 
polynucleotides and polypeptides corresponding to this gene are useful for the treatment 
of defects in proton transport, homeostasis, and metabolism, as well as the diagnosis 
and treatment of lymphoma. Because the gene is expressed in cells of lymphoid origin, 

10 the natural gene product may be involved in immune functions. Therefore it may be also 
used as an agent for inmiunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, and leukemia 

FEATURES OF PROTEIN ENCODED BY GENE NO: 123 

15 This gene maps to chromosome 15, and therefore, may be used as a marker in 

linkage analysis for chromosome 15. 

This gene is expressed primarily in a variety of fetal tissues, including fetal 
liver, lung, and spleen, and to a lesser extent in a variety of blood cells, including 
eosinophils and T cells. 

20 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, cancer (abnormal cell proliferation); T cell lymphomas; and hematopoietic 
disorders. Similarly, polypeptides and antibodies directed to these polypeptides are 

25 useful in providing inununological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the fetus and immune system, expression of this gene at significanfly higher or lower 
levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 

30 another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the treatment and diagnosis of conditions 
35 involving cell proliferation. Expression of tiiis gene in fetal tissues, as well as in a 
variety of blood cell lineages indicates that it may play a role in eitiier cellular 
proliferation; apoptosis; or cell survival. Thus it may be useful in the management and 
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treatment of a variety of cancers and malignancies. In addition, its expression in blood 
cells suggest that it may play additional roles in hematopoietic disorders and conditions, 
and could be useful in treating diseases involving autoinununity, immune modulation, 
inunune surveillance, and inflammation.. 

5 

FEATURES OF PROTEIN ENCODED BY GENE NO: 124 

This gene is expressed primarily in placenta and to a lesser extent in pineal gland 
and rhabdomyosarcoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

10 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, developmental, endocrine, and female reproductive disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
inmiunological probes for differential identification of the tissue(s) or cell type(s). For 

1 5 a number of disorders of the above tissues or cells, particularly of the [insert system 
where a related disease state is likely, e.g., immune], expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual haying 

20 such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred 
epitopes include those fcomprising a sequence shown in SEQ ID NO: 357 as residues: 
Leu-69 to Val-76. 

The tissue distribution indicates that polynucleotides and polypeptides 

25 corresponding to this gene are useful for diagnosis and treatment of disorders in 
development. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and immunotherapy targets for the above listed tumors and 
tissues. 

30 FEATURES OF PROTEIN ENCODED BY GENE NO: 125 
This gene is expressed primarily m benign prostatic hyperplasia. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of benign prostatic hyperplasia. Similarly, 
35 polypeptides and antibodies directed to these polypeptides are useful in providing 

immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the reproductive 
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system, expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
5 expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment and diagnosis of benign prostatic 
hyperplasia. Protein, as well as, antibodies directed against the protein may show utihty 
10 as a tumor marker and immunotherapy targets for the above listed tumors and tissues. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 126 

This gene is expressed primarily in apoptotic T-cells and to a lesser extent in 
suppressor T cells and ulcerative colitis. 

15 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, diseases involving premature apoptosis, and inmiunological and 
gastrointestinal disorders. Similarly, polypeptides and antibodies directed to these 

20 polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the inunune system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 

25 another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having, the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment and diagnosis of disorders involving 

30 inappropriate levels of apoptosis, especially in immune ceU lineages. Because the gene 
is expressed in cells of lymphoid origin, the natural gene product may be involved in 
immune functions. Therefore it may be also used as an agent for immunological 
disorders includmg arthritis, asthma, inmiune deficiency diseases (such as AIDS), and 
leukemia. 



35 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 127 
This gene is expressed primarily in Raji cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the-tissue(s) or cell type(s) present in a 
5 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, inflanmiation and T cell autoimmune disorders. Similarly /polypeptides 
and antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the inrunune system, expression of 

10 this gene at significantly higher or lower levels may be routinely detected in certain 
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 

1 5 disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 360 as residues: Asp-23 to Gly-29. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment and diagnosis of inflammation and T 
cell autoimmune disorders. Because the gene is expressed in cells of lymphoid origin, 

20 the natural gene product may be involved in immune functions. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, inmiune 
deficiency diseases (such as AIDS), and leukemia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 128 
25 The translation product of this gene shares sequence homology with an C 

elegans coding region C47D12.2 of unknown function (See Accession No. 
gnllPIDIe348986). One embodiment for this gene is the polypeptide fragments 
comprising the following amino acid sequence: EDDGFNRSIHEVILKNrrWY 
SERVLTHSLGSLLILVVIRTIQYNNfrRTRDKYLHTNCLAA^ 
30 MQRnSlJSIiSKKHNKVI£QATQSUlGSLSSNDWU>DYAQ^^ 
l^IINSCLTNSIJfflNPNLVALLYKRDLFEQFRTHPSFQDI^ 
QAGS (SEQ ID NO:657); EDDGFNRSfflEVELKNITWYSERVLTEISLGSLLILW 
(SEQ ID NO:658); RTIQYN^^reTRDKYLHTNCIAALANMSAQro^ 
RIISLFSLLSKKHN (SEQ ID NO:659); KKHNKVLEQATQSLRGSLSSNDVPLPDY 
35 AQD (SEQ ID NO:661); SCLTNSLHHNPNLVYALLYKRDLFEQFRTHPSFQD 
IMQNIDLVISFFSSRLLQAGS (SEQ ID NO:660). An additional embodiment is the 
polynucleotide fragments encoding these polypeptide fragments. This gene maps to 
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chromosome 1 8, and therefore, may be used as a marker in linkage analysis for 
chromosome 18. 

This gene is expressed primarily in smooth muscle and to a lesser extent in fetal 

liver. 

5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, atherosclerosis and other cardiovascular and hepatic disorders. Similarly, 
polypeptides land antibodies directed to these polypeptides are useful in providing 

10 immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the circulatory 
system, expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 

15 taken from an individual having such a disorder, relative to the standard gene 

expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to diis gene are useful for diagnosis and treatment of circulatory system 

20 disorders such as atherosclerosis, hypertension, and thrombosis . In addition, the tissue 
distribution indicates that polynucleotides and polypeptides corresponding to this gene 
are useful for the detection and treatment of liver disorders and cancers (e.g. 
hepatoblastoma, jaundice, hepatitis, liver metabolic diseases and conditions that are 
attributable to the differentiation of hepatocyte progenitor cells). In addition the 

25 expression in fetus would suggest a useful role for the protein product in developmental 
abnormalities, fetal deficiencies, pre-natal disorders and various would-healing models 
and/or tissue trauma. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 129 
30 The translation product of this gene shares sequence homology with a ribosomal 

protein which is thought to be important in cellular metabolism, in addition to the 
Celegans protein F40F1 1 . 1 which does not have a known function at the current time 
(See Accession No. gnllPIDle244552 ). Preferred polypeptide fragments comprise the 
following amino acid sequence: 

35 MADIQTERAYQKQPTIFQNKJCRVLLGETGKEKLPRVTNKNIGLGFKDT 

PRRLLRGTYTOKXCPFTGNVSIRGRILSGWTQDEDAEDHCHPPRI^ALHPQVQ 
PLREAPQEHVCTPVPL LQGRPDR (SEQ ID NO:662); MKMQRTIVIRRDYLH 
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YIRKYNRFEKRHKNMSVHLSPCFRDVQIGDIVTVGECRPl^KTVRF^^ 
AAGTKKQFQKF (SEQ ID NO:663); MADIQTERAYQKQPTIFQNKKRVLLGET 
GK (SEQ ID NO:664); HCHPPRLS ALHPQVQPLREAPQEHVCTPVPL LQGRPDR 
(SEQ ID NO:666); NIGLGFKDTPRRLLRGTYmKKCPFTGNVSIRGRIl^GVVTQ 
5 (SEQ ID NO:669); MKMQRTIVIRRDYLHYIRKYNRFEKRHKNMSVHLSP (SEQ 
ID NO:667); CFRDVQIGDIVTVGECRPLSKTVRFNVLKVTKAAGTKKQFQKF 
(SEQ ID NO:668). Also preferred are polynucleotide fragments encoding these 
polypeptide fragments. 

This gene is expressed primarily in Wilm's tumor and to a lesser extent in 

1 0 thymus and stromal cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of die tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, diseases affecting RNA translation. Similarly, polypeptides and 

1 5 antibodies directed to these polypeptides are useful in providing inmiunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the Wilm's tumors, expression of this gene 
at significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 

20 fluid or spinal fluid) or another tissue or cell sample taken ftx)m an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred 
epitopes include those comprising a sequence shown in SEQ ID NO: 362 as residues: 
Thr-11 to Asp-20. 

25 The tissue distribution and homology to a ribosomal protein indicates that 

polynucleotides and polypeptides corresponding to this gene are useful for diseases 
affecting RNA translation. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 130 
30 The translation product of this gene shares sequence homology with a yeast 

DNA helicase which is thought to be important in global transcriptional regulation (See 
Accession No. gnllPIDIe243594). One embodiment for this gene is the polypeptide 
fragments comprising the following amino acid sequence: DFYDSDWNPTVDQQA 
MDRAHRLGK3TKQWVYRUCKGTEERILQRAKEKSEIQRM^ (SEQ ID 
35 NO:670); TRMIDLLEEYMVYRKI^TXRU)GSSKISERRDMVADFQNR^^ 
FVFLLSTRAGGLGINLTAXDTVHF (SEQ ID NO:671); TRMIDLLEEYMVYRK 
HTYXRLDGSSKISERRDM (SEQ ID NO:674); RRDMVADFQNRNDIFVFLL 
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STRAGGLGINLTAXDTVHF (SEQ ID NO:675) , IFYDSDWNFTVDQQAMD 
RAHRLGQTKQVTVYRLICKG (SEQ ID NO:676); RLICKGTIEERILQRAK 
EKSEIQRMVISG (SEQ ID NO:678). An additional embodiment is the polynucleotide 
fragments encoding these polypeptide fragments. / 
5 This gene is expressed primarily in amygdala. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, diseases and disorders of the brain. Similarly, polypeptides and 

10 antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the central nervous system, expression of 
this gene at significantly higher or lower levels may be routinely detected in certain 
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 

15 urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution and homology to a DN A helicase indicates that 
20 polynucleotides and polypeptides corresponding to this gene are useful for diseases 
affecting RNA transcription, particularly developmental disorders and healing wounds 
since the later are though to approximate developmental transcriptional regulation. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 131 
25 This gene is expressed primarily in prostate and to a lesser extent in amygdala 

and pancreatic tumors. 

Therefore, polynucleotides and polypeptides of the invention are useftd as 
reagents for differential identification of the lissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
30 not limited to, prostate enlargement and gastrointestinal disorders, particularly of the 
pancreas and gall bladder. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing inraiunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the reproductive system, expression of this gene at significantly higher or 
35 lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
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the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment and diagnosis of prostate diseases, 
5 including benign prostatic hyperplasia and prostate cancer. In addition, the tissue 
dismbution in tumors of the pancreas indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and intervention of these tumors, in 
addition to other tissues where expression has been indicated. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
10 immunotherapy targets for the above listed tumors and tissues. 

^ FEATURES OF PROTEIN ENCODED BY GENE NO: 132 

This gene is expressed primarily in adult lung and to a lesser extent in 
hypothalamus. 

15 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, pulmonary diseases and neurological disorders. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing immunological 

20 probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the pulmonary and respiratory 
systems, expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 

25 taken from an individual having such a disorder, relative to the standard gene 

expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of pulmonary and 

30 respiratory disorders such as emphysema, pneumonia, and pulmonary edema and 
emboli. In addition, the tissue distribution indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the detection/treatment of 
neurodegenerative disease states and behavioral disorders such as Alzheimer's Disease, 
Parkinson's Disease, Huntington's Disease, schizophrenia, mania, dementia, paranoia, 

35 obsessive compulsive disorder and panic disorder. In addition, the gene or gene 
product may also play a role in the treatment and/or detection of developmental 
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disorders associated with the developing embryo, sexually-linked disorders, or 
disorders of the cardiovascular system. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 133 

5 This gene is expressed primarily in human liven 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, cirrhosis of the liver and other hepatic disorders. Similarly, polypeptides 

10. and antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the digestive system, expression 
of this gene at significantly higher or lower levels may be routinely detected in certain 
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 

1 5 urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
20 corresponding to this gene are useful for diagnosis and treatment of liver disorders such 
as cirrhosis, jaundice, and Hepatitus. Protein, as well as, antibodies directed against the 
protein may show utility as a tumor marker and/or immunodierapy targets for the above 
listed tissues. 

25 FEATURES OF PROTEIN ENCODED BY GENE NO: 134 

. This gene is expressed primarily in fetal kidney and to a lesser extent in fetal 
liver and spleen. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

30 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, development and regeneration of liver and kidney and immunological 
disorders. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 

35 the digestive and excretory systems, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
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another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include those 
comprising a sequence shown in SEQ ID NO: 367 as residues: Pro-70 to Arg-77, Tyr- 
5 102toThr-107. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of diseases of the 
kidney and liver, such as cirrhosis, kidney failure, kidney stones, and liver failure, 
hepatoblastoma, jaundice, hepatitis, liver metabolic diseases and conditions that are 
10 attributable to the differentiation of hepatocyte progenitor cells. In addition the 

expression in fetus would suggest a useful role for the protein product in developmental 
abnormalities, fetal deficiencies, pre-natal disorders and various would-healing models 
and/or tissue trauma. 

15 FEATURES OF PROTEIN ENCODED BY GENE NO: 135 

This gene is expressed primarily in brain, bone marrow, and to a lesser extent in 
placenta, T cell, testis and neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

20 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, neurodegenerative and immunological diseases and cancer. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the nervous and 

25 immune systems, expression of this gene at significantiy higher or lower levels may be 
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 

30 mdividual not having the disorder. Preferred epitopes include those comprising a 
sequence shown in SEQ ID NO: 368 as residues: Met-1 to His-6. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useftd for the detection/treatment of neurodegenerative 
disease states and behavioral disorders such as Alzheimer's Disease, Parkinson's 

35 Disease, Huntingtons Disease, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder and panic disorder. In addition, the gene or gene product may also 



wo 98/54963 



PCT/US98/11422 



105 

play a role in the treatment and/or detection of developmental disorders associated with 
the developing embryo, or sexually-linked disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 136 

5 Translatation product of this gene is homologous to the human WD repeat 

protein HANI 1. Preferred polypeptide fragments comprise the following amino acid 
sequence: 

MSLHGKRKEIYKYEAPWTVYAMNWSVRPDKRFRLALGSFVEEYNNKVQLVG 
LDEESSEnCRNTFDHPYPTTKLMWIPDTKGVYPDLLATSGDYLRVWRVGETET 

10 RLECLLNNNKNSDFCAPLTSFDWNEVDPYLLGTSSIDTTCTIWGLETGQVLGRV 
NLVSGHVKTQLIAHDKEVYDIAFSRAGGGRDMFASVGADGSVRMFDLRHLEH 
STIIYEDPQHHPlIJU.CWNKQDPNYI^TMAMDGMEWIlJ)VRVPAHLXPGTriE 
HVSMALLGPHIHPATSALQRMTTRLSSGTSSKCPEPLRTLSWFTQLXGEINNVQ 
WASTQPELSPSATTTAWRYSECSVGGAVPTRQGLLYFLPLPHPQS (SEQ ID 

15 NO:679);MSLHGKRmYKYEAPWTVYAMNWSVRPDKRFRLALGSFV 

EEYNNKVQLVGLDEESSEHCRNTFDHPYFTTKLMWIPiyrKGVYPDLLATSGDY 
UiVWRVGETETRl^CLLNNNKNSDFCAPLTSFDWNEVDPYLL (SEQ ID 
NO:680); SFDWNEVDPYLLGTSSIDTTCnWGLETGQVLGRVNLVSGHVK 
TQLIAHDKEVYDL\FSRAGGGRDMFASVGADGSVRMFDLRHLEHSTnYEDPQH 

20 HPLLRLCWNKQDPNYLATMAMDGMEVVTLDVRVPAHLXPGTncSEQID 
NO:681); VGADGSVRMFDLRHLEHSTIIYEDPQHHPLLRLCWNKQDPNYLA 
TMAMDGMEVmOVRVPAHIJffGTTIEHVSMALLGPHIHPATSALQRMTTRLS 
SGTSSKCPEPLRTLSWPTQLXGEINNVQWASTQPELSPSATTTAWRYSECSVG 
GAVPTRQGLLYFLPLPHPQS (SEQ ID NO:682). Also preferred are polynucleotide 

25 fragments encoding these polypeptide fragments. 

This gene is expressed primarily in placenta, embryo, T cell and fetal lung and 
to a lesser extent in endothelial, tonsil and bone marrow. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

30 biological sample and for diagnosis of diseases and conditions, which iiKlude, but are 
not limited to, immunological and developmental diseases in addition to cancers. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing inununological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 

35 immune system, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
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cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. Preferred epitopes include those comprising a 
sequence shown in SEQ ID NO: 369 as residues: Gly-19 to Gln-28, Pro-36 to Phe-42. 
5 The tissue distribution in tumors of colon, ovary, and breast origins indicates 

that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis and intervention of these tumors, in addition to other tumors where 
expression has been indicated. Protein, as well as, antibodies directed against the 
protein may show utility as a tumor marker and/or immunotherapy targets for the above 
10 listed tumors and tissues. Because the gene is expressed in cells of lymphoid origin, the 
natural gene product may be involved in immune functions. Therefore it may be also 
used as an agent for unmunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, and leukemia. 

15 FEATURES OF PROTEIN ENCODED BY GENE NO: 137 

This gene is expressed primarily in TNF and INF induced epithelial cells, T 
cells and kidney. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

20 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, inflammatory conditions particularly inflammatory reactions in the 
kidney. Similarly, polypeptides and antibodies directed to these polypeptides are useful 
in providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of renal 

25 system, expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 

30 individual not having the disorder. Preferred epitopes include those comprising a 
sequence shown in SEQ ID NO: 370 as residues: Thr-67 to Gly-72, Gln-132 to Ala- 
145, Arg-150toPro-157. 

The tissue distribution indicates that the protein products of this gene are useful 
for treating the damage caused by inflanmiation of the kidney. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 138 

This gene maps to chromosome 1, and therefore, may be used as a marker in 
linkage analysis for chromosome 1 (See Accession No. D63485). 

This gene is expressed primarily in breast cancer and colon cancer and to a 
5 lesser extent in thymus and fetal spleen. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, cancers, especially of the breast and colon tissues. Similarly, 
10 polypeptides and antibodies directed to these polypeptides are useful in providing 

inununological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune system, 
expression of this gene at significandy higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
15 plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution in tumors of colon and breast origins indicates that 
20 polynucleotides and polypeptides corresponding to this gene are useful for diagnosis 
and intervention of these tumors, in addition to other tumors where expression has been 
indicated. Protein, as well as, antibodies directed against the protein may show utility as 
a tumor marker and/or immunotherapy targets for the above listed tumors and tissues. 

25 FEATURES OF PROTEIN ENCODED BY GENE NO: 139 

This gene maps to chromosome 17. and therefore, can be used as a marker for 
linkage analysis from chromosome 17. 

This gene is expressed primarily in CD34 positive cells, and to lesser extent in 
activated T-cells and neutrophils. 

30 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but arc 
not limited to, inmiunologically related diseases and hematopoietic disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 

35 immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of tiie above tissues or cells, particularly of the inunune system 
and hematopoietic system, expression of this gene at significandy higher or lower levels 
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may be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
5 fluid from an individual not having the disorder. 

The tissue distribution in CD34, T-cell and neutrophils indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for treatment 
and diagnosis of hematopoietic disorders and immunologically related diseases, such as 
anemia, leukemia, inflammation, infection, allergy, immunodeficiency disorders, 
10 arthritis, asthma, immune deficiency diseases such as AIDS. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 140 

This gene was recendy cloned by another group, who called the gene 
KIAA0313 gene. (See Accession No. dl021609.) Preferred polypeptide fragments 

IS comprise the amino acid sequence: 

LYATATVISSPSTEXLSQDQGDRASLDAADSGRGSWTSCSSGSHDNIQTIQ 
HQRSWETLPFGHTHFDYSGDPAGLWASSSHMDQIMFSDHSTKYNRQNQSRES 
LEQAQSRASWASSTGYWGEDSEGDTGTIKRRGGKDVSffiAESSSLTSVTTEETK 
PVPMPAHIAVASSTTKGLIARKEGRYREPPPTPPGYIGIPITDFPEGHSHPARKP 

20 PDYNVALQRSRMVARSSDTAGPSSVQQPHGHPTSSRPVNKPQWHKXNESDPR 
LAPYQSQGFSTEEDEDEQVSAV (SEQ ID NO:683); HMDQIMFSDHSTKYNRQ 
NQSRESLEQAQSRASWASSTGYWGE (SEQ ID NO:684); SVTTEETKPVPMP 
AHIAVASSTTKGLIARKEGRYREPPPTPPGYIGIPITD (SEQ ID NO:685); and 
VALQRSRMVARSSDTAGPSSVQQPHGHPTSSRPVNKPQW 

25 HKXNESDPRLAPYQSQGF (SEQ ID NO:686). Also preferred are polynucleotide 
fragments encoding tiiese polypeptide fragments. This gene maps to chromosome 4, 
and therefore, may be used as a marker in linkage analysis for chromosome 4 (See 
Accession No. AB00231 1 ). 

This gene is expressed primarily in ovarian cancer, tumors of the Testis, brain. 

30 and colon. 

Therefore, polynucleotides and polypeptides of the invention are usefiil as 
reagents for differential identification of die Ussue(s) or cell type(s) present m a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, ovarian, testicle, brain and colon cancers. Similarly, polypeptides and 
35 antibodies directed to these polypeptides are useful in providing inununological probes 
for differential identification of die tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of die male and female reproductive systems, 
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expression of this gene at significantly higher or lower levels may be routinely, detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
5 the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution in tumors of colon, ovary, testis, and brain origins 
indicates that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis and intervention of these tumors, in addition to other tumors where 
10 expression has been indicated. Protein, as well as, antibodies directed against the 

protein may show utility as a tumor marker and/or immunotherapy targets for the above 
listed tumors and tissues. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 141 

15 This gene is expressed primarily in spleen and colon cancer. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, colon cancer and immunological disorders. Similarly, polypeptides and 

20 antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the gastrointestinal trace and immune 
systems, expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., 

25 serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution in tumors of colon, ovary, and breast origins indicates 

30 that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis and intervention of these tumors, in addition to other tumors where 
expression has been indicated. Protein, as well as, antibodies directed against the 
protein may show utUity as a tumor marker and/or inununotiierapy targets for the above 
listed tumors and tissues. 



35 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 142 

Translation product is homologous to T cell translocation protein, a putative zinc 
finger factor (See Accession No. 340454), as well as to the G-protein coupled receptor 
TM5 consensus polypeptide (See Accession No. R50734). Preferred polypeptide 
5 fragments comprise the following amino acid sequence: 

CLLFVFVSLGMRCLFWTIVYNVLYLKHKCNTVLLCYHLCSI (SEQ ID NO:687); 
ACSmPAFEMVMRAKDNVYHLDCFACQLCNQRXCVGDKFFLKNNXXLCQT 
DYEEGLMKEGYAPXVR (SEQ ID NO:688). Also preferred are polynucleotide 
fragments encoding these polypeptide fragments. 

10 This gene is expressed primarily in fetal brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, neurological disorders including brain cancer. Similarly, polypeptides 

1 5 and antibodies directed to these polypeptides are useful in providing inmiunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the Central Nervous System, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 

20 plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
25 corresponding to this gene are useful for the detection/treatment of neurodegenerative 
disease states and behavioral disorders such as Alzheimer's Disease, Parkinson's 
Disease, Huntingtons Disease, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder and panic disorder. In addition, the gene or gene product may also 
play a role in the treatment and/or detection of developmental disorders associated with 
30 the developing embryo. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 143 

Translation product for this gene has significant homology to the Fas ligand, 
which is a cysteine-rich type II transmembrane protein/tumor necrosis factor receptor 
35 homolog. Mutations within this protein have been shown to result in generalized 
lymphoproliferative disease leading to the development of lymphadenopathy and 
autoimmune disease (See Medline Article No. 94185175). Preferred polypeptide 
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fragments comprise the following amino acid sequence: 

SALSEPGAPDRRRPCPESVPRRPDDEQWPPPTALCLDVAPLPPSS (SEQ ID 
NO:689). Also preferred are polynucleotide fragments encoding these polypeptide 
fragments (See Accession No. 473565). 
5 This gene is expressed primarily in osteoblasts, lung, and brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, osteoblast-related, pubnonary, neurological, and immunological 

10 diseases. Similarly, polypeptides and antibodies directed to these polypeptides are 

useful in providing immunological probes for differential identification of the tissue(s) 
. or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the skeletal and nervous systems, expression of this gene at significandy higher or 
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 

15 tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual havmg such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include those 
comprising a sequence shown in SEQ ID NO: 376 as residues: Trp-33 to Thr-40, Lys- 

20 45toIle-63. 

The tissue distribution in osteoblasts, lung, and brain combined with its 
homology to the Fas ligand indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and intervention of these tumors, in 
addition to other tumors where expression has been indicated. Protein, as well as, 

25 antibodies directed against the protein may show utility as a tumor marker and/or 

immunotherapy targets for the above listed tumors and tissues. Because the Fas hgand 
gene is known to be expressed in cells of lymphoid origin, the natural gene product 
may be involved in inunune functions. Therefore it may be also used as an agent for 
immunological disorders including asthma, immune deficiency diseases such as AIDS 

30 and leukemia, and various autoinmiune disorders including lupus and arthritis. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 144 

This gene shares sequence homology with a 21 .5 KD transmembrane protein in 
the SEC15-SAP4 intergenic region of yeast. (See Accession No. 1723971.) Preferred 
35 polypeptide fragments comprise the amino acid sequence: 

AHASESGERWWACCGVRFGLRSIEAIGRSCCHDGPGGLVANRGRRFKWAIEL 
SGPGGGSRGRSDRGSG(2GDSLYPVGYLDKQVPDTSVQETDRILVEKRCWDIAL 
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GPLKQIPMNmMYMAGNTISIFPTMMVCMMAWR^ 
KFLQGLVYLIGNLMGLALAVYKCQSMGLLPTHASDWLAFEP^^ 
GLLL (SEQ ID NO:691); PVGYLDKQVPDTSVQETDRILVEKRCWDIALGPLKQ 
IPMNLH (SEQ ID NO:693); and ATFKMLESSSQKFLQGLVYLIGNLMGLALAV 
5 YKCQSMGLLPTHASD (SEQ ID NO:692). Also preferred arc polynucleotide 
fragments encoding these polypeptide fragments. 

This gene is expressed primarily in osteoclastoma, hemangiopericytoma, liver, 

lung. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
10 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, osteoclastoma, hemangiopericytoma, liver and lung tumors. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
inmiunological probes for differential identification of the above tissue(s) or cell 
1 5 type(s). For a number of disorders of the above tissues or cells, particularly of the lung 
and liver systems, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
20 gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosing osteoclastoma, 
hemangiopericytoma, liver and lung tumors. 

25 

FEATURES OF PROTEIN ENCODED BY GENE NO: 145 

Translation product of this gene shares homology with the glucagon-69 gene 

which may indicate this gene plays a role in regulating metabolism. (See Accession No. 

A6()318) One embodiment for this gene is the polypeptide fragments comprising the 
30 following amino acid sequence: 

PTTKLDIMEKKKfflQIRFPSFjrHKLVDSGRMRSKRET^ (SEQ ID 

NO:694). An additional embodiment is the polynucleotide fragments encoding these 

polypeptide fragments. 

This gene is expressed primarily in brain, kidney, colon, and testis. 
35 Therefore, polynucleotides and polypeptides of the invention are usefid as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
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not limited to, brain, kidney, colon, and testicular cancer. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the male reproductive system, neurological, 
5 circulatory, and gastrointestinal systems, expression of this gene at significantly higher 
or lower levels may be routinely detected in certain tissues (e.g., cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 

10 or bodily fluid from an individual not having the disorder. 

The tissue distribution in tumors of brain, kidney, colon, and testis origins, 
indicates that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis and intervention of these tumors, in addition to other tumors wher^ 
expression has been indicated. Protein, as well as, antibodies directed against the 

1 5 protein may show utility as a tumor marker and/or immunotherapy targets for the above 
listed tumors and tissues. The tissue distribution indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the detection/treatment of 
neurodegenerative disease states and behavioral disorders such as Alzheimer* s Disease, 
Parkinson's Disease, Huntingtons Disease, schizophrenia, mania, dementia, paranoia, 

20 obsessive compulsive disorder and panic disorder. In addition, the gene or gene 
product may also play a role in the treatment and/or detection of developmental 
disorders associated with the developing embryo, sexually-linked disorders, or 
disorders of the cardiovascular system. 

25 FEATURES OF PROTEIN ENCODED BY GENE NO: 146 

The translation product of this gene shares sequence homology with goUath 
protein which is thought to be important in the regulation of gene expression during 
development. Protein may serve as a transcription factor. One embodiment for this gene 
is the polypeptide fragments comprising the following amino acid sequence: 

30 TEHEAVMITELRGKDIl^YLEKMSVQMTIAVGTRMPPKNFSRGSL^ 
LMnSSAWUFYHQmYTNARDRNQRRLGDAAKKAISi^ 
PDFDHCAVCIESYKQNDVVRILPCKHVFHKSCVDPWl^EHCTCPM 
LGIV (SEQ ID NO:695); TEHHAVMITELRGKDILSYI^KNISVQNmAVGTRM^ 
PKNFSRGSLVFVSISFIVLM HSSAWLIFYF (SEQ ED NO:697); SISHVLMHSSA 

35 WUITHQmYTNARDRNQRRLGDAAKKAISKLTTRT (SEQ ID 

NO:698); VKKGDKETDPDFDHCAVCffiSYKQNDVVRnJ>CKHVFHKSCVDP 
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WLSEHCTCPMCKLNILKALGIV (SEQ ID NO:699). An additional embodiment is 
the polynucleotide fragments encoding these polypeptide fragments (See Accession No. 
157535). Moreover, another embodiment is the polynucleotide fragments encoding 

these polypeptide fragments: 

MTHPGTEHIIAVMTTELRGKDILSYLEKNISVQMTIAVGTiyVIPPKNFSRGS 

LVFVSISFTVLNfflSSAWLIFYHQmYTNARDRNQRRLGDAAKKAISKLTTRTV 

KKGDKETDPDFDHCAVCIESYKQNDVVRE.PCKHVFHKSCVDPWLSEHCTCP 

MCKLNILKALGIVPNLPCTDNVAFDMERLTRTQAVNRRSALGDLAGDNSLGLE 

PLRTSGISPLPQDGELTPRTGEINIAVTKEWFIIASFGLLSALTLCYMIIRATASLN 

ANEVEWF (SEQ ID NO:696);MTHPGTEHIIAVMITELRGKDILSYLEKNISVQM 

TIAVGTRMFPKNFSRGSLVFVSISnVLMnSSAWLIFYHQKIRYTNARDRNQRR 

LX3DAAKKAISKLTTRT (SEQ ID NO:700); AAKKAISKLTTRTVKKGDKE 

TDPDFDHCAVOESYKQNDVVRILPCKHVFHKSCVDPVVLSEHCTCPMCKLNIL 

KALGIVPNLPC (SEQ ID NO:701); TQAVNRRSALGDLAGDNSLGLEPLRTSGI 

SPUH3IXJELTPRTGEI^^AVTKE^^TIIASFGIiSALTIX:YMIIRATASLN 

F (SEQ ID NO:702); PLHGVADHLGCDPQTRFFVPPNIKQWIALLQRGNCTF 

KEKISRAAFHNAVAVVIYNNKSKEEPVT\miPGTEHnAVMITELRGKDILSYLE 

KNISVQMTIAVGTRMPPKNFSRGSLVFVSISnVLmSSAWLIFYnQKIRYTNA 

RDRNQRRLGDAAKKAISKLTTRTVKKGDKETDPDFDHCAVCIESYKQNDVVRI 

LPCKHVFHKSCVDPWLSEHCTCPMCKLNILKALGIVPNLPCTDNVAFDMERLT 

RTQAVNRRSALGDLAGDNSLGLEPLRTSGISPLPQDGELTPRTGEINIAVTKEW 

FnASFGLLSALTLCYMIIRATASLNANEVEWF(SEQ ID NO:703); and 

HGVADHLGCDPQTRFFVPPNIKQWIALLQRGNCTFKEKISRAAFHNAVAWIY 
NNKSKEE (SEQ ID NO:704). An additional embodiment is the polynucleotide 
fragments encoding these polypeptide fragments. When tested against Jurkat cell lines, 
supematants removed from cells containing this gene activated the GAS pathway. 
Thus, it is likely that this gene activates immune cells through the JAKS/STAT signal 
transduction pathway. 

This gene is expressed primarily in macrophage, breast, kidney and to a lesser 
extent in synovium, hypothalamus and rhabdomyosarcoma. 

Therefore, polynucleotides aind polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) piesent in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, schizophrenia and cancer. Similarly, polypeptides and antibodies directed 
to these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, pardcularly of the immune and neural system, expression of this gene at 
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significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
5 in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to zinc finger protein indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the treatment 
of schizophrenia, kidney disease and other cancers. The tissue distribution in 
macrophage, breast, and kidney origins indicates that polynucleotides and polypeptides 

10 corresponding to this gene are useful for diagnosis and intervention of tumors within 
these tissues, in addition to other tumors where expression has been indicated. Protein, 
as well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tumors and tissues. Because the gene 
is expressed in cells of lymphoid origm, the natural gene product may be involved in 

15 immune functions. Therefore it may be also used as an agent for immunological 

disorders including arthritis, asthma, inunune deficiency diseases such as AIDS, and 
leukemia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 147 

20 The translation product of this gene shares sequence homology with HNP36 

protein, an equilibrative nucleoside transporter, which is thought to be important in 
gene transcription as well as serving as an important component of the nucleoside 
U-ansport apparatus (See Accession No. 1845345). One embodiment for this gene is 
the polypeptide fragments comprising the following amino acid sequence: 

25 MSGQGLAGFFASVAMICAIASGSELSESAFGYFrrACAVnLTnCYLGLPRLEFYR 
YYQQLKLEGPGEQETKLDLISKGEEPRAGKEESGVSVSNSQPTNESHSIKAILK 
MSVLAFSVCHFTITIGMFPAVTVEVKSSUGSSTWERYFff 
RSLTAVFMWPGKDSRWIJSWXLARLVFVP^ 
FFMAAFAFSNGYIASLCMCFGPKKVKPAEAETAEPSWPSSCX^^ 

30 PSCSGQLCDKGWTEGLPASLPVCLLPLPSARGDPEWSGGFFF (SEQ ID 
NO:705); MSGQGLAGFFASVAMICAIASGSELSESAFGYFrrACAVnLTnC 
YLGLPRLEFYRYYQQLKLE GPGEQETKLDLISKGEEPRAGKEESGVSVSNSQ 
FTNESHSI (SEQ ID NO:706); SGVSVSNSQPTNESHSIKAILKNISVLAFS VCH 
FTmGMFPAWVEVKSSIAGSSTWERYFIPVSCK-TFN^ (SEQ ID 

35 NO:707),TIGMFPAVTVEVKSSIAGSSTWERYFIPVSCFLTFNIFDWLGRSL^^ 
MWPGKDSRWLPS WXLARLVFVPLLLLCNIK PRRYLTWFEHDA (SEQ ID 
NO:708);FGPKKVKPAEAETAEPSWPSSCWWHWGLFSPSCSGQLCDK 
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GWTEGLPASLPVCLLPLPS ARGDPEWSGGFFF (SEQ ID NO:709). An additional 
embodiment is the polynucleotide fragments encoding these polypeptide fragments. 

This gene is expressed primarily in eosinophils and aortic endothelium and to a 
lesser extent in umbilical vein endothelial cell and thymus. 
5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, hematopoietic disease. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 

10 identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the circular system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 

1 5 such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to HNP36 protein indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the treatment 
of blood neoplasias and other hematopoietic disease. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 148 

This gene is expressed primarily in breast cancer cell lines, thymus stromal 
cells, and ovary. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
25 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, endocrine and female reproductive system diseases including breast 
cancer. Similarly, polypeptides and antibodies directed to these polypeptides are useful 
in providing inununological probes for differential identification of the tissue(s) or ceU 
30 type(s). For a number of disorders of the above tissues or cells, particularly of the 

endocrine system, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or * 
cell sample taken from an individual having such a disorder, relative to the standard 
35 gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 
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The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatment of endocrine 
disorders. In addition, the tissue distribution in tuniors of thymus, ovary, and breast 
origins indicates that polynucleotides and polypeptides corresponding to this gene are 
5 useful for diagnosis and intervention of these tumors, in addition to other tumors where 
expression has been indicated. Protein, as well as, antibodies directed against the 
protein may show utility as a tumor marker and/or inmiunotherapy targets for the above 
listed tumors and tissues 

10 FEATURES OF PROTEIN ENCODED BY GENE NO: 149 

Translation product of this gene has homology to pmtl and pmt 2, two 
conserved schizosaccharomyces pombe genes. One embodiment for this gene is the 
polypeptide fragments comprising the following amino acid sequence: 
DDDGFEIVPffiDPAKHRILDPEGLALGAVIASSKKAKRDLIDNSFNRYTFNEDE 

15 ELPEWFVQEEKQHRKQLPVGKKEVEHYRKRWREINARPIXXXXXXXXXXX 
XXXXXXLEQTRKKAEAWNTVDDCRTRES (SEQ ID NO:710); 
DDDGFEIWffiDPAKHRILDPEGLALGAVIASSKKAKRDLroNSFl^TO (SEQ 
ID NO:71 1); KRWREINARPIXXXXXXXXXXXXXXXXXLEQTRKKAE 
AVVNTVDDCRTRES (SEQ ID NO:712). An additional embodiment is the 

20 polynucleotide fragments encoding these polypeptide fragments (See Accession No. 
el216734). 

This gene is expressed primarily in retina and ovary and to a lesser extent in 
brreast cancer cell, epididymus and osteosarcoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

25 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, neuronal growth disorders, cancer and reproductive system disorders. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing inununological probes for differential identification of the tissue(s) or cell 

30 type(s). Forantimberofdisordersof the above tissues or cells, particularly of the 

neural and reproductive system, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 

35 the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include those 
comprising a sequence shown in SEQ ID NO: 382 as residues: Met-1 to Gly-7. 
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The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis or treatment of reproductive 
system disease and cancers. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 150 

One embodiment for this gene is the polypeptide fragments comprising the following 
amino acid sequence: 

MIKDKGRARTALTSSQPAHLCPENPLLHLKAAVKEKKRNKKKKTIGSPK^ 
PLNNKLLNSPAKTLPGACGSPQKLIDGFLKHEGPPAEKPLEELSASTSGVPGLS 

10 SLQSDPAGCVRPPAPNLAGA\^FNDVKTLLREWITTISDPMEEDILQVVK^ 
LIEEKDI^KlX>LVIKYMKRLMQQSVESVWNMAro 
VT (SEQ ID NO:713); MIKDKGRARTALTSSQPAHLCPENPLLHLKAAVKE 
KKRNKKKKTIGSPKRIQ (SEQ ID NO:714); KRIQSPLNNKLLNSPAKT 
LPGACGSPQKLIDGFLKHEGPPAEKPLEELSASTSGVPGLSSLQSDPAGCVRPP 

15 APNLAGAVEFNDVKTLLREWnnSDPM (SEQ ID NO:715); 
TKDPMEEDILQWKYCTOLIEEKDLEKUDLVIKYMKRIJ^ 
SVWNMAFDFILDNVQVVU3(3TYGSTLKVT( An additional 

embodiment is the polynucleotide fragments encoding these polypeptide fragments. 
This gene is expressed primarily in 12 week embryo and to a lesser extent in 

20 hemangiopericytoma and frontal cortex. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, growth disorders and hemangiopericytoma. Similarly, polypeptides and 

25 antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of tiietissue(s) or cell type(s). For a number of disorders 
of Uie above tissues or cells, particularly of the circular and neural system, expression 
of this gene at significantiy higher or lower levels may be routinely detected in certain 
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 

30 urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to die standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 383 as residues: Leu-4 to Lys-11. 

35 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to tiiis gene are useful for die treatment of growtii disorders, 
hemangiopericytoma and other soft tissue tumors. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 151 

The translation product of this gene has been found to have homology to a 

human DNA mismatch repair protein PMS3, Preferred polypeptide fragments comprise 

the following amino acid sequence: FCHDCKFPEASPAMNCEP (SEQ ID NO:717). 

Also preferred are polynucleotide fragments encoding these, polypeptide fragments (See 

Accession No. R95250). 

This gene is expressed primarily in neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differentia] identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, lymphoma, immunodeficiency diseases, and cancers resulting from 
genetic instability. Similarly, polypeptides and antibodies directed to these polypeptides 
are useful in providing immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include tiiose 
comprising a sequence shown in SEQ ID NO: 384 as residues: Met-1 to Lys-6. 

The tissue distribution in neutrophils and the sequence homology indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis of 
Hodgkin's lymphoma, since the elevated expression and secretion by the tumor mass 
may be indicative of tumors of this type. Additionally the gene product may be used as 
a target in the inununotherapy of the cancer. Because the gene is expressed in cells of 
lymphoid origin, the natural gene product may be involved in immune functions. 
Therefore it may be also used as an agent for immunological disorders including 
arthritis, asthma, immune deficiency diseases such as AIDS, and leukemia. 
Furthermore, its homology to a known DNA repair protein would suggest gene may be 
useful in establishing cancer predisposition and prevention in gene therapy applications. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 152 

This gene is expressed primarily in neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
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biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, infectious diseases and lymphoma. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the inunune system, expression of this gene 
at significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the treatment of inflanmiation and infectious 
diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 153 

One embodiment for this gene is the polypeptide fragments comprising the 
following amino acid sequence: 

MASSVPAGGHTRAGGIFLIGKLDLEASLFKSFQWLPFVLRKKC 
NFFCWDSSAHSLPLHPLSASCSAPACHASDTHLLYPSTRALCPSIFAWLVAPHS 
VFRTNAPGPTPSSQSSPVFPVFPVSFMALIVCXLVCC (SEQ ID NO:720); 

MASSVPAGGHTRAGGIFLIGKLDLEASLFKSFQWLPFVLRKKCNFFCWDSSAH 
SLPLHPLSASCSAPACHA (SEQ ID NO:721);FAWLVAPHSVFRTNAPGPTPS 
SQSSPVFPVFPVSFMALIVCXLVCC (SEQ ID NO:722). An additional embodiment 
is the polynucleotide fragments encodmg these polypeptide fragments. 
This gene is expressed primarily in neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, inflammation and infectious disease. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a nimiber of disorders 
of the above tissues or cells, particularly of the inunune system, expression of this gene 
at significantiy higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual havmg 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred 
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epitopes include those comprising a sequence shown in SEQ ID NO: 386 as residues: 
Ser-11 toPro-17. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the treatment of infectious diseases and 
S inflammation. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 154 

This gene is expressed in multiple tissues including ovary, uterus, adipose 
tissue, brain, and the liver. 

10 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, uterine, ovarian, brain, and liver cancer. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 

15 for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the female reproductive isystem, expression 
of this gene at significantly higher or lower levels may be routinely detected in certain 
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 

20 individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution of this gene indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnostic or therapeutic uses in 
25 the treatment of the female reproductive system, obesity, and liver disorders, 
particularly cancer in the above tissues. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 155 

This gene maps to chromosome 3, and therefore, may be used as a marker in 
30 linkage analysis for chromosome 3 (See Accession No. D87452). 

This gene is expressed in multiple tissues including brain, aortic endothelial 
cells, smooth muscle, pituitary, testis, melancytes. spleen, nertrophils, and placenta. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
35 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, immunological disorders including inununodeficiencies, cancers of the 
brain and the female reproductive system, as well as cardiovascular disorders, such as 
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atherosclerosis and stroke. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the central nervous and immune systems, expression of this gene at 
5 significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 
10 The tissue distribution suggest that polynucleotides and polypeptides 

corresponding to this gene are useful in treatment/detection of disorders in the nervous 
system, including schizophrenia, neurodegeneration, neoplasia, brain cancer as well as 
cardiovascular and female reproductive disorders including cancer within the above 
tissues. 

15 

FEATURES OF PROTEIN ENCODED BY GENE NO: 156 

The translation product of this gene shares sequence homology with the human 
gene encoding cytochrome b561 (See Accession No. P10897). Cytochrome b561 is a 
transmembrane electron transport protein that is specific to a subset of secretory vesicles 
20 containing catecholamines and amidated peptides. This protein is thought to supply 
reducing equivalents to the intravesicular enzymes dopamine-beta-hydroxylase and 
alpha-peptide amidase. Preferred polypeptides of the invention comprise the amino acid 
sequence: 

MAMEGWRFLALLGSALLVGFI^VIFALVWLHYREGLGWDGSALEFNA^ 
25 VLMVTGFVHQGlAHVYRLPWTWKCSKLLMKSmAGLNAVAAILAn^ 
NHNVNMANMYSUISWVGUAVICYIXQIJL^GFSV^ 
YSGWIFGTVIATAUWGLTEKLIFSLRDPAYSTFPPEGVFVNTLGIXILW 
WI\nrRPQWKRPKEPNSTILHPNGGTEQGARGSMPAYSGNNMDKSDSEL 
NSEVAARKRNLALDEAGQRSTM (SEQ ID NO:724); as well as antigenic fragments 
30 of at least 20 amino acids of this gene and/or biologically active fragments. Also 
preferred are polynucleotide fragments encoding these polypeptide fragments. 
This gene is expressed primarily in anergic T-cells. 
Therefore, polynucleotides and polypieptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
35 biological sample and for diagnosis of diseases and conditions, which include, but are 
not hmited to, immune system and metabolism related diseases. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing immunological 
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probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the immune system, expression of 
this gene at significantly higher or lower levels may be routinely detected in certain 
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
5 urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that the protein product or RNA of this gene is 
10 useful for treatment or diagnosis of inmiune system and metabolic diseases or 
conditions including Tay-Sachs disease, phenylketonuria, galactosemia, various 
porphyrias, and Hurler's syndrome. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 157 

1 5 The translation product of this gene shares sequence homology with collagen 

which is important in mammalian development. This gene also shows sequence 
homology with bcl-2. (See Accession No. P80988.) Preferred polypeptide fragments 
comprise the amino acid sequence: PGRAGPSPGLSLQLPAEPGHPAGNLAPL 
TSRPQPLCRIPAVPG (SEQ ID NO:725). Also preferred are polynucleotide 

20 sequences encoding this polypeptide fragment. 

This gene is expressed primarily in HL-60 tissue culture cells and to a lesser 
extent in liver, breast, and uterus. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

25 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, inununological diseases, hereditary disorders involving the MHC class 
of inunune molecules, as well as developmental disorders and reproductive disorders. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 

30 type(s). For a number of disorders of the above tissues or cells, particularly of the 
mmiune and reproductive system expression of this gene at significandy higher or 
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 

35 the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include those 
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comprising a sequence shown in SEQ ID NO: 390 as residues: Ser-39 to Gly-46, Leu- 
49toAla-62. 

The tissue distribution and homology to collagen indicates that polynucleotides 
and polypeptides corresponding to this gene are useful for diagnosis and treatment of 
5 hereditary MHC disorders and particularly autoimmune disorders including rheumatoid 
arthritis, lupus, scleroderma, and dermatomyositis, as well as many reproductive 
disorders, including cancer of the uterus, and breast tissues. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 158 

10 This gene is expressed primarily in the amygdala region of the brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, a variety of brain disorders, particularly those effecting mood and 

15 personality. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the brain and central nervous system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 

20 tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

25 corresponding to this gene are useful for treatment and/or diagnosis of a variety of bram 
disorders, particularly bipolar disorder, unipolar depression, and dementia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 159 

This gene is expressed in a variety of tissues and cell types including brain, 
30 smooth muscle, kidney, salivary gland and T-cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present m a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, cancers of a variety of organs including brain, smooth muscle, kidney, 
35 salivary gland and T-cells and cardiovascular disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing inraiunological probes 
for differential identification of tiie tissue(s) or cell type(s). For a number of disorders 
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of the above tissues or cells, particularly of the central nervous, urinary, salivary, 
digestive, and immune systems, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
5 another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution in brain, smooth muscle, and T-cells indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis of 

10 various neurological, and cardiovascular disorders, but not limited to cancer within the 
above tissues. Additionally the gene product may be used as a target in the 
immunotherapy of the cancer. Because the gene is expressed in cells of lymphoid 
origin, the natural gene product may be involved in inmiune functions. Therefore it may 
be also used as an agent for inununological disorders including arthritis, asthma, 

IS inunune deficiency diseases such as AIDS, and leukemia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 160 

The translation product of this gene shares sequence homology with collagen 
which is thought to be important in cellular interactions, extraceUular matrix formation, 

20 and has been found to be an identifying determinant in autoimmune disorders. 
Moreover, this gene shows sequence homology with the yeast protein, Slslp, an 
endoplasmic reticulum component, involved in the protein translocation process in 
Yeast Yarrowia lipolytica. (See Accession No. 1052828; see also J. Biol. Chem. 271, 
1 1668-1 1675 (1996).) With mouse, diis same region shows sequence homology with 

25 the heavy chain of kinesin. (See Accession No. 2062607.) Recently, suppression of the 
heavy chain of kinesin was shown to inhibits insulin secretion from primary cultures of 
mouse beta-cells. (See Endocrinology 138 (5), 1979-1987 (1997).) Moreover, kinesin 
was found associated with drug resistance and cell inmiortalization. (See 468355.) 
Thus, it is likely that this gene also act as a genetic suppressor elements. 

30 This gene is expressed primarily in the greater omentum and to a lesser extent in 

a variety of organs and cell types including gall bladder, stromal bone marrow cells, 
lymph node, liver, testes, pituitary, and thymus. , 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

35 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, disorders of the endocrine, gastrointestinal, and inununological systems, 
including autoinunune disorders and cancers in a variety of organs and cell types. 
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Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune and gastrointestinal systems, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include those 
comprising a sequence shown in SEQ ID NO: 393 as residues: Asn-27 to Leu-47, Gln- 
81 to Lys-88, Asp-93 to Lys-102, Asn-107 to Leu-1 16, Met-129 to Glu-141, Glu-150 
to Asp-157, Lys-176 to Glu-185, Glu-333 to Tyr-349, Cys-393 to Leu-403, Gln-423 
to Gly-429. 

The tissue distribution in within various endocrine and inmiunological tissues 
combined with the sequence homology to a conserved collagen motif indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the 
diagnosis of various autoimmune disorders including, but not limited to, rheumatoid 
arthritis, lupus erthyematosus, scleroderma, dermatomyositis Because the gene is 
expressed in cells of lymphoid origin, the natural gene product may be involved in 
inmiune functions. Therefore it may be also used as an agent for inmiunological 
disorders including arthritis, asthma, immune deficiency diseases such ais AIDS, and 
leukemia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 161 

This gene has homology to the tissue inhibitor of metalloproteinase 2. Such 
inhibitors are vital to proper regulation of metalloproteins such as coUagenases (See 
Accession No. PI 6368). In addition, this gene maps to chromosome 17, and 
therefore, may be used as a marker in linkage analysis for chromosome 17 (See 
Accession No. P 1 6368). 

This gene is expressed primarily in several types of cancer including 
osteoclastoma, chondrosarcoma, and rhabdomyosarcoma and to a lesser extent in 
several non-malignant tissues including synovium, amygdala, testes, placenta. 

Therefore, polynucleotides and polypeptides of the invention are usefiil as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, various types of cancer, particularly cancers of bone and cartilage, as 
well as various autoimmune disorders. Similarly, polypeptides and antibodies directed 
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to these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the musculoskeletal system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
5 cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution in various cancers and the sequence homology to a 
10 collagenase inhibitor indicates that polynucleotides and polypeptides corresponding to 
this gene are useful for detection of various autoimmune disorders such as rheumatoid 
arthritis, lupus, scleroderma, and dermatomyositis. Therefore it may be also used as an 
agent for inununological disorders including arthritis, asthma, immune deficiency 
diseases such as AIDS, and leukemia. 

15 

FEATURES OF PROTEIN ENCODED BY GENE NO: 162 

This gene is homologous to the mitochondrial ATP6 gene and therefore is likely 
a homolog of this gene family (See Accession No. X76197). 
This gene is expressed primarily in brain tissue. 

20 Therefore, polynucleotides and polypeptides of the invention arc useful as 

reagents for differential identification of the lissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, a variety of brain disorders, including Down's syndrome, depression. 
Schizophrenia, and epilepsy. Similarly, polypeptides and antibodies directed to these 

25 polypeptides are useful m providing inununological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the central nervous system, expression of this gene at significantly higher 
or lower levels may be routinely detected in certain tissues (e!g., cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 

30 fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution in brain tissue indicates this gene is useful for diagnosis 
of various neurological disorders including, but not limited to, brain cancer. 

35 Additionally the gene product may be used as a target in the immunotherapy of cancer in 
the brain as well as for the diagnosis of metabolic disorders such as obesity Tay-Sachs 
disease, phenylketonuria and Hurler's Syndrome. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 163 

This gene is expressed primarily in placenta, neutrophils, and microvascular 
endothelial cells and to a lesser extent in multiple tissues including brain, prostate, 
5 spleen, thymus, and bone. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, neutropenea and other diseases of the immune system. Similarly, 

10 polypeptides and antibodies directed to these polypeptides are useful in providing 

immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the inunune system, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 

15 plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution in placenta indicates that polynucleotides and 
20 polypeptides corresponding to this gene are useful for diagnosis various female 

reproductive disorders. Additionally the gene product may be used as a target in the 
immunotherapy of various cancers. Because the gene is expressed in some cells of 
lymphoid and endocrine origin, the natural gene product may be involved in immune 
functions and metabolism regulation, respectively. Therefore it may be also used as an 
25 agent for immunological disorders including arthritis, asthma, immune deficiency 
diseases such as AIDS, and leukemia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 164 

This gene is expressed primarily in neutrophils, monocytes, bone marrow, and 
30 fetal liver. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, immune system disorders including, but not limited to, autoinunune 
35 disorders such as lupus, and immunodeficiency disorders . Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
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of the above tissues or cells, particularly of the immune system, expression of this gene 
at significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
5 such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution in various immune system tissue indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the 
diagnosis of various immunological disorders such as Hodgkin's lymphoma, arthritis, 
10 asthma, immune deficiency diseases such as AIDS, and leukemia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 165 

The translation product of this gene shares sequence homology with dystrophin 
which is thought to be defective in both Duchene and Becker Muscular Dystrophy. 

15 Preferred polypeptide fragments comprise the following aniino acid sequence: 

MKLLGECSSSroSVKRLEHKLKEEEESLPGFVNLHSTETQTAGVroRWELLQAQ 
Al^KEUlMKQl^QKWQQFNSDU^SIWAWLQDTE^ 
IKKLKELQKAVDHRKAIIl^INLCSPEFTQADSKESRDLQDRIJCQM^ 
CSLLEEWRGLLQDALMQCQGFHEMSHGLlXMLENroRRKNEIWmSN^ 

20 QDHHKQLMQIKHELLESQLRVASLQDMSCQLLVNAEGTDCLEAKEKVHVIGNR 
LKLLLKEVSRHIKELEKLLDVSSSQQDLSSWSSADELDTSGSVSPXSGRSTPNR 
QKTPRGKCSLSQPGPSVSSPHSRSTKGGSDSSLSEPXPGRSGRGFLFRVLRAA 
LPLQLLLLLLIGLACLVPMSEEDYSCALSNNFARSFHPMLRYTNGPPPL (SEQ ID 
NO:726); MKLLGECSSSIDSVKRLEHKLKEEEESLPGFVNLHSTETQTAGVIDR 

25 WELLQAQAI^KELRMKQNLQKWQQFNSDU^SIWAWLGDTEEEI^QLQRl^I^ 
TDIQTIELQIK (SEQ ID NO:727); KLKELQKAVDHRKAIILSINLCSPEFTQADSK 
ESRDLQDRUCQMNGRWDRVCSIXEEWGliQDALMQCQGFHEMS 
ENroRRKNEIWIDSNLDAEILQDHHKQLMQIKHELI^QLRVASLQDMSC^^ 
(SEQ ID NO:728); QDMSCQliVNAEGTIXI£AKEKVHVIGNIU-KLLLKEVS 

30 RHIKELEKLLDVSSSQQDLSSWSSADELDTSGSVSPXSGRSTPNRQKTPRGKCS 
LSQPGPSVSSPHS (SEQ ID NO:729); DSSLSEPXPGRSGRGFLFRVLRAAL 
PLQLIXLmOLACLVPMSEEDYSCALSNNFARSFHPMLRYTNGPPPL (SEQ ID 
NO:730). Also prefened are polynucleotide fragments encoding these polypeptide 
fragments. Furthermore, this gene maps to chromosome 6, and therefore, may be used 

35 as a marker in linkage analysis for chromosome 6 (See Accession No. N62896). 

This gene is expressed in numerous tissues including the heart, kidney, and 

brain. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not linuted to, musculoskeletal disorders including Muscular Dystrophy and 
5 cardiovascular diseases. Similarly, polypeptides and antibodies directed to these 

polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or ceU type(s). For a number of disorders of the above tissues or cells, 
particularly of the muscle tissues, expression of this gene at significandy higher or 
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 

10 tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
die standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution and homology to dystrophin indicates that 

15 polynucleotides and polypeptides corresponding to this gene are useful for the 
diagnosis and treatment of Muscular Dystrophy and other muscle disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 166 

This gene is expressed primarily in human cerebellum. 

20 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, diseases of the central nervous system, including Alzheimer's Disease, 
Parkinson's Disease, ALS, and mental illnesses. Similarly, polypeptides and antibodies 

25 directed to these polypeptides are useful in providing inmiunological probes for 

differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the central nervous system, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 

30 synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healtiiy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 399 as residues: Pro-20 to Gly-26, Leu-37 to Pro-42. His-57 to Gly-63. 

35 The tissue distribution indicates that the protein products of this gene are useful 

for treatment/diagnosis of diseases of the central nervous system and may protect or 
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enhance survival of neuronal cells by slowing progression of neurodegenerative 
diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 167 

Preferred polypeptides encoded by this gene comprise the following amino acid 
sequence: 

MKLLICGNYLAPSHSESSRRCCLLCFYPLCLEINFGMKVFLSMPFLVLFQ 
SLIQED (SEQ ID NO:731). Polynucleotides encoding such polypeptides are also 
provided. This gene is believed to reside on chromosome 15. Therefore polynucleotides 
derived from this gene are useful in linkage analysis as chromosome 15 markers. 

This gene is expressed primarily in human testes tumor and to a lesser extent in 
normal human testes. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagentsTor differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, diseases of the testes, particularly cancer, and other reproductive 
disorders. Similarly, polypeptides and antibodies directed to tiiese polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the male reproductive tissues, expression of this gene at significantiy higher or lower 
levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that the protein products of this gene are useful 
for treatment/diagnosis of testicular diseases including cancers. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 168 
This gene is expressed primarily in fetal liver. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of ttie tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, conditions affecting hematopoietic development and metabolic diseases. 
Similarly, polypeptides and antibodies directed to these polypeptides aie useful in 
providing inmiunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
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hepatic system, and fetal hematopoietic system, expression of this gene at significantly 
higher or lower levels may be routinely detected in certain tissues (e.g., cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
5 relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. Preferred epitopes include 
those comprising a sequence shown in SEQ ID NO: 401 as residues: His-7 to Trp-17, 
Leu- 19 to Lys-27, Pro-33 to Gly-44, Lys-68 to Gly-74, Lys-SS to Cys-95, 

The tissue distribution indicates that the protein products of this gene are useful 
10 for treatment/diagnosis of diseases of the developing liver and hematopoietic system, 
and act as a growth differentiation factor for hematopoietic stem cells. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 169 

The polypeptide encoded by this gene is believed to be a membrane bound 
1 5 receptor. The extracellular domain of which is expected to consist of the following 
amino acid sequence: 

RnXVKYSANEENKYDYUTI^NVCSELVKLWCVLVSFCV^^ 

ASWKEFSDFMKWSIPAFLYFLDNLIVFYVl^YLQPAMAVn^^ 

LKXRLNWIQWASLLTLFLSIVALTAGTKTLQHNLAGRGFHHDAFFSPSNSCLL 

20 FRNECPRKDNCTAKEWTFPEAK\^^S^TTARVFSHIRLGMGHVL^VQC^SS^^ 
YNEKILKEGNQLTEXIFIQNSKLYFFGILFNGLTLGLQRSNRDQIKNCGFFYGH 
S (SEQ ID NO:732). Thus, preferred polypeptides encoded by this gene comprise the 
extracellular domain as shown above. It will be recognized, however, that deletions of 
either end of the extracellular domain up to the first cysteine from the N-terminus and 

25 the first cysteine of the C-terminus, is expected to retain the biological ftmctions of the 
full-length extracellular domain because the cysteines are thought to be responsible for 
providing secondary structure to the molecule. Thus, deletions of one or more amino 
acids from either end (or both ends) of the extracellular domain are contemplated. Of 
course, further deletions including the cysteines are also contemplated as useful as such 

30 polypeptides is expected to have inmiunological properties such as the ability to evoke 
and inmiune response. Polynucleotides encoding all of the foregoing polypeptides are 
provided. 

This gene is expressed primarily in human osteoclastoma and to a lesser extent 
in hippocampus and chondrosarcoma. 
35 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
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not limited to, cancers, particularly those of the bone and connective tissues. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of die tissue(s) or cell type(s). For 
a nimiber of disorders of the above tissues or cells, particularly of the skeletal system, 
expression of this gene at significantiy higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
tile expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 402 as residues: Met-1 to Cys-6, Ala-41 to Tyr-49, Lys-76 to Lys-84. 

The tissue distribution indicates that the protein products of this gene are useful 
for diagnosis of cancers of the bone and connective tissues, and may act as growth 
factors for cells involved in bone or connective tissue growth. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 170 

Preferred polypeptides encoded by tiiis gene comprising the following anuno 
acid sequence: 

NSVPNLQTLAVLTEAIGPEPAIPRXPREPPVATSTPATPSAGPQPLPTGTV 
LVPGGPAPPCLGEAWALLLPPCRPSLTSCFWSPRPSPWKETGV (SEQ ID 
NO:733). Polynucleotides encoding such polypeptides are also provided herein. 

This gene is expressed primarily in hematopoietic progenitor cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, diseases of the blood including cancer and autoinunune disorders. 
Similarly, polypeptides and antibodies directed to tiiese polypeptides are useful in 
providing mmiunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
blood/circulatory system, expression of tiiis gene at significantiy higher or lower levels 
may be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having tiie disorder. Preferred epitopes include those 
comprising a sequence shown in SEQ ID NO: 403 as residues: Gln-4 to His-10, Pro-25 
to His-32. 
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The tissue distribution indicates that the protein products of this gene are useful 
for diagnosis of diseases involving growth differentiation of hematopoietic cells. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 171 
5 Preferred polypeptides encoded by this gene comprise the following amino acid 

sequences: ALQLAFYPDAVEEWLEENVHPSLQRLQXLLQDLSEVSAPP (SEQ ID 
NO:734); and/or CHPPALAGTLLRTPEGRAHARGLLLEAGGA (SEQ ID NO:735). 
Polynucleotides encoding such polypeptides are also provided. The protein product of 
this gene shares sequence homology with metallothionines. Thus, polypeptide encoded 

10 by this gene are expected to have metallothionine activity, such activities are known in 
the art and described elsewhere herein. 

This gene is expressed primarily in kidney cortex. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

15 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, diseases of the kidney including cancer and renal dysfunction. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the renal system, 

20 expression of this gene at significandy higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 

25 disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 404 as residues: Ser-47 to Gln-52. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to tiiis gene are useful for treatment/diagnosis of diseases of the kidney 
including kidney failure. 

30 

FEATURES OF PROTEIN ENCODED BY GENE NO: 172 

This gene is expressed primarily in 1 2 week old early stage human. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
35 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, developmental abnormalities. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
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differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the developing embryo, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 
5 synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 405 as residues: Gln-31 to Thr-43, Gly-51 to Ser-58, Pro-65 to Pro-72. 
10 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for treatment/diagnosis of developmental 
problems with fetal tissue. The gene may be involved in vital organ development in the 
early stage, especially hematopoiesis, cardiovascular system, and neural development. 

15 FEATURES OF PROTEIN ENCODED BY GENE NO: 173 

The translation product of this gene shares sequence homology with TGN38, an 
integral membrane protein previously shown to be predominantly localized to the trans- 
Golgi network (TON) of cells. 

This gene is expressed primarily in developing embryo and to a lesser extent in 

20 cancer tissues including lymphoma, endometrial, protate and colon. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, developmental abnormalities and cancer. Similarly, polypeptides and 

25 antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the developing fetus, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 
(e.g., cancerous and wounded tissues) or bodily fluids (e.g.,, serum, plasma, urine, 

30 synovial fluid or spinal fluid) or another tissue or cell sample taken from aii individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those con^rising a sequence shown in SEQ ID 
NO: 406 as residues: His-65 to Ser-72, Pro-82 to Gly-91, Pro-98 to Glu-1 18, Ser-126 

35 to Gly-166, Pro-180 to Asp-188, Tyr-209 to Lys-214, Gln-220 to Leu-228. 

The tissue distribution and homology to an integral membrane protem indicates 
that polynucleotides and polypeptides corresponding to this gene are useful for 
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diagnosis of cancers and developmental abnormalities where aberrant expression relates 
to an abnormality. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 174 

5 The translation product of this gene shares sequence homology with a dnaJ heat 

shock protein from E. coli which is allelic to sec63, a gene that affects transit of nascent 
secretory proteins across the endoplasmic reticulum in yeast. 

This gene is expressed primarily in Hodgkin's lymphoma and to a lesser extent 
in testes. 

10 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, cancer. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing inmiunological probes for differential identification 

15 of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken frohi an individual having such a disorder, relative to 

20 the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include those 
comprising a sequence shown in SEQ ID NO: 407 as residues: Thr-13 to Trp-21, Arg- 
74toAsp-81. 

The tissue distribution and homology to dnaJ indicates that polynucleotides and 
25 polypeptides corresponding to this gene are useful as a diagnostic for cancer including 
Hodgkin's lymphoma. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 175 

This gene is expressed primarily in endothelial cells and to a lesser extent in 
30 bone marrow stromal cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, diseases involving angiogenic abnormalities including diabetic 
35 retinopathy, macular degeneration, and other diseases including arteriosclerosis and 
cancer. Similarly, polypeptides and antibodies directed to these polypeptides are useful 
in providing immunological probes for differential identification of the tissue(s) or cell 
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lype(s). For a number of disorders of the above tissues or cells, particularly of the 
vascular system, expression of this gene at significantiy higher or lower levels may be 
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
5 cell sample taken from an individual having such a disorder, relative to the standard 
gene expression levels i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that the protein products of this gene are useful 
for treating diseases where an increase or decrease in angiogenesis is indicated and as a 
10 factor in the wound healing process. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 176 

The translation product of this gene shares sequence homology with MATS 
(mouse) which is thought to be important in regulating chloride conductance in cells 

1 5 (particularly in the breast) by modulating the response mediated by c AMP and protein 
kinase C to extracellular signals. 

This gene is expressed primarily in amniotic cells and hematopoeitic ceils 
including macrophages. Neutrophils, T cells, TNF induced aortic endothelium and to a 
lesser extent in testes, TNF induced epithelial cells, and smooth muscle. 

20 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, inflanmiatory responses mediated by T cells, macrophages, and/or 
neutrophils particularly those involving TNF, and also cancer. Similarly, polypeptides 

25 and antibodies directed to these polypeptides are useful in providing inmiunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the immune system, expression of 
this gene at significantly higher or lower levels may be routinely detected in certain 
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 

30 urine, synovial fluid or spinal fluid) or another tissue or cell sanq)le taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 409 as residues: Thr-19 to Ala-33. Leu-54 to Asp-82, Pro-89 to Ala-97, Pfo-100 

35 to Lys-125, Ser-127 to Phe-135, Gly-164 to Leu-169. Cys-173 to Arg-178. 

The tissue distribution and homology to mat-8 indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for modifying inflammatory 
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responses to cytokines such as TNF and thus modijfying the duration and/or severity of 
inflanunation. Polynucleotides and polypeptides derived from this gene are thought to 
be useful in the diagnosis and treatment of cancer. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 177 

This gene is expressed primarily in endothelial cells. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

10 not linndted to, vascular restenosis. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the vascular system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 

15 cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 
The tissue distribution indicates that polynucleotides and polypeptides 

20 corresponding to this gene are useful for treating diseases associated with vascular 
response to injury such as vascular restenosis following angioplasty.. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 178 
One embodiment of the claimed invention comprises: 

25 MRPDWKAGAGPGGPPQKPAPSSQRKPPARPSAAAAAIAVAAAEEERRLRQRN 
RLRLEEDKPAVERCLEELVFGDVENDEDALLRRLRGPRVQEHEDSGDSEVENEA 
KGNFPPQKKPVWVDEEDEDEEMVDMMNNRFRKDMMKi^ASESK^ 
RLKEEFQHAMGGVPAWAETTKRKTSSDDESEEDEDDLLQRTGNHSTSTSLPRG 
ILKMKNCQHANAERPTVARISICAVPSRCTDCDGCWD (SEQ ID NO:737); or 

30 CI^ELWGDVENDEDALLRRLRGPRVQEHEDSGDSEVENEAKGNFPPQKKPV 
WVDEEDEDEEMVDMMNNRFRKDMMKNASESKIJSKDNL 
GWAWAETTKRKTSSDDESEEDEDDLLQRTGNHSTSTSLPRGILKMKNCQHA 
NAERPTVARISICAVPSRCTDCDGC (SEQ ID NO: 738). LKEKIVRSFEVSPDGS 
FLLINGIAGYLHLLAMKTKELIGSMKINGRVAASTFSSDSKKVYASSGDGEV^ 

35 WDVNSRKCLNRFVDEGSLYGLSIATSRNGQYVACGSNCGVVNIYNQDSCLQE 
TNPmKAIMNLWGWSLTFNPTTEIIJS^IASEKMKE 

KNKNISHVHTMDFSPRSGYFALGNEKGKALMYRLHHYSDF (SEQ ID NO:739); 
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and/or KINGRVAASTFSSDSKKVYASSGDGEVYVWDVNSRKCLNRFVDEGSL 
YGLSIATSRNGQYVACGSNCGVWIYNQDSCLQETNPmKAIMN^ 
FNPTTEILAIASEKMKEAVRLVHLPSCTWSNFPVIKNK^ 
YFALGNEKGKAL (SEQ ID NO:740). 

This gene is expressed primarily in epidydimus and endometrial tumors and to a 
lesser extent in T cell lymphoma and cell lines derived from colon cancer. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, tumors of the reproductive organs including testis and endometrial cells. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing inmiunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
reproductive system, expression of this gene at significantly higher or lower levels may 
be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. Preferred epitopes include those comprising a 
sequence shown in SEQ ID NO: 41 1 as residues: Ser-67 to Lys-72, Val-87 to Leu-93, 
Tyr-128 to Pro-141, Asp-204 to Gly-210. 

The tissue distribution indicates that the protein products of this gene are, useful 
for treating tumors of the endometrium or epithelial tumors of the reproductive system. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 179 

Preferred polypeptides encoded by this gene comprise the following amino acid 
sequence: 

MRH-QLILLALATGLVGGETRnKGFECKLHSQPWQAAUTEKTRLLCGATLI^^ 

WLLTAAHCLKPRYrVHLGQHNLQKEEGCEQTRTATESFPHPGFNNSLPNK^ 

RNDIMLVigviASPVSITWAVRPLTLSSRCVTAGTSCSFPAG;^^ 

DAPTSPSLSTRSWTPTPATSQTPWCWACRKGARTPARWPGALWSWSIJ 

LSPGARIRVRSPESLVSTRKSANMWTGSRRR (SEQ ED NO:741); ETRHKGFEC 

KLHSQPWQAAMKTRLLCGATLIAPRWIiTAAHCLKPRYrVHLGQ™ 

GCEQTRTATESFPHPGFNNSLPNKDHRNDIMLVKMASPVSrm 

CVTAGTSCSFPAGAARPDPSYACLTPCDAPTSPSLSTRSVRTPTPATSQTPWCVP 

ACRKGARTPARVTPGALWSVTSIJKALSPGARIRVRSPESLVSTRKSANMW^ 
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SRRR (SEQ ID NO:742); or CKLHSQPWQAALFEKTRLLCGATLIAPRWLLT 
AAHCLKPRYIVHLGQHNLQKEEGCEQTRTATESFPHPGFNS 
(SEQ ID NO:743). The translation product of this gene shares sequence homology 
with neuropsin a novel serine protease which is thought to be unportant in modulating 
5 extracellular signaling pathways in the brain. Owing to the structural similarity to other 
serine proteases the protein products of this gene are expected to have serine protease 
activity which may be assayed by methods known in the art and described elsewhere 
herein. 

This gene is expressed primarily in endometrial tumor and to a lesser extent in 

10 colon cancer, benign hypertrophic prostate, and thymus. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, cancers of the endometrium or colon and benign hypertrophy of the 

15 prostate. Similarly, polypeptides and antibodies directed to these polypeptides are 

useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the urogenital or reproductive systems, expression of this gene at significantly higher or 
^ lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 

20 tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include those 
comprising a sequence shown in SEQ ID NO: 412 as residues: Gly-12 to Ser-22, Pro- 

25 34 to Ser-53. 

The tissue distribution and homology to serine proteases indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for diagnosing 
or treating hperproliferative disorders such as cancer of the endometrium or colon and 
hyperplasia of the prostate. 

30 

FEATURES OF PROTEIN ENCODED BY GENE NO: 180 

Preferred polypeptide encoded by this gene comprise the following amino acid 
sequence: VLQGRYFSPILEMRRLRPEGXXNLPGGSRAQKEPRQDLTLVLWPHC 
PHFAMTRSYVPTKQCMVQGSFYCIFIFKGPVQNWC (SEQ ID NO:744). 
35 Polynucleotides encoding such polypeptide are also provided. 
This gene is expressed primarily in fetal brain 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, identifying and expanding stem cells in the CNS. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the nervous system, expression of 
this gene at significantly higher or lower levels may be routinely detected in certain 
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that the protein products of this gene are useful 
for detecting and expanding stem cell populations in the (or of the) central nervous 
system. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 181 

This gene is expressed primarily in early stage human brain and a stromal cell 

line. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, developmental abnormalities of the CNS. Similarly, polypeptides and . 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the central nervous system, expression of 
this gene at significantiy higher or lower levels may be routinely detected in certain 
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
tiie expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 414 as residues: Gln-42 to Gln-47, Gln-54 to Pro-60. 

The tissue distribution indicates that the protein products of this gene play a role 
in the development of the central nervous system. Therefore this gene and its products 
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are useful for diagnosing or treating developmental abnormalities of the central nervous 
system. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 182 
5 Preferred polypeptides encoded by this gene comprise the following amino acid 

sequence: 

MPinXJVNPELHDFMQSAEVGTIFAl^WLITWGHVl^DFRHVV^^ 

FLACHPLMPIYFAAVIVLYREQEVLDCDCDMASVHHLl^QIPQDLPYEm 

TFLFSFPHPNLLGRPLPNSKLRGRQPLLSKTLSWHQPSRGLIWCCGSGXRGLL 

10 RPEDRTKD\a.TKPRTNRFVKLAVMGLTVALGAAALAVVKSALEWAPKF^ 
FP (SEQ ID NO:745); or CPEFFIPATLPCPFVFAFTSEASSRAYLTQRGPGGLAQ 
NLMPLPVGFWMGSLPPPWCWRKWVSEACSCFC (SEQ ID NO:746) These 
polypeptides are structurally similar to various TGF-beta family members. Thus, this 
polypeptide is expected to have a variety of activities in the modulation of cell growth 

15 and proliferation. 

This gene is expressed primarily in osteoclastoma, microvascular endothelium, 
and bone marrow derived cell lines. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

20 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, hematological diseases particularly involving aberraiit proliferation of 
stem cells. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 

25 the immune system, expression of this gene at significantly higher or lower levels may 
be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 

30 individual not having the disorder. Preferred epitopes include those comprising a 
sequence shown in SEQ ID NO: 415 as residues: Ser-33 to Ala-39. 

The tissue distribution indicates that the protein products of this gene is useful 
for treating disorders of the progenitors of the immune system. Applications include in 
vivo expansion of progenitor cells, ex vivo expansion of progenitor cells, or the 

35 treatment of tumors of the circulatory system, such as lymphomas. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 183 

This gene maps to chromosome 17 and therefore, polynucleotides of the 
invention can be used in linkage analysis as a marker for chromosome 17. In specific 
embodiments, polypeptides of die invention comprise the sequence: 
5 GFGS VSAAGRRSGGTWQPVQ (SEQ ID NO:747): PGGLAVGSRWWSRSLT 
(SEQ ID NO:748); LEPSRQRRPRRRGGTSRPETDQRAKCWRQL (SEQ ID 
NO:749); and/or VCLRCQNRMEN (SEQ ID NO:750). In further specific 
embodiments, polypeptides of the invention comprise the sequence: MAACTARRPGR 
GQPLWPVADXGPVAKAALCAAXAGAFSPASTTTTRRHLSSRNRPEGKVLETV 
10 GXO^WKQNGKYETGQLFLHSIFGYRGVVLFPWQARLXDRDVASAAPEKAEN 
PAGHGSKEVKGKTHTYYQVLIDARDCPHISQRSQTEAVTFLANHDDSRALYAIP 
GLDYVSHEDILPYTSTDQWIQHELFERI^LYDQTKAPPFVARETLRAWQEKNH 
PWl^LSDVHRETreNmVTVIPFYMGMREAQNSHVYWW 

LRERHWRIFSLSGTLETVRGRGWGREPVLSKEQPAFQYSSHVSLQASSGHMW 

15 GTFRFERPDGSHFDVRIPPFSLESNKDEKTPPSGLHW (SEQ ID NO:751); 
MAACTARRPGRGQPLVVPVADXGPVAKAALCAA (SEQ ID NO:752); 
VLETVGVFEVPKQNGKYETGQLFLHSIFGYRGVVL (SEQ ID NO:757); 
GLDYVSHEDILPYTST (SEQ ID NO:758); DVHRETTENIRVTVPFYM (SEQ ID 
NO:759); WRYCIRLENLDSDVVQLRER (SEQ ro NO:760); and/or P^^ 

20 HVSLQASSGHMWGTFRFER(SEQIDNO:761). Polynucleotides encoding these 
polypeptides are also encompassed by the invention. 

This gene is expressed primarily in gall bladder, prostate, and fetal brain, and to 
a lesser extent in a few tumor and fetal tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

25 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, growth related disorders such as cancers. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 

30 of the above tissues or cells, particularly of the prostate, gall bladder, and fetal brain, 
expression of this gene at significanfly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 

35 tiie expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 
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The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to tiiis gene are useful for diagnosis and treatment of growth-related 
disorders, such cancers. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 184 

In specific embodiments, polypeptides of the invention comprise the 
sequence:SLCCPEGAEGC (SEQ ID NO:762) and/or QLKKTHYDRPCP (SEQ ID 
.NO:763). Polynucleotides encoding tiiese polypeptides are also encompassed by the 
invention. 

10 This gene is expressed primarily in stromal cell, tonsil, and glioblastoma and to 

a lesser extent in some other tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

15 not limited to, immune and inflammatory disorders and glioblastoma. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providkig 
inmiunological probes for differential identification of die tissue(s) or cell type(s). For 
a number of disorders of tiie above tissues or cells, particularly of the stromal cells, 
tonsil, and glioblastoma expression of this gene at significantly higher or lower levels 

20 may be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or anotiier 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Additionally, it is believed that die 

25 product of this gene regulates pancreatic cell differentiation into beta cells. Accordingly, 
polynucleotides and polypeptides of the invention are useful in die treatment of insulin- 
dependent diabetes niellitus and associated conditions e.g. pancreatic hypofunction and 
the prevention, as well as the treatment of undifferentiated type pancreatic cancers. 
Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 417 as 

30 residues: Pro-27 to Ala-32. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of immune and 
inflanmiatory disorders and glioblastoma. 



35 



FEATURES OF PROTEIN ENCODED BY GENE NO: 185 

This gene is expressed primarily in hepatocellular carcinoma and to a lesser 
extent in other tissues. 



wo 98/54963 r PCTAJS98/11422 

145 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, liver diseases. Similarly, polypeptides and antibodies directed to these 
5 polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the liver, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
10 tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include those 
comprising a sequence shown in SEQ ID NO: 418 as residues: Gly-32 to Lys-39. 

The tissue distribution indicates that polynucleotides and polypeptides 
1 5 correspondmg to this gene are useful for diagnosis and treatment of liver diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 186 

This gene is expressed primarily in hippocampus and to a lesser extent in other 

tissues. 

20 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, neutronal disorders. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 

25 identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the hippocampus, expression of this gene at significantly 
higher or lower levels may be routinely detected in certain tissues (e g., cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 

30 relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene arc useful for diagnosis and treatment of neuronal disorders. 



35 



FEATURES OF PROTEIN ENCODED BY GENE NO: 187 

This gene is expressed primarily in bone cancer and hippocampus and to a 
lesser extent in osteoclastoma and other tissues. 
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Therefore, polynucleotides and polypeptides of the invention are usefiil as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, bone-related disorders and neuronal diseases. Similarly, polypeptides 
5 and antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the bone, ostoeclast, and 
hippocampus, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 

10 fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

15 corresponding to this gene are usefiil for diagnosis and treatment of bone-related 
disorders and neuronal diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 188 

This gene maps to chromosome 4 and therefore polynucleotides of the invention 
20 can be used in linkage analysis as a marker for chromosome 4. 

This gene is expressed primarily in neuronal tissues such as hippocampus, 
spinal cord, and hypothalamus and to a lesser extent in a few other tissues such as 
ovary. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
25 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, neuronal diseases. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful m providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
30 tissues or cells, particularly of the neuronal tissues, expression of this gene at 

significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken fix)m an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
35 in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of neuronal disorders. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 189 

This gene maps to chromosome 10, therefore, polynucleotides of the invention 
can be used in linkage analysis as a marker for chromosome 10. 
5 This gene is expressed primarily m neuronal tissues and immune tissues, and to 

a lesser extent in a few other tissues such as skin tumor, lung etc. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

10 not limited to, neuronal and immune-related disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing inmiunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the neuronal and immune-related tissues, 
expression of this gene at significantly higher or lower levels may be routinely detected 

15 in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not havmg the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 

20 NO: 422 as residues: Pro- 1 9 to Asp-25. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of neuronal and 
inunune-related disorders. 

25 FEATURES OF PROTEIN ENCODED BY GENE NO: 190 

The translation product of this gene shares sequence homology with human 
N33, a gene located in a homozygously deleted region of human metastatic prostate 
cancer which is thought to be important in prevention of prostate cancer. In specific 
embodiments, polypeptides of the invention comprise the sequence: 

30 AQRKKEMVI^EKVSQIJVlEWTNKRPXaRMNGDKF^ 
LQLHRQCWCKQADEEFQILANSWRYSSAFIMIIFFAM 
NSAPTFINIYAKGKPKRGDTYELQVRGFSAEQIARWIADRTDVm 
ARWRFWCVSVT (SEQ ID NO:765); MVVALLIVCDVPSAS (SEQ ID NO:766); 
AQRKKEMVLSEKVSQL (SEQ ID NO:767); MEWTNKRPVIRMNGDKF (SECj 

35 ID:768);RRLVKAPPRNYSVIVMFTALQliIRQCWCKQADEEFQI^ 

SSAFTNRIFFA (SEQ ID NO:769); MVDFDEGSDVFQMLNMNSAPTFINFPAK 
GKP (SEQ ID NO:770); KRGDTYELQVRGFSAEQIARWIADRTDVNIRVIRPPN 
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(SEQ ID NO:77 1); and/or YAGPimGU.LAVIGGLVYlJlRVIWNFSLIKLDGLLQL 
CVLCLL (SEQ ID NO:772). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

This gene is expressed primarily m infant adrenal gland prostate cell line and to 
5 a lesser extent in a few other tissues like liver, smooth muscle etc. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, prostate cancer and endocrine disorders. Similarly, polypeptides and 

10 antibodies directed to these polypeptides are useful in providing inrununological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the prostate and adrenal gland, expression 
of this gene at significantly higher or lower levels may be routinely detected in certain 
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 

15 urine, synovial fluid or spmal fluid) or another tissue or cell sample taken firom an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid fi-om an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 423 as residues: Pro-34 to Gly-43, Arg-1 13 to Pro-120. 

20 The tissue distribution and homology to N33 indicates that polynucleotides and 

polypeptides corresponding to this gene are useful for diagnosis and treatment for 
prostate cancer and endocrine disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 191 
25 This gene is expressed primarily in T cell and to a lesser extent in fetal lung. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, immune disorders. Similarly, polypeptides and antibodies directed to 
30 these polypeptides are useful in providing immunological probes for differential 

identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune, expression of this gene at significantiy 
higher or lower levels may be routinely detected in certain tissues (e.g., cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
35 fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e.. the expression level in healthy tissue 
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or bodily fluid from an individual not having the disorder. Preferred epitopes.include 
those comprising a sequence shown in SEQ ID NO: 424 as residues: Trp-3 to Phe-9. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of immune disorders. 

5 

FEATURES OF PROTEIN ENCODED BY GENE NO: 192 

This gene maps to chromosome 6, therefore, polynucleotides of the invention 
can be used in linkage analysis as a marker for chromosome 6. Neural activity and 
neurotrophins induce synaptic remodeling in part by altering gene expression. This 

10 gene is believed to be a glycosylphoshatidylinositol-anchored protein encoded by a 
hippocampal gene and to possess neural activity. This molecule is believed to be 
expressed in postmitotic-differentiating neurons of the developing nervous system and 
neuronal structures associated with plasticity in the adult. Message of this gene is 
believed to be induced by neuronal activity and by the activity-regulated neurotrophins 

15 BDNF and NT-3. The product of this gene is believed to stimulate neurite outgrowth 
and arborization in primary embryonic hippocampal and cortical cultures and to act as a 
downstream effector of activity-induced neurite outgrowth. In specific embodiments, ^ 
polypeptides of the invention comprise the sequence: DAVFKGFSDCLLKLGDS (SEQ 
ID NO:773); CQEGAKDMWDKLRKESKNLN (SEQ ID NO:774); 

20 VLLVSLSAALATWLSF (SEQ ID NO:775); MGLKLNGRYISLILAVQIAYLVQAVR 
AAGKCDAVFKGFSDCLLKLGDS (SEQ IDNO:776); PAAWDDKTNIKTVCTYW 
EDFHSCTVTALTDCQEGAKDMWDKLRKESKNUSIIQGSLFELCGSGN 
LPAFPVLLVSLSAALATWLSF (SEQ ID NO:777); and/or MGLKLNGRYISLILA 
VQIAYLVQAVRAAGKCDAVFKGFSDCLLKLGDSXXXXXPAAWDDKTNIKTVC 

25 TYWEDFHSCrVTALTDCQEGAKDMWDKLRKESKmNIQGSLFEL^ 

GSLLPAFPVLLVSLSAALATWLSF(SEQIDNO:778). Polynucleotides encoding 
this polypeptide are also encompassed by the invention. 

This gene is expressed primarily in human placenta, endometrial tumor and 
tissues of the central nervous system (CNS). 

30 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, relating to reproductive disorders, cancers and neurological diseases. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful m 

35 providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
reproductive and neurological disorders, expression of this gene at significantly higher 
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or lower levels may be routinely detected in certain tissues (e.g., cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i,e„ the expression level in healthy tissue 
5 or bodily fluid from an individual not having the disorder. Preferred epitopes include 
those comprising a sequence shown in SEQ ID NO: 425 as residues: Asp-47 to Asp- 
63, His-75 to Tyr-80, Pro-83 to Tyr-89. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of reproductive 
10 disorders such as endometrial tumors. Expression of this gene in tissues of the CNS 
and its strong homology to Neuritin suggest that the protein product from this gene may 
also be used in the treatment and diagnosis of neurological disorders and in the 
regeneration of neural tissues, e.g., following injury. 

15 FEATURES OF PROTEIN ENCODED BY GENE NO: 193 

The translation product of this gene shares sequence homology with tenascin 
which is thought to be important in development. The translation product of this gene is 
believed to be a ligand of the fibroblast growth factor family. FGF ligand activity is 
known in the art and can be assayed by methods known in the art and disclosed 
20 elsewhere herein. 

This gene is expressed primarily in endometrial tumors, and other types of 
tumors. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

25 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, cancers. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the cancer tissues, expression of this gene at significantly higher or lower 

30 levels may be routinely detected in certain tissues (e.g., cancerous and wounded 

tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include those 

35 comprising a sequence shown in SEQ ID NO: 426 as residues: Gly-29 to Glu-34, Arg- 
71 to Arg-76, Thr-176 to Cys-182, Gly-184 to Glu-199. 
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The tissue distribution and homology to tenascin indicates that polynucleotides 
and polypeptides corresponding to this gene are useful for diagnosis and treatment of 
cancers. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 194 

In specific embodiments, polypeptides of the invention comprise the sequence: 
MNSAAGFSHLDRRERVLKLGESFEKQPRCASTLC (SEQ ID NO:779). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 
This gene is expressed primarily in fetal human lung and neutrophils. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, lung development and respiratory disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing inmiunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the respiratory system, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution in fetal lung and neutrophils indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis 
and treatment of lung and immunity related diseases, for example, lung cancer, viral, 
fungal or bacterial infections (e.g. lesions caused by tuberculosis), inflammation (e.g. 
pneumonia), metabolic lesions etc. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 195 

This gene is expressed primarily in breast lymph node. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, immunal disorders. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
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significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
5 in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatment of inununal 
disorders. 

10 FEATURES OF PROTEIN ENCODED BY GENE NO: 196 

This gene maps to chromosome 5 and accordingly, polynucleotides of the invention can 
be used in linkage analysis as a marker for chromosome 5. The translation product of 
this gene shares sequence homology with human M-phase phosphoprotein 4 which is 
thought to be important in phosphorylation and signal transduction processes. In 

15 specific embodiments, polypeptides of the invention comprise tfie sequence: 
TIYPTEEELQAVQKIVSITERALKLVSD (SEQ ID NO:780); RALKGVLRV 
GVLAKGLLLRGDRNVNLVLLC (SEQ IDNO:78I); ALAALRHAKWFQARAN 
GLQSCVmRILRDLCQRVPTWS (SEQ ID NO:782); GDALRRVFECISSGIIL (SEQ 
ID NO:783); LAFRQIHKVLXjMDPLP (SEQ ID NO:784); and/or TIYPTEEELQAVQ 

20 KIVSITERALKLVSDSLSEHEKNKNKEGDDKKEGGKDRALKGVLRVGVLAKG 
LLLRGDRNVNLVLLCSEKPSKTLLSRIAENLPKQLAVISPEKYDIKCAVSEAAIIL 
NSC\^PmQVTrrLTSPiniEENMREGDWSGMVKDPPD\a.DRQKCLDALA^ 
HAKWFQARANGLQSCVmn.RDLCQRVPTWSDFPSWAMELLVEKAISSASSP 
QSPGDALRRVFECISSGIILKGSPGLLDPCEKDPFDTLATMTDQQREDITSSAQFA 

25 LRLLAFRQIHKVLGMDPLPQMSQRFNIHNNRKRRRDSDGVDGFEAEGKKDKK 
DYDNF (SEQ ID NO:785). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

This gene is expressed primarily in Human Hippocampus and to a lesser extent 
in Prostate, Human Frontal Cortex. 

30 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, disorders related to reproductive system and nervous system. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 

35 immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the reproductive 
system and nervous system, expression of this gene at significantly higher or lower 
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levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
5 fluid from an individual not having the disorder. 

The tissue distribution and homology to human M-phase phosphoprotein 4 
indicates that polynucleotides and polypeptides corresponding to this gene are useful for 
the diagnosis and treatment of reproductive and nervous system disorders. 

10 FEATURES OF PROTEIN ENCODED BY GENE NO: 197 

In specific embodiments, polypeptides of the invention comprise the sequence: 
MGSQHSAAARPSSCRRKQEDDRDG (SEQ ID NO:786); 
LLAEREQEEAIAQFPYVEFTGRDSrrCLTC (SEQ ID NO:787); and/or 
(2GTGYIPTEQVNELVALIPHSDQRLRPQRTKQYV (SEQ ID NO;788). 
15 Polynucleotides encoding these polypeptides are also encompassed by the invention. 

This gene is expressed primarily in Human Primary Breast Cancer and to a 
lesser extent in Human Adult Spleen, Hodgkin's Lymphoma I, Salivary Gland. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
20 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, cancer and immunal disorders. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of Uie tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the cancer and inmiune system, expression of 
25 this gene at significantly higher or lower levels may be routinely detected in certain 
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disbrder, relative to the standard gene expression level, i.e., 
the expression level m healthy tissue or bodily fluid from an individual not having the 
30 disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 430 as residues: Ser-126 to Gly-138. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatment of cancer and 
immunal disorders. 



35 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 198 
This gene is expressed primarily in monocytes^ 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, blood cell disorders. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid.or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatment of blood cell 
disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 199 

This gene is expressed primarily in Human Ovary and Synovia and to a lesser 
extent in Human 8 Week Whole Embryo. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, reproductive and developmental disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissuc(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the reproductive and developmental system, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual havmg such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatment of reproductive 
and developmental disorders. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 200 

This gene maps to chromosome 8 and therefore polynucleotides of the invention 

can be used in linkage analysis as a marker for chromosome 8. The translation product 
5 of this gene shares limited sequence homology with collagen proline rich domain. 
This gene is expressed primarily in CNS. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 

10 not limited to, neurological diseases. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing inmiunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the nervous system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 

15 cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to tiie standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disoixier. Preferred 
epitopes include those comprising a sequence shown in SEQ ID NO: 433 as residues: 

20 Pro-35 to Asp-41. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of neurological 
diseases. 

25 FEATURES OF PROTEIN ENCODED BY GENE NO: 201 

Translation product of this gene shares homology with a manunalian histone 
Hla protein. One embodiment for this gene is the polypeptide fragments comprising the 
following amino acid sequence: ARLNVGRESLKREMLKSQGVKVSESPMGAR 
HSSWPEGAAFCKKVQGAQMQFPPRR (SEQ ID NO:789); ARLNVGRESLKR 

30 EML (SEQ ID NO:790); LKSQGVKVSESPMGARHSSW (SEQ ID NO:791); 
AFCKK VQGAQMQFPPRR (SEQ ID NO:792). An additional embodiment is tiie 
polynucleotide fragments encoding these polypeptide (See Accession No. pu"IS24178) 
fragments. 

This gene is expressed primarily in neutrophils. 
35 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
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not limited to, immune disorders. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 

5 significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

10 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to. this gene are useful for diagnosis and treatment of immune disorders: 
Since the gene is expressed in cells of lymphoid origin, the natural gene product may be 
involved in vital immune functions. Therefore it may be also used as an agent for 
inunimological disorders including arthritis, asthma, immune deficiency diseases such 

IS as AIDS, and leukemia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 202 

This gene is expressed primarily in neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

20 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, immune disorders. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 

25 tissues or cells, particularly of the inmiune system, expression of this gene at 

significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 

30 in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of immune disorders. 
Since the gene is expressed in cells of lymphoid origin, the natural gene product may be 
involved in immune functions. Therefore it may be also used as an agent for 

35 inununological disorders including arthritis, asthma, immune deficiency diseases such 
as AIDS, and leukemia. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 203 

This gene is expressed primarily in Neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, infectious disorders, immune disorders, and cancers. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune system, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 436 as residues: Thr-31 to Lys-36. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of infectious 
disorders, immune disorders, and cancers. Since the gene is expressed in cells of 
lymphoid origin, the natural gene product may be involved in immune functions. 
Therefore it may be also used as an agent for immunological disorders including 
arthritis, asthma, immune deficiency diseases such as AIDS, and leukemia. Protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tumors and tissues. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 204 

This gene maps to chromosome 16 and therefore polynucleotides of the 
invention can be used in linkage analysis as markers for chromosome 16. The 
translation product of this gene shares sequence homology with lactate dehydrogenase 
which is thought to be important in lactate metabolism. 

This gene is expressed primarily in human tonsils and to a lesser extent in 
Spleen, and Neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, inunune disorders, infectious disorders, and cancers. Similarly, 
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polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune 
disorders, infectious disorders, and cancers, expression of this gene at significantly 
5 higher or lower levels may be routinely detected in certain tissues (e.g., cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. Preferred epitopes include 
10 those comprising a sequence shown in SEQ ID NO: 437 as residues: Gly-7 to Ser-12. 

The tissue distribution and homology to lactate dehydrogenase gene indicates 
that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis and treatment of immune disorders, infectious disorders, and cancers. 

15 FEATURES OF PROTEIN ENCODED BY GENE NO: 205 

The translation product of this gene shares sequence homology with Gcapl 
protein which is developmentally regulated in brain. 

This gene is expressed primarily in placenta and endometrial tumor and to a 
lesser extent in several other tumors. 

20 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, vasculogenesis/angiogenesis and tumorigenesis. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing immunological 

25 probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the vascular system and tumors, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 

30 an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution and homology to Gcapl protein indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis 
35 and treatment of disorder or dysfunction of vascular system of tumorigenesis. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 206 

In specific embodiments, polypeptides of the invention comprise the sequence 
MPYAQWLAENDRFHEAQKAFHKAGRQREA (SEQ ID NO:799); 
VQX^EQLTNNAVAESRFNDAAYYYWMLSMQCLDIAQD (SEQ ID NO:794); 
5 PAQKDTMLGKFYHFQRLAELYHGYHAIHRHTEDP (SEQ ID NO: 795); 
FSVHRPETLFNISRFLLHSLPKDTPSGISKVKILFT (SEQ ID NO:800); 
LAKQSKALGAYRLARHAYDKLRGLYIP (SEQ ID NO:796); ARFQKSIELG 
TLTIRAKPFHDSEELVPLCYRCSTNN (SEQ ID NO: 797); and/or PLLNNLGNVC 
INCRQPFIFSASSYDVLHLVEFYLEEGITDEEAISLIDLEVLRPKRDDRQLEICKQQ 
10 LPDSCG (SEQ ID NO:798). Polynucleotides encoding these polypeptides aie also 
encompassed by the invention. 

This gene is expressed primarily in testes. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

15 biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, male reproductive and endocrine disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the reproductive and endocrine systems, 

20 expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 

25 disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment of male reproductive and endocrine 
disorders. 

30 FEATURES OF PROTEIN ENCODED BY GENE NO: 207 
This gene is expressed in fetal lung. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
35 not limited to, lung diseases such as cystic fibrosis. Similarly, polypeptides and 

antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
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of the above tissues or cells, particularly of the respiratory system, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 440 as residues: Tyr-49 to Cys-54. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for detection and treatment of disorders associated 
with developing lungs particularly in premature infants where the lungs are the last 
tissues to develop. The tissue distribution indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosis and intervention of 
lung tumors since the gene may be involved in the regulation of cell division, 
particularly since it is expressed in fetal tissue. Protein, as well as, antibodies directed 
against the protein may show utility as a tumor marker and inmiunodierapy targets for 
the above listed tumors and tissues. 
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Table 1 summarizes the information corresponding to each "Gene No." described 
above. The nucleotide sequence identified as "NT SEQ ID NO:X" was assembled from 
partially homologous ("overlapping") sequences obtained from the "cDNA clone ID" 
identified in Table 1 and, in some cases, from additional related DNA clones. The 
overlapping sequences were assembled into a single contiguous sequence of high 
redundancy (usually three to five overlapping sequences at each nucleotide position), 
resulting in a final sequence identified as SEQ ID NO:X. 

The cDNA Clone ID was deposited on the date and given the corresponding 
deposit number listed in "ATCC Deposit No:Z and Date." Some of the deposits contain 
multiple different clones corresponding to the same gene. "Vector" refers to the type of 
vector contained m the cDNA Clone ID. 

. "Total NT Seq." refers to the total number of nucleotides in the contig identified 
by "Gene No." The deposited clone may contain all or most of these sequences, 
reflected by the nucleotide position indicated as "5' NT of Clone Seq." and the "3' NT 
of Clone Seq." of SEQ ID NO:X. The nucleotide position of SEQ ID NO:X of the 
putative start codon (methionine) is identified as "5* NT of Start Codon." Similarly , 
the nucleotide position of SEQ ID NO:X of the predicted signal sequence is identified as 
"5* NT of First AA of Signal Pep." 

The translated amino acid sequence, beginning with the methionine, is identified 
as "AA SEQ ID NO: Y " although other reading frames can also be easily translated 
using known molecular biology techniques. The polypeptides produced by these 
alternative open reading frames are specifically contemplated by the present invention. 

The first and last amino acid position of SEQ ID NO: Y of the predicted signal 
peptide is identified as "First AA of Sig Pep" and "Last AA of Sig Pep." The predicted 
first amino acid position of SEQ ID NO:Y of the secreted portion is identified as 
"Predicted First AA of Secreted Portion." Finally, the amino acid position of SEQ ID 
NO: Y of the last amino acid in the open reading frame is identified as '*Last AA of 
ORF." 

SEQ ID NO:X and the translated SEQ ID NO: Y arc sufficiently accurate and 
otherwise suitable for a variety of uses well known in the art and described further 
below. For instance. SEQ ID NO:X is useftil for designing nucleic acid hybridization 
probes that will detect nucleic acid sequences contained in SEQ ID NOrX or the cDNA 
contained in the deposited clone. These probes wUl also hybridize to nucleic acid 
molecules in biological samples, thereby enabling a variety of forensic and diagnostic 
methods of the invention. Similarly, polypeptides identified from SEQ ID NO: Y may 
be used to generate antibodies which bind specifically to the secreted proteins encoded 
by the cDN A clones identified in Table 1 . 
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Nevertheless, DNA sequences generated by sequencing reactions can contain 
sequencing errors. The errors exist as misidentified nucleotides, or as insertions or 
deletions of nucleotides in the generated DNA sequence. The erroneously inserted or 
deleted nucleotides cause frame shifts in the reading frames of the predicted amino acid 
5 sequence. In these cases, the predicted amino acid sequence diverges from the actual 
amino acid sequence, even though the generated DNA sequence may be greater than 
99.9% identical to the actual DNA sequence (for example, one base insertion or deletion 
in an open reading frame of over 1000 bases). 

Accordingly, for those applications requiring precision in the nucleotide 

10 sequence or the amino acid sequence, the present invention provides not only the 

generated nucleotide sequence identified as SEQ ID NO:X and the predicted translated 
amino acid sequence identified as SEQ ID NO:Y, but also a sample of plasmid DNA 
containing a human cDNA of the invention deposited with the ATCC, as set forth in 
Table 1. The nucleotide sequence of each deposited clone can readily be determined by 

15 ' sequencing the deposited clone in accordance with known methods. The predicted 
amino acid sequence can then be verified from such deposits. Moreover, the amino 
acid sequence of the protein encoded by a particular clone can also be directly 
determined by peptide sequencing or by expressing the protein in a suitable host cell 
containing the deposited human cDNA, collecting the protein, and determining its 

20 sequence. 

The present invention also relates to the genes corresponding to SEQ ID NO:X, 
SEQ ID NO: Y, or the deposited clone. The corresponding gene can be isolated in 
accordance with known methods using the sequence information disclosed herein. 
Such methods include preparing probes or primers from the disclosed sequence and 
25 identifying or amplifying the corresponding gene from appropriate sources of genomic 
material. 

Also provided in the present invention are species homologs. Species 
homologs may be isolated and identified by making suitable probes or primers fix)m the 
sequences provided herein and screening a suitable nucleic acid source for the desired 
30 homologue. 

The polypeptides of the invention can be prepared in any suitable manner. Such 
polypeptides include isolated naturally occurring polypeptides, recombinantly produced 
polypeptides, synthetically produced polypeptides, or polypeptides produced by a 
combination of these methods. Means for preparing such polypeptides are well 
35 understood in the art. 

The polypeptides may be in the form of the secreted protein, including the 
mature form, or may be a part of a larger protein, such as a fusion protein (see below). 
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It is often advantageous to include an additional amino acid sequence which contains 
secretory or leader sequences, pro-sequences, sequences which aid in purification , 
such as multiple histidine residues, or an additional sequence for stability during 
recombinant production. 
5 The polypeptides of the present invention are preferably provided in an isolated 

form, and preferably are substantially purified A recombinantiy produced version of a 
polypeptide, including the secreted polypeptide, can be substantially purified by the 
one-step method described in Smith and Johnson, Gene 67:31-40 (1988). 
Polypeptides of the invention also can be purified from natural or recombinant sources 
10 using antibodies of die invention raised against the secreted protein in metiiods which 
are well known in the art. 

Signal Sequences 

Methods for predicting whetiier a protein has a signal sequence, as well as the 

15 cleavage point for that sequence, are available. For instance, the metiiod of McGeoch, 
Virus Res. 3:27 1-286 (1985), uses the information from a short N-terminal charged 
region and a subsequent uncharged region of the complete (uncleaved) protein. The 
method of von Heinje, Nucleic Acids Res. 14:4683-4690 ( 1986) uses the information 
from the residues surrounding the cleavage site, typically residues -13 to +2, where +1 

20 indicates the amino terminus of the secreted protein. The accuracy of predicting the 
cleavage points of known mammalian secretory proteins for each of these methods is in 
the range of 75-80%. (von Heinje, supra.) However, the two methods do not always 
produce the same predicted cleavage point(s) for a given protein. 

In the present case, the deduced amino acid sequence of the secreted polypeptide 

25 was analyzed by a computer program called Signal? (Henrik Nielsen et al.. Protein 
Engineering 10:1-6 (1997)), which predicts tiie cellular location of a protein based on 
the amino acid sequence. As part of this computational prediction of localization, the 
methods of McGeoch and von Heinje are incorporated. The analysis of the amino acid 
sequences of the secreted proteins described herein by this program provided the results 

30 shown in Table 1. 

As one of ordinary skill would appreciate, however, cleavage sites sometimes 
vary from organism to organism and cannot be predicted with absolute certainty. 
Accordingly, the present invention provides secreted polypeptides having a sequence 
shown in SEQ ID NO: Y which have an N-terminus beginning within 5 residues (i.e., + 

35 or - 5 residues) of the predicted cleavage point. Similarly, it is also recognized that in 
some cases, cleavage of the signal sequence from a secreted protein is not entirely 
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uniform, resulting in more than one secreted species. These polypeptides, and the 
polynucleotides encoding such polypeptides, are contemplated by the present invention. 

Moreover, the signal sequence identified by the above analysis may not 
necessarily predict the naturally occurring signal sequence. For example, the naturally 
5 occurring signal sequence may be further upstream from the predicted signal sequence. 
However, it is likely that the predicted signal sequence will be capable ofdirecting the 
secreted protein to the ER. These polypeptides, and the polynucleotides encoding such 
polypeptides, are contemplated by the present invention. 

10 Polynucleotide and Polypeptide Variants 

"Variant" refers to a polynucleotide or polypeptide differing from the 
polynucleotide or polypeptide of the present invention, but retaining essential properties 
thereof. Generally, variants are overall closely similar, and, in many regions, identical 
to the polynucleotide or polypeptide of the present invention. 

15 By a polynucleotide having a nucleotide sequence at least, for example, 95% 

"identical" to a reference nucleotide sequence of the present invention, it is intended that 
the nucleotide sequence of the polynucleotide is identical to the reference sequence 
except that the polynucleotide sequence may include up to five point mutations per each 
100 nucleotides of the reference nucleotide sequence encoding the polypeptide. In other 

20 words, to obtain a polynucleotide having a nucleotide sequence at least 95% identical to 
a reference nucleotide sequence, up to 5% of the nucleotides in the reference sequence 
may be deleted or substituted with another nucleotide, or a number of nucleotides up to 
5% of the total nucleotides in the reference sequence may be inserted into the reference 
sequence. The query sequence may be an entire sequence shown inTable 1, the ORF 

25 (open reading frame), or any fragement specified as described herein. 

As a practical matter, whether any particular nucleic acid molecule or 
polypeptide is at least 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide 
sequence of the presence invention can be determined conventionally using known 
computer programs. A preferred method for determing the best overall match between 

30 a query sequence (a sequence of the present invention) and a subject sequence, also 
referred to as a global sequence alignment, can be determined using die FASTDB 
computer program based on the algorithni of Brudag et al. (Comp. App. Biosci. (1990) 
6:237-245). In a sequence alignment the query and subject sequences are botii DNA 
sequences. An RNA sequence can be compared by convertmg U*s to T's. The result 

35 of said global sequence alignment is in percent identity. Preferred parameters used in a 
FASTDB alignment of DNA sequences to calculate percent identiy are: 
Matrix=Unitary, k-tuple=4, Mismatch Penalty=l, Joining Penalty=30i Randomization 
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Group Length=0, Cutoff Score=l, Gap Penalty=5, Gap Size Penalty 0.05, Window 
Size=500 or the lenght of the subject nucleotide sequence, whichever is shorter. 

If the subject sequence is shorter than the query sequence because of 5' or 3* 
deletions, not because of uitemal deletions, a manual correction must be made to the 
5 results. This is becuase the FASTDB program does not account for 5' and 3' 
truncations of the subject sequence when calculating percent identity. For subject 
sequences truncated at the 5' or 3* ends, relative to the the query sequence, the percent 
identity is corrected by calculating the number of bases of the query sequence that are 5' 
, and 3' of the subject sequence, which are not matched/aligned, as a percent of the total 

10 bases of the query sequence. Whether a nucleotide is matched/aligned is determined by 
results of the FASTDB sequence alignment. This percentage is then subtracted from 
the percent identity, calculated by the above FASTDB program using the specified 
parameters, to arrive at a final percent identity score. This corrected score is what is 
used for the purposes of the present invention. Only bases outside the 5' and 3' bases 

15 of the subject sequence, as displayed by the FASTDB alignment, which are not 

matched/aligned with the queiy sequence, are calculated for the purposes of manually 
adjusting the percent identity score. 

For example, a 90 base subject sequence is aligned to a 100 base query 
sequence to determme percent identity. The deletions occur at the 5' end of the subject 

20 sequence and therefore, the FASTDB alignment does not show a matched/alignement of 
the first 10 bases at 5' end. The 10 unpaued bases represent 10% of the sequence 
(number of bases at the 5' and 3' ends not matched/total number of bases in the query 
sequence) so 10% is subtracted from the percent identity score calculated by the 
FASTDB program. If the remaining 90 bases were perfectly matched the final percent 

25 identity would be 90%. In another example, a 90 base subject sequence is compared 
with a 100 base query sequence. This time the deletions are internal deletions so that 
there are no bases on the 5* or 3' of the subject sequence which are not matched/aligned 
with the query. In this case the percent identity calculated by FASTDB is not manually 
corrected. Once again, only bases 5' and 3' of the subject sequence which are not 

30 matched/aligned with the query sequnce are? manually corrected for. No other manual 
corrections are to made for the purposes of the iwresent invention. 

By a polypeptide having an amino acid sequence at least, for exanq)le, 95% 
"identical" to a query amino acid sequence of the present invention, it is intended that 
the amino acid sequence of the subject polypeptide is identical to the query sequence 

35 except that the subject polypeptide sequence may include up to five amino acid 

alterations per each 100 amino acids of the query amino acid sequence. In other words, 
to obtain a polypeptide having an amino acid sequence at least 95% identical to a query 
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amino acid sequence, up to 5% of the amino acid residues in the subject sequence may 
be inserted, deleted, (indels) or substituted with another amino acid. These alterations 
of the reference sequence may occur at the amino or carboxy terminal j)Ositions of the 
reference amino acid sequence or anywhere between those terminal positions, 
5 interspersed either individually among residues in the reference sequence or in one or 
more contiguous groups within the reference sequence. 

As a practical matter, whether any particular polypeptide is at least 90%, 95%, 
96%, 97%, 98% or 99% identical to, for instance, the amino acid sequences shown in 
Table 1 or to the amino acid sequence encoded by deposited DNA clone can be 

10 determined conventionally using known computer programs. A preferred method for 
determing the best overall match between a query sequence (a sequence of the present 
invention) and a subject sequence, also referred to as a global sequence alignment, can 
be determined using the FASTDB computer program based on the algorithm of Brutlag 
et al. (Comp. App. Biosci. (1990) 6:237-245). In a sequence alignment the query and 

15 subject sequences are either both nucleotide sequences or both amino acid sequences. 
The result of said global sequence alignment is in percent identity. Preferred parameters 
used in a FASTDB amino acid alignment are: Matrix=PAM 0, k-tuple=2. Mismatch 
Penalty=l, Joining Penalty=20, Randomization Group Length=0, Cutoff Score=l, 
Window Sizessequence length. Gap Penalty=5. Gap Size Penalty=0.05, Window 

20 Size=500 or the length of the subject amino acid sequence, whichever is shorter. 

If the subject sequence is shorter than the query sequence due to N- or C- 
terminal deletions, not because of internal deletions, a manual correction must be made 
to the results. This is becuase the FASTDB program does not account for N- and C- 
terminal truncations of the subject sequence when calculating global percent identity. 

25 For subject sequences truncated at the N- and C-termini, relative to the the query 

sequence, the percent identity is corrected by calculating the number of residues of the 
query sequence that are N- and C-terminal of the subject sequence, which are not 
matched/aligned with a corresponding subject residue, as a percent of the total bases of 
the query sequence. Whether a residue is matched/aligned is determined by results of 

30 the FASTDB sequence alignment This percentage is then subtracted fioni the percent 
identity, calculated by the above FASTDB program using the specified parameters, to 
arrive at a final percent identity score. This final percent identity score is what is used 
for the purposes of the present invention. Only residues to the N- and C-termini of the 
subject sequence, which are not matched/aligned with the query sequence, are 

35 considered for the purposes of manually adjusting the percent identity score. That is, 
only query residue positions outside the farthest N- and C-terminal residues of the 
subject sequence. 
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For example, a 90 amino acid residue subject sequence is aligned with a 100 
residue query sequence to determine percent identity. The deletion occurs at the N- 
terminus of the subject sequence and therefore, the FASTDB alignment does not show 
a matching/alignment of the first 10 residues at the N-terminus. The 10 unpaked 
residues represent 10% of the sequence (number of residues at the N- and C- termini 
not matched/total number of residues in the query sequence) so 10% is subtracted from 
the percent identity score calculated by the FASTDB program. If the remaining 90 
residues were perfectly matched the final percent identity would be 90%. In another 
example, a 90 residue subject sequence is compared with a 100 residue query sequence. 
This time the deletions are internal deletions so there are no residues at the N- or C- 
termini of the subject sequence which are not matched/aligned with the query. In this 
case the percent identity calculated by FASTDB is not manually corrected Once again, 
only residue positions outside the N- and C-terminal ends of the subject sequence, as 
displayed in the FASTDB alignment, which are not matched/aligned with the query 
sequnce are manually corrected for. No other manual corrections are to made for the 
purposes of the present invention. 

The variants may contain alterations in the coding regions, non-coding regions, 
or both. Especially preferred are polynucleotide variants containing alterations which 
produce silent substitutions, additions, or deletions, but do not alter the properties or 
activities of the encoded polypeptide. Nucleotide variants produced by silent 
substitutions due to the degeneracy of the genetic code are preferred. Moreover, 
variants in which 5-10, 1-5, or 1-2 amino acids are substituted, deleted, or added in any 
combination are also preferred. Polynucleotide variants can be produced for a variety 
of reasons, e.g., to optimize codon expression for a particular host (change codons in 
the human mRNA to those preferred by a bacterial host such as E. coli). 

Naturally occurring variants are called "allelic variants," and refer to one of 
several alternate forms of a gene occupying a given locus on a chromosome of an 
organism. (Genes H. Lewin, B.. ed., John Wiley & Sons, New York (1985).) These 
allelic variants can vary at either the polynucleotide and/or polypeptide level. 
Alternatively, non-naturally occurring variants may be produced by mutagenesis 
techniques or by direct synthesis. 

Using known methods of protein engineering and recombinant DNA 
technology, variants may be generated to improve or alter the characteristics of the 
polypeptides of the present invention. For instance, one or more amino acids can be 
deleted from the N-terminus or C-terminus of the secreted protein without substantial 
loss of biological function. The authors of Ron et al., J. Biol. Chem. 268: 2984-2988 
(1993), reported variant KGF proteins having heparin binding activity even after 
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deleting 3, 8, or 27 amino-terminal amino acid residues. Similarly, Interferon gamma 
exhibited up to ten times higher activity after deleting 8-10 amino acid residues from the 
carboxy terminus of this protein. (Dobeh et al., J. Biotechnology 7:199-216 (1988).) 
Moreover, ample evidence demonstrates that variants often retain a biological 
5 activity similar to that of the naturally occurring protein. For example, Gaylc and 
coworkers (J. BioL Chem 268:22105-221 1 1 (1993)) conducted extensive mutational 
analysis of human cytokine IL-la. They used random mutagenesis to generate over 
3,500 individual IL-la mutants that averaged 2.5 amino acid changes per variant over 
the entire length of the molecule. Multiple mutations were examined at every possible 
10 amino acid position. The investigators found that "[mjost of the molecule could be 
altered with little effect on either [binding or biological activity].". (See, Abstract.) In 
fact, only 23 unique amino acid sequences, out of more than 3,500 nucleotide 
sequences examined, produced a protein that significantly differed in activity from wild- 
type. 

1 5 Furthermore, even if deleting one or more amino acids from the N-terminus or 

C-terminus of a polypeptide results in modification or loss of one or more biological 
functions, other biological activities may still be retained. For example, the ability of a 
deletion variant to induce and/or to bind antibodies which recognize the secreted form 
will likely be retained when less than the majority of the residues of the secreted form 

20 are removed from the N-terminus or C-temiinus. Whether a pardcular polypeptide 
lacking N- or C-terminal residues of a protein retains such immunogenic activities can 
readily be determined by routine methods described herein and otherwise known in the 
art. 

Thus, the invention further includes polypeptide variants which show 
25 substantial biological activity. Such variants include deletions, insertions, inversions, 
repeats, and substitutions selected according to general rules known in the art so as 
have litde effect on activity. For example, guidance concerning how to make 
phenotypically silent ammo acid substitutions is provided in Bowie, J. ts. et al.. 
Science 247: 1306-1310 (1990), wherein the authors indicate that there are two main 
30 strategies for studying the tolerance of an amino acid sequence to change. 

The first strategy exploits the tolerance of amino acid substitutions by natural 
selection during the process of evolution. By comparing amino acid sequences in 
different species, conserved amino acids can be identified. These conserved amino 
acids are likely important for protein fiinction. In contrast, the amino acid positions 
35 where substitutions have been tolerated by natural selection indicates that these 

positions are not critical for protein function. Thus, positions tolerating amino acid 
substitution could be modified while still maintaining biological activity of the protein. 
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The second strategy uses genetic engineering to introduce amino acid changes at 
specific positions of a cloned gene to identify regions critical for protein function. For 
example, site directed mutagenesis or alanine-scanning mutagenesis (introduction of 
single alanine mutations at every residue in the molecule) can be used. (Cunningham 
5 and Wells, Science 244: 1081-1085 (1989).) The resulting mutant molecules can then 
be tested for biological activity. 

As the authors state, these two strategies have revealed that proteins are 
surprisingly tolerant of amino acid substitutions. The authors further indicate which 
amino acid changes are likely to be permissive at certain amino acid positions in the 

10 protein. For example, most buried (within the tertiary structure of the protein) amino 
acid residues require nonpolar side chains, whereas few features of surface side chains 
are generally conserved. Moreover, tolerated conservative amino acid substitutions 
involve replacement of the aliphatic or hydrophobic amino acids Ala, Val, Leu and lie; 
replacement of the hydroxyl residues Ser and Thr, replacement of the acidic residues 

1 5 Asp and Glu; replacement of the amide residues Asn and Gin, replacement of the basic 
residues Lys, Arg, and His; replacement of the aromatic residues Phe, Tyr, and Trp, 
and replacement of the small-sized amino acids Ala, Ser, Thr, Met, and Gly. 

Besides conservative amino acid substitution, variants of the present invention 
include (i) substitutions with one or more of the non-conserved ammo acid residues, 

20 where the substituted amino acid residues may or may not be one encoded by the 
genetic code, or (ii) substitution with one or more of amino acid residues having a 
substituent group, or (iii) fusion of the mature polypeptide with another compound, 
such as a compound to increase the stability and/or solubility of the polypeptide (for 
example, polyethylene glycol), or (iv) fusion of the polypeptide with additional amino 

25 acids, such as an IgG Fc fusion region peptide, or leader or secretory sequence, or a 
sequence facilitating purification. Such variant polypeptides are deemed to be within 
the scope of those skilled in the art from the teachings herein. 

For example, polypeptide variants containing amino acid substitutions of 
charged amino acids with other charged or neutral amino acids may produce proteins 

30 with improved characteristics, such as less aggregation. Aggregation of pharmaceutical 
formulations both reduces activity and increases clearance due to the aggregate's 
immunogenic activity. (Pinckard et al., Clin. Exp. Immunol. 2:331-340 (1967); 
Robbins et al.. Diabetes 36: 838-845 (1987); Cleland et al., Crit. Rev. Therapeutic 
Drug Carrier Systems 10:307-377 (1993).) 

35 



wo 98/54963 



PCT/US98/11422 



198 

Polynucle otide and Polypeptide Fragments 

In the present invention, a "polynucleotide fragment" refers to a short 
polynucleotide having a nucleic acid sequence contained in the deposited clone or 
shown in SEQ ID NO:X. The short nucleotide fragments are preferably at least about 
5 1 5 nt, and more preferably at least about 20 nt, still more preferably at least about 30 nt, 
and even more preferably, at least about 40 nt in length. A fragment "at least 20 nt in 
length," for example, is intended to include 20 or more contiguous bases from the 
cDNA sequence contained in the deposited clone or the nucleotide sequence shown in 
SEQ ID NO:X. These nucleotide fragments are useful as diagnostic probes and primers 

10 as discussed herein. Of course, larger fragments (e.g., 50, 150, 500, 600, 2000 
nucleotides) are preferred. 

Moreover, representative examples of polynucleotide fragments of the 
invention, include, for example, fragments having a sequence from about nucleotide 
number 1-50, 51-100, 101-150, 151-200, 201-250, 251-300, 301-350, 351-400, 401- 

15 450, 451-500, 501-550, 551-600, 651-700, 701-750, 751-800, 800-850, 851-900, 
901-950, 951-1000, 1001-1050, 1051-1100, 1101-1150, 1151-1200, 1201-1250, 
1251-1300, 1301-1350, 1351-1400, 1401-1450, 1451-1500, 1501-1550, 1551-1600, 
1601-1650, 1651-1700, 1701-1750, 1751-1800, 1801-1850, 1851-1900, 1901-1950, 
195 1-2000, or 2001 to the end of SEQ ID NO:X or the cDNA contained in the 

20 deposited clone. In this context "about" includes the particularly recited ranges, larger 
or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both termini. 
Preferably, these fragments encode a polypeptide which has biological activity. More 
preferably, these polynucleotides can be used as probes or primers as discussed herein. 
In the present invention, a "polypeptide fragment" refers to a short amino acid 

25 sequence contained in SEQ ID NO: Y or encoded by the cDNA contained in the 

deposited clone. Protein fragments may be "free-standing," or comprised within a 
larger polypeptide of which the fragment forms a part or region, most preferably as a 
single continuous region. Representative examples of polypeptide fragments of the 
invention, include, for example, fragments from about amino acid number 1-20, 21-40, 

30 41-60, 61-80, 81-100, 102-120, 121-140, 141-160, or 161 to the end of the coding 
region. Moreover, polypeptide fragments can be about 20, 30, 40, 50, 60, 70, 80, 90, 
100, 1 10, 120, 130, 140, or 150 amino acids in length. In this context "about" 
includes the particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1) 
amino acids, at either extreme or at both extremes. 

35 Preferred polypeptide fragments include the secreted protein as well as the 

mature form. Further preferred polypeptide fragments include the secreted protein or 
the mature form having a continuous series of deleted residues from the amino or the 
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carboxy terminus, or both. For example, any number of amino acids, ranging from 1- 
60, can be deleted from the amino terminus of either the secreted polypeptide or the 
mature form. Similarly, any number of amino acids, ranging from 1-30, can be deleted 
from the carboxy terminus of the secreted protein or mature form. Furthermore, any 
5 combination of the above amino and carboxy terminus deletions are preferred. 
Similarly, polynucleotide fragments encoding these polypeptide fragments are also 
preferred. 

Particularly, N-terminal deletions of the polypeptide of the present invention can 
be described by the general formula m-p, where p is the total number of amino acids in 
10 the polypeptide and m is an integer from 2 to (p-1), and where both of these integers (m 
& p) correspond to the position of the amino acid residue identified in SEQ ID NO: Y. 

Moreover, C-terminal deletions of the polypeptide of the present invention can 
also be described by the general formula 1-n, where n is an integer from 2 to (p-1), and 
again where these integers (n & p) correspond to the position of the amino acid residue 
15 identified in SEQ ID NO: Y. 

The invention also provides polypeptides having one or more amino acids 
deleted from both the amino and the carboxyl termini, which may be described 
generally as having residues m-n of SEQ ID NO: Y, where m and n are integers as 
described above. 

20 Also preferred are polypeptide and polynucleotide fi-agments characterized by 

structural or functional domains, such as fragments that comprise alpha-helix and alpha- 
helix forming regions, beta-sheet and beta-sheet-forming regions, turn and turn- 
forming regions, coil and coil-forming regions, hydrophilic regions, hydrophobic 
regions, alpha amphipathic regions, beta amphipathic regions, flexible regions, surface- 

25 forming regions, substrate binding region, and high antigenic index regions. 
Polypeptide fragments of SEQ ID NO: Y falling within conserved domains are 
specifically contemplated by the present invention. Moreover, polynucleotide 
firagments encoding these domains are also contemplated. 

Other preferred fragments are biologically active fragments. Biologically active 

30 fragments are diose exhibiting activity similar, but not necessarily identical, to an 
activity of the polypeptide of the present invention. The biological activity of tiie 
fragments may include an improved desired activity, or a decreased undesirable activity. 

gpitopeg? & Antibodies 

35 In the present invention, "epitopes" refer to polypeptide firagments having 

antigenic or inununogenic activity in an animal, especially in a human. A preferred 
embodiment of the present invention relates to a polypeptide fragment comprising an 
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epitope, as well as the polynucleotide encoding this fragment. A region of a protein 
molecule to which an antibody can bind is defined as an "antigenic epitope." In 
contrast, an "immunogenic epitope" is defined as a part of a protein that elicits an 
antibody response. (See, for instance, Geysen et aL, Proc. Natl. Acad. Sci. USA 
5 81:3998-4002(1983).) 

Fragments which function as epitopes may be produced by any conventional 
means. (See, e.g., Houghten, R. A., Proc. Natl. Acad. Sci. USA 82:5131-5135 
(1985) further described in U.S. Patent No. 4,631,21 1.) 

In the present invention, antigenic epitopes preferably contain a sequence of at 

10 least seven, more preferably at least nine, and most preferably between about 15 to 
about 30 amino acids. Antigenic epitopes are useful to raise antibodies, including 
monoclonal antibodies, that specifically bind the epitope. (See, for instance, Wilson et 
al.. Cell 37:767-778 (1984); Sutcliffe, J. G. et al.. Science 219:660-666 (1983).) 

Similarly, immunogenic epitopes can be used to induce antibodies according to 

15 methods well known m the art. (See, for instance, Sutcliffe et al., supra; Wilson et al., 
supra; Chow, M. et al., Proc. Natl. Acad. Sci. USA 82:910-914; and Bittle, F. J. et 
al., J.Gen. Virol. 66:2347-2354(1985).) A preferred immunogenic epitope includes 
the secreted protein. The inmiunogenic epitopes may be presented together with a ' 
carrier protein, such as an albumin, to an animal system (such as rabbit or mouse) or, if 

20 it is long enough (at least about 25 amino acids), without a carrier. However, 

immunogenic epitopes comprising as few as 8 to 10 amino acids have been shown to be 
sufficient to raise antibodies capable of binding to, at the very least, linear epitopes in a 
denatured polypeptide (e.g., in Western blotting.) 

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is 

25 meant to include intact molecules as well as antibody fragments (such as, for example, 
Fab and F(ab')2 fragments) which are capable of specifically binding to protein. Fab 
and F(ab')2 fragments lack the Fc fragment of intact antibody, clear more rapidly fi-om 
the circulation, and may have less non-specific tissue binding than an intact antibody. 
(Wahl et al., J. Nucl. Med. 24:316-325 (1983).) Thus, these fragments are preferred, 

30 as well as the products of a FAB or other immunoglobulin expression library. 
Moreover, antibodies of the present invention include chimeric, single chain, and 
humanized antibodies. 

Fusion Proteins 

35 Any polypeptide of the present invention can be used to generate fusion 

proteins. For example, the polypeptide of the present invention, when fused to a 
second protein, can be used as an antigenic tag. Antibodies raised against the 
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polypeptide of the present invention can be used to indirectly detect the second protein 
by binding to the polypeptide. Moreover, because secreted proteins target cellular 
locations based on trafficking signals, the polypeptides of the present invention can be 
used as targeting molecules once fused to other proteins. 
5 Examples of domains that can be fused to polypeptides of the present invention 

include not only heterologous signal sequences, but also other heterologous functional 
regions. The fusion does not necessarily need to be direct, but may occur through 
linker sequences. 

Moreover, fusion proteins may also be engineered to improve characteristics of 

10 the polypeptide of the present invention. For instance, a region of additional amino 
acids, particularly charged annino acids, may be added to the N-terminus of the 
polypeptide to improve stability and persistence during purification from the host cell or 
subsequent handling and storage. Also, peptide moieties may be added to the 
polypeptide to facilitate purification. Such regions may be removed prior to final 

15 preparation of the polypeptide. The addition of peptide moieties to facilitate handling of 
polypeptides aire familiar and routine techniques in the ait. 

Moreover, polypeptides of the present invention, including fragments, and 
specifically epitopes, can be combined with parts of the constant domain of 
immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins 

20 facilitate purification and show an increased halMife in vivo. One reported example 
describes chimeric proteins consisting of the first two domains of the human CD4- 
polypeptide and various domains of the constant regions of the heavy or light chains of 
mammalian immunoglobulins. (EP A 394,827; Traunecker et al.. Nature 331:84-86 
(1988).) Fusion proteins having disulfide-hnked dimeric structures (due to the IgG) 

25 can also be more efficient in binding and neutralizing other molecules, than the 
monomeric secreted protein or protein fragment alone. (Fountoulakis et al., J. 
Biochem. 270:3958-3964 (1995).) 

Similarly, EP-A-O 464 533 (Canadian counterpart 2045869) discloses fiision 
proteins comprising various portions of constant region of immunoglobulin molecules 

30 together with another human protein or part thereof. In many cases, the Fc part in a 
fusion protein is beneficial in therapy and diagnosis, and thus can result in, for 
example, improved pharmacokinetic properties. (EP-A 0232 262.) Alternatively, 
deleting the Fc part after the fusion protein has been expressed, detected, and purified, 
would be desired. For example, the Fc portion may hinder therapy and diagnosis if the 

35 fusion protein is used as an antigen for immunizations. In drug discovery, for 
example, human proteins, such as hIL-5, have been fused with Fc portions for the 
purpose of high-throughput screening assays to identify antagonists of hIL-5. (See, D. 
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Bennett et al., J. Molecular Recognition 8:52-58 (1995); K. Johanson et al., J. Biol. 
Chem. 270:9459-9471 (1995).) 

Moreover, the polypeptides of the present invention can be fused to marker 
sequences, such as a peptide which facilitates purification of the fused polypeptide. In 
5 preferred embodiments, the marker amino acid sequence is a hexa-histidine peptide, 
such as the tag provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, 
Chatsworth, CA, 91311), among others, many of which are commercially available. 
As described in Gentz et al., Proc. Natl. Acad. Sci. USA 86:821-824 (1989), for 
instance, hexa-histidine provides for convenient purification of the fusion protein. 
10 Another peptide tag useful for purification, the "HA" tag, corresponds to an epitope 
derived from the influenza hemagglutinin protein. (Wilson et al., Cell 37:767 (1984).) 

Thus, any of these above fusions can be engineered using the polynucleotides 
or the polypeptides of the present invention. 

15 Vectors. Host Cells, and Protein Prndiigtinn 

The present invention also relates to vectors containing the polynucleotide of the 
present invention, host cells, and the production of polypeptides by reconibinant 
techniques. The vector may be, for example, a phage, plasmid, viral, or retroviral 
vector. Retroviral vectors may be replication competent or replication defective. In the 

20 latter case, viral propagation generally will occur only in complementing host cells. 

The polynucleotides may be joined to a vector containing a selectable marker for 
propagation in a host. Generally, a plasmid vector is introduced in a precipitate, such 
as a calcium phosphate precipitate, or in a complex with a charged lipid. If the vector is 
a virus, it may be packaged in vitro using an appropriate packaging cell line and then 

25 transduced into host cells. 

The polynucleotide insert should be operatively linked to an appropriate 
promoter, such as the phage lambda PL promoter, the E. coli lac. trp, phoA and tac 
promoters, the SV40 early and late promoters and promoters of retroviral LTRs, to 
name a few. Other suitable promoters will be known to the skilled artisan. The 

30 expression constracts will further contain sites for transcription initiation, termination, 
and, in the transcribed region, a ribosome binding site for translation. The coding 
portion of the transcripts expressed by the constracts will preferably include a 
translation initiating codon at the beginning and a termination codon (UAA, UGA or 
UAG) appropriately positioned at the end of the polypeptide to be translated. 

35 As indicated, the expression vectors will preferably include at least one 

selectable marker. Such markers include dihydrofolate reductase, G418 or neomycin 
resistance for eukaiyotic cell culture and tetracyclme, kanamycin or ampicillin resistance 
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genes for culturing in E. coli and other bacteria. Representative examples of 
appropriate hosts include, but are not liniited to, bacterial cells, such as E. coli, 
Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells; insect 
cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO. COS, 
5 293, and Bowes melanoma cells; and plant cells. Appropriate culture mediums and 
conditions for the above-described host cells are known in the art. 

Among vectors preferred for use in bacteria include pQE70, pQE60 and pQE-9, 
available from QIAGEN, Inc.; pBluescript vectors, Phagescript vectors, pNH8A, 
pNH16a, pNHlSA, pNH46A, available from Stratagene Cloning Systems, Inc.; and 

10 ptrc99a, pKK223-3, pKK233-3, pDR540, pRTTS available from Pharmacia Biotech, 
Inc. Among preferred eukaryotic vectors are pWLNEO,pSV2CAT,pOG44,pXTl 
and pSG available from Stratagene; and pS VK3, pBPV, pMSG and pSVL available 
from Pharmacia Other suitable vectors will be readily apparent to the skilled artisan. 
Introduction of the construct into the host cell can be effected by calcium 

15 phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated 
transfection, electroporation, transduction, infection, or other methods. Such methods 
are described in many standard laboratory manuals, such as Davis et al., Basic Methods 
In Molecular Biology (1986). It is specifically contemplated that the polypeptides of the 
present invention may in fact be expressed by a host cell lacking a recombinant vector, 

20 A polypeptide of this invention can be recovered and purified fix)m recombinant 

cell cultures by well-known methods including anmionium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography and lectin chromatography. Most 

25 preferably, high performance liquid chromatography ("HPLC") is employed for 
purification. 

Polypeptides of die present invention, and preferably the secreted form, can also 
be recovered from: products purified fix)m natural sources, including bodily fluids, 
tissues and cells, whether directly isolated or cultured; products of chemical synthetic 

30 procedures; and products produced by recombinant techniques from a prokaryotic or 
eukaiyotic host, including, for example, bacterial, yeast, higher plant, insect, and 
mammalian cells. Depending upon the host employed in a recombinant production 
procedure, the polypeptides of the present invention may be glycosylated or may be 
non-glycosylated. In addition, polypeptides of the mvention may also include an initial 

35 modified methionine residue, in some cases as a result of host-mediated processes. 
Thus, it is well known in the art that the N-terminal methionine encoded by the 
translation initiation codon generally is removed with high efficiency from any protein 



wo 98/54963 



204 



PCT/US98/11422 



after translation in all eukaiyotic cells. While the N-temiinal methionine on most 
proteins also is efficiently removed in most prokaryotes, for some proteins, this 
prokaryotic removal process is inefficient, depending on the nature of the amino acid to 
which the N-terminal methionine is covalently linked. 

Uses of the Polynucleotides 

Each of the polynucleotides identified herein can be used in numerous ways as 
reagents. The following description should be considered exemplary and utihzes 
known techniques. 

The polynucleotides of the present invention are useful for chromosome 
identification. There exists an ongoing need to identify new chromosome markers, 
since few chromosome marking reagents, based on actual sequence data (repeat 
polymorphisms), are presently available. Each polynucleotide of the present invention 
can be used as a chromosome marker. 

Briefly, sequences can be mapped to chromosomes by preparing PCR primers 
(preferably 15-25 bp) from the sequences shown in SEQ ID NO:X. Primers can be 
selected using computer analysis so that primers do not span more than one predicted 
exon in the genomic DN A. These primers are then used for PCR screening of somatic 
cell hybrids containing individual human chromosomes. Only those hybrids containing 
the human gene corresponding to the SEQ ID NO:X will yield an amplified fragment. 

Similarly, somatic hybrids provide a rapid method of PCR mapping the 
polynucleotides to particular chromosomes. Three or more clones can be assigned per 
day using a single thermal cycler. Moreover, sublocalization of the polynucleotides can 
be achieved with panels of specific chromosome fragments. Other gene mapping 
strategies that can be used include in situ hybridization, prescreening with labeled flow- 
sorted chromosomes, and preselection by hybridization to construct chromosome 
specific-cDNA libraries. 

Precise chromosomal location of the polynucleotides can also be achieved using 
fluorescence in situ hybridization (FISH) of a metaphase chromosomal spread. This 
technique uses polynucleotides as short as 500 or 600 bases; however, polynucleotides 
2,000-4,000 bp are preferred For a review of this technique, see Verma et al., 
"Human Chromosomes: a Manual of Basic Techniques," Pergamon Press, New York 
(1988). 

For chromosome mapping, the polynucleotides can be used individually (to 
mark a single chromosome or a single site on that chromosome) or in panels (for 
marking multiple sites and/or multiple chromosomes). Preferred polynucleotides 
correspond to the noncoding regions of the cDNAs because the coding sequences are 
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more likely conserved within gene families, thus increasing the chance of cross 
hybridi2:ation during chromosomal mapping. 

Once a polynucleotide has been mapped to a precise chromosomal location, the 
physical position of the polynucleotide can be used in linkage analysis. Linkage 
5 analysis establishes coinheritance between a chromosomal location and presentation of a 
particular disease. (Disease mapping data are found, for example, in V. McKusick, 
Mendelian Inheritance in Man (available on line through Johns Hopkins University 
Welch Medical Library) .) Assuming 1 megabase mapping resolution and one gene per 
20 kb, a cDN A precisely localized to a chromosomal region associated with the disease 

10 could be one of 50-500 potential causative genes. 

Thus, once coinheritance is established, differences in the polynucleotide and 
the corresponding gene between affected and unaffected individuals can be examined. 
iFirst, visible structural alterations in the chromosomes, such as deletions or 
translocations, are examined in chromosome spreads or by PGR. If no structural 

15 alterations exist, the presence of point mutations are ascertained. Mutations observed in 
some or all affected individuals, but not in normal individuals, indicates that the 
mutation may cause the disease. However, complete sequencing of the polypeptide and 
the corresponding gene from several normal individuals is required to distinguish the 
mutation from a polymorphism. If a new polymorphism is identified, this polymorphic 

20 polypeptide can be used for further linkage analysis. 

Furthermore, increased or decreased expression of the gene in affected 
individuals as compared to unaffected individuals can be assessed using 
polynucleotides of the present invention. Any of these alterations (altered expression, 
chromosomal rearrangement, or mutation) can be used as a diagnostic or prognostic 

25 marker. 

In addition to the foregoing, a polynucleotide can be used to control gene 
expression through triple helix formation or antisense DNA or RNA. Both methods 
rely on binding of the polynucleotide to DNA or RNA. For these techniques, preferred 
polynucleotides are usually 20 to 40 bases in length and complementary to either the 

30 region of the gene involved in transcription (triple helix - see Lee et jal., NucL Acids 
Res. 6:3073 (1979); Cooney et al.. Science 241:456 (1988); and Dervan et al.. Science 
251:1360 (1991) ) or to the mRNA itself (anUsense - Okano, J, Neurochem. 56:560 
(1991); Oligodeoxy-nucleotides as Antisense Inhibitors of Gene Expression, CRC 
Press, Boca Raton, FL (1988).) Triple helix formation optimally results in a shut-off 

35 of RNA transcription from DNA, while antisense RNA hybridization blocks translation 
of an mRNA molecule into polypeptide. Both techniques arc effective in model 



wo 98/54963 



PCT/US98/11422 



206 

systems, and the infonnation disclosed herein can be used to design anUsense or triple 
helix polynucleotides in an effort to treat disease. 

PolynucleoUdes of the present invention are also useful in gene therapy. One 
goal of gene therapy is to insert a nomud gene into an organism having a defective 
gene, in an effort to correct the genetic defect. The polynucleotides disclosed in the 
present invention offer a means of targeting such genetic defects in a highly accurate 
manner. Another goal is to insert a new gene that was not present in the host genome, 
thereby producing a new trait in the host cell. 

The polynucleotides are also useful for identifying individuals from minute 
biological samples. The United States military, for example, is considering the use of 
restriction fragment length polymorphism (RFLP) for identification of its personnel. In 
this technique, an individual's genomic DNA is digested with one or more resu-iction 
enzymes, and probed on a Southern blot to yield unique bands for identifying 
personnel. This method does not suffer from the current limitations of "Dog Tags" 
which can be lost, switched, or stolen, making positive identification difficult. The 
polynucleotides of the present invention can be used as additional DNA markers for 
RFLP. 

The polynucleotides of the present invention can also be used as an alternative to 
RFLP, by determining tiie actual base-by-base DNA sequence of selected portions of an 
individual's genome. These sequences can be used to prepare PCR primers for 
amplifying and isolating such selected DNA, which can tiien be sequenced. Using this 
technique, individuals can be identified because each individual will have a unique set 
of DNA sequences. Once an unique ID database is established for an individual, 
positive identification of tiiat individual, living or dead, can be made from extremely 
small tissue samples. 

Forensic biology also benefits from using DNA-based identification techniques 
as disclosed herein. DNA sequences taken from very small biological samples such as 
tissues, e.g., hair or skin, or body fluids, e.g., blood, saliva, semen, etc., can be 
amplified using PCR. In one prior art technique, gene sequences amplified ftom 
polymorphic loci, such as DQa class H HLA gene, are used in forensic biology to 
identify individuals. (Erlich, H., PCR Technology, Freeman and Co. (1992).) Once 
these specific polymorphic loci are amplified, they arc digested with one or more 
restriction enzymes, yielding an identifying set of bands on a Soudiem blot probed witii 
DNA corresponding to die DQa class H HLA gene. Similarly, polynucleotides of the 
present invention can be used as polymoiphic markers for forensic purposes. 

There is also a need for reagents capable of identifying the source of a particular 
tissue. Such need arises, for example, in forcnisics when presented witii tissue of 
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unknown origin. Appropriate reagents can comprise, for example, DNA probes or 
primers specific to particular tissue prepared from the sequences of the present 
invention. Panels of such reagents can identify tissue by species and/or by organ type. 
In a similar fashion, these reagents can be used to screen tissue cultures for 
S contamination. 

In the very least, the polynucleotides of the present invention can be used as 
molecular weight markers on Southern gels, as diagnostic probes for the presence of a 
specific mRNA in a particular* cell type, as a probe to "subtract-out" known sequences 
in the process of discovering novel polynucleotides, for selecting and making oligomers 
10 for attachment to a "gene chip" or other support, to raise anti-DNA antibodies using 
DNA inmiunization techniques, and as an antigen to elicit an immune response; 

Uses of the Polypeptides 

Each of the polypeptides identified herein can be used in numerous ways. The 

15 following description should be considered exemplary and utilizes known techniques. 

A polypeptide of the present invention can be used to assay protein levels in a 
biological sample using antibody-based techniques. For example, protein expression in 
tissues can be studied with classical immunohistological methods. (Jalkanen, M., et 
al., J. Cell. Biol. 101:976-985 (1985); Jalkanen, M., et al., J. CeU . Biol. 105:3087- 

20 3096 ( 1 987).) Other antibody-based methods useful for detecting protein gene 

expression include immimoassays, such as the enzyme linked inmiunosorbent assay 
(ELISA) and the radioimmunoassay (RIA). Suitable antibody assay labels are known 
in the art and include enzyme labels, such as, glucose oxidase, and radioisotopes, such 
as iodine (1251, 1211), carbon (14C), sulfur (35S), tritium (3H), indium (1 12In), and 

25 technetium (99mTc), and fluorescent labels, such as fluorescein and rhodamine, and 
biotin. 

In addition to assaying secreted protein levels in a biological sample, proteins 
can also be detected in vivo by imaging. Antibody labels or markers for m vivo 
imaging of protein include those detectable by X-radiography, NMR or ESR. For X- 
30 radiography, suitable labels include radioisotopes such as barium or cesium, which emit 
detectable radiation but are not overtly harmful to the subject. Suitable markers for 
NMR and ESR include those with a detectable characteristic spin, such as deuterium, 
which may be incorporated into the antibody by labeling of nutrients for the relevant 
hybridoma. 

35 A protein-specific antibody or antibody fragment which has been labeled with 

an appropriate detectable imaging moiety, such as a radioisotope (for example, 1311, 
1 12In, 99mTc), a radio-opaque substance, or a material detectable by nuclear magnetic 
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resonance, is introduced (for example, parenterally, subcutaneously, or 
intraperitoneally) into the mammal. It will be understood in the art that the size of the 
subject and the imaging system used will determine the quantity of imaging moiety 
needed to produce diagnostic images. In the case of a radioisotope moiety, for a human 
5 subject, the quantity of radioactivity injected will normally range from about 5 to 20 
millicuries of 99mTc. The labeled antibody or antibody fragment will then 
preferentially accumulate at the location of cells which contain the specific protein. In 
vivo tumor imaging is described in S.W. Burchiel et al., "Immunopharmacokinetics of 
Radiolabeled Antibodies and Their Fragments." (Chapter 13 in Tumor Imaging: The 

10 Radiochemical Detection of Cancer, S.W. Burchiel and B. A. Rhodes, eds., Masson 
Publishing Inc. (1982).) 

Thus, the invention provides a diagnostic method of a disorder, which involves 
(a) assaying the expression of a polypeptide of the present invention in cells or body 
fluid of an individual; (b) comparing the level of gene expression with a standard gene 

15 expression level, whereby an increase or decrease in the assayed polypeptide gene 
expression level compared to the standard expression level is indicative of a disorder. 

Moreover, polypeptides of the present invention can be used to treat disease. 
For example, patients can be administered a polypeptide of the present invention in an 
effort to replace absent or decreased levels of the polypeptide (e.g., insulin), to 

20 supplement absent or decreased levels of a different polypeptide (e.g., hemoglobin S 
for hemoglobin B), to inhibit the activity of a polypeptide (e.g., an oncogene), to 
activate the activity of a polypeptide (e.g., by binding to a receptor), to reduce the 
activity of a membrane bound receptor by competing with it for free ligand (e.g., 
soluble TNF receptors used in reducing inflammation), or to bring about a desired 

25 response (e.g., blood vessel growth). 

Similarly, antibodies directed to a polypeptide of the present invention can also 
be used to treat disease. For example, administration of an antibody directed to a 
polypeptide of the present invention can bind and reduce overproduction of the 
polypeptide. Similarly, administration of an antibody can activate the polypeptide, such 

30 as by binding to a polypeptide bound to a membrane (receptor). 

At the very least, the polypeptides of the present invention can be used as 
molecular weight markers on SDS-PAGE gels or on molecular sieve gel filtration 
columns using methods well known to those of skill in the art. Polypeptides can also 
be used to raise antibodies, which in turn are used to measure protein expression from a 

35 recombinant cell, as a way of assessing transformation of the host cell. Moreover, the 
polypeptides of the present invention can be used to test the following biological 
activities. 
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Biological Activities 

The polynucleotides and pplypeptides of the present invention can be used in 
assays to test for one or more biological activities. If these polynucleotides and 
polypeptides do exhibit activity in a particular assay, it is likely that these molecules 
may be involved in the diseases associated with the biological activity. Thus, the 
polynucleotides and polypeptides could be used to treat the associated disease. 

Immune Activity 

A polypeptide or polynucleotide of the present invention may be useful in 
treating deficiencies or disorders of the immune system, by activating or inhibiting the 
proliferation, differentiation, or mobilization (chemotaxis) of immune cells. Immune 
cells develop through a process called hematopoiesis, producing myeloid (platelets, red 
blood cells, neutrophils, and macrophages) and lymphoid (B and T lymphocytes) cells 
from pluripotent stem cells. The etiology of these immune deficiencies or disorders 
may be genetic, somatic, such as cancer or some autoimmune disorders, acquired (e.g., 
by chemotherapy or toxins), or infectious. Moreover, a polynucleotide or polypeptide 
of the present invention can be used as a marker or detector of a particular immune 
system disease or disorder. 

A polynucleotide or polypeptide of the present invention may be useful in 
treating or detecting deficiencies or disorders of hematopoietic cells. A polypeptide or 
polynucleotide of the present invention could be used to increase differentiation and 
proliferation of hematopoietic cells, including the pluripotent stem cells, in an effort to 
treat those disorders associated with a decrease in certain (or many) types hematopoietic 
cells. Examples of immunologic deficiency syndromes include, but are not limited to: 
blood protein disorders (e.g. agammaglobulinemia, dysgammaglobulinemia), ataxia 
telangiectasia, conunon variable immunodeficiency, Digeorge Syndrome, HIV 
infection, HTLV-BLV mfection, leukocyte adhesion deficiency syndrome, 
lymphopenia, phagocyte bactericidal dysfunction, severe combined immunodeficiency 
(SCIDs), Wiskott-Aldrich Disorder, anemia, thrombocytopenia, or hemoglobinuria. 

Moreover, a polypeptide or polynucleotide of the present invention could also 
be used to modulate hemostatic (the stopping of bleeding) or thrombolytic activity (clot 
formation). For example, by increasing hemostatic or thrombolytic activity, a 
polynucleotide or polypeptide of the present invention could be used to ti:eat blood 
coagulation disorders (e.g., afibrinogenemia, factor deficiencies), blood platelet 
disorders (e.g. thrombocytopenia), or wounds resulting from trauma, surgery, or other 
causes. Alternatively, a polynucleotide or polypeptide of the present invention that can 
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decrease hemostatic or thrombolytic activity could be used to inhibit or dissolve 
clotting. These molecules could be important in the treatment of heart attacks 
(infarction), strokes, or scarring. 

A polynucleotide or polypeptide of the present invention may also be useful in 
5 treating or detecting autoimmune disorders. Many autoinmiune disorders result from 
inappropriate recognition of self as foreign material by immune cells. This 
inappropriate recognition results in an immune response leading to the destruction of the 
host tissue. Therefore, the administration of a polypeptide or polynucleotide of the 
present invention that inhibits an immune response, particularly the proliferation, 

10 differentiation, or chemotaxis of T-cells, may be an effective therapy in preventing 
autoimmune disorders. 

Examples of autoimmune disorders that can be treated or detected by the present 
invention include, but are not limited to: Addison's Disease, hemolytic anemia, 
antiphospholipid syndrome, rheumatoid arthritis, dermatitis, allergic encephalomyelitis, 

15 glomerulonephritis, Goodpasture's Syndrome, Graves' Disease, Multiple Sclerosis, 
Myasthenia Gravis, Neuritis, Ophthalmia, Bullous Pemphigoid, Pemphigus, 
Polyendocrinopathies, Purpura, Reiter's Disease, Stiff-Man Syndrome, Autoimmune 
Thyroiditis, Systemic Lupus Erythematosus, Autoimmune Pulmonary Inflammation, 
Guillain-Barre Syndrome, insulin dependent diabetes mellitis, and autoinmiune 

20 inflammatory eye disease. 

Similarly, allergic reactions and conditions, such as asthma (particularly allergic 
asthma) or other respiratory problems, may also be U*eated by a polypeptide or 
polynucleotide of the present invention. Moreover, these molecules can be used to treat 
anaphylaxis, hypersensitivity to an antigenic molecule, or blood group incompatibility. 

25 A polynucleotide or polypeptide of the present invention may also be used to 

treat and/or prevent organ rejection or graft-versus-host disease (GVHD). Organ 
rejection occurs by host immune cell destmction of the transplanted tissue through an 
immune response. Similarly, an immune response is also involved in GVHD, but, in 
this case, the foreign transplanted inmiune cells destroy the host tissues. The 

30 administration of a polypeptide or polynucleotide of the present invention that inhibits 
an inmiune response, particularly the proliferation, differentiation, or chemotaxis of T- 
cells, may be an effective therapy in preventing organ rejection or GVHD. 

Similarly, a polypeptide or polynucleotide of the present invention may also be 
used to modulate inflammation. For example, the polypeptide or polynucleotide may 

35 inhibit the prohferation and differentiation of cells involved in an inflammatory 

response. These molecules can be used to treat inflammatory conditions, both chronic 
and acute conditions, including inflanunation associated with infection (e.g., septic 
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shock, sepsis, or systemic inflammatory response syndrome (SIRS)), ischemia- 
reperfiision injury, endotoxin lethality, arthritis, complement-mediated hyperacute 
rejection, nephritis, cytokine or chemokine induced lung injury, inflanrmiatory bowel 
disease, Crohn's disease, or resulting from over production of cytokines (e.g., TNF or 
IL-1.) 

Hyperproliferative Disorders 

A polypeptide or polynucleotide can be used to treat or detect hyperproliferative 
disorders, including neoplasms. A polypeptide or polynucleotide of the present 
invention may inhibit the proliferation of the disorder through direct or indirect 
interactions. Alternatively, a polypeptide or polynucleotide of the present invention 
may proUferate other cells which can inhibit the hyperproliferative disorder. 

For example, by increasing an inmiune response, particularly increasing 
antigenic qualities of the hyperproliferative disorder or by proliferating, differentiating, 
or mobilizing T-cells, hyperproliferative disorders can be treated. This immune 
response may be increased by either enhancing an existing immune response, or by 
initiating a new immune response. Alternatively, decreasing an inmiune response may 
also be a method of treating hyperproliferative disorders, such as a chemotherapeutic 
agent. 

Examples of hyperproliferative disorders that can be treated or detected by a 
polynucleotide or polypeptide of the present invention include, but are not limited to 
neoplasms located in the: abdomen, bone, breast, digestive system, liver, pancreas, 
peritoneum, endocrine glands (adrenal, parathyroid, pituitary, testicles, ovary, thymus, 
thyroid), eye, head and neck, nervous (central and peripheral), lymphatic system, 
pelvic, skin, soft tissue, spleen, thoracic, and urogenital. 

Similarly, other hyperproliferative disorders can also be treated or detected by a 
polynucleotide or polypeptide of the present invention. Examples of such 
hyperproliferative disorders mclude, but are not limited to: hypergammaglobulinemia, 
lymphoproliferative disorders, paraproteinemias, purpura, sarcoidosis, Sezary 
Syndrome, Waldenstron's Macroglbbulinemia, Gaucher's Disease, histiocytosis, and 
any other hyperproliferative disease, besides neoplasia, located in an organ system 
listed above. 

Infectious Disease 

A polypeptide or polynucleotide of the present invention can be used to treat or 
detect infectious agents. For example, by increasing tiie immune response, particularly 
increasing the proliferation and differentiation of B and/or T cells, infectious diseases 
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may be treated. The immune response may be increased by either enhancing an existing 
immune response, or by initiating a new immune response. Alternatively, the 
polypeptide or polynucleotide of the present invention may also directly inhibit the 
infectious agent, without necessarily eliciting an inunune response. 
5 Viruses are one example of an infectious agent that can cause disease or 

symptoms that can be treated or detected by a polynucleotide or polypeptide of the 
present invention. Examples of viruses, include, but are not limited to the following 
DNA and RNA viral families: Arbovirus, Adenoviridae, Arenaviridae, Arterivims, 
Bimaviridae, Bunyaviridae, Caliciviridae, Circoviridae, Coronaviridae, Flaviviridae, 

10 Hepadnaviridae (Hepatitis), Herpes viridae (such as, Cytomegalovirus, Herpes 
Simplex, Herpes Zoster), Mononegavirus (e.g!, Paramyxoviridae, Morbillivirus, 
Rhabdoviridae), Orthomyxoviridae (e.g.. Influenza), Papovaviridae, Parvoviridae, 
Picomavuridae, Poxviridae (such as Smallpox or Vaccinia), Reoviridae (e.g.. 
Rotavirus), Retroviridae (HTLV-I, HTLV-II, Lentivirus), and Togaviridae (e.g., 

1 5 Rubi virus). Viruses falling within these families can cause a variety of diseases or 
symptoms, including, but not limited to: arthritis, bronchioUitis, encephalitis, eye 
infections (e.g., conjunctivitis, keratitis), chronic fatigue syndrome, hepatitis (A, B, C. 
E, Chronic Active. Delta), meningitis, opportunistic infections (e.g., AIDS), 
pneumonia, Burkitt's Lymphoma, chickenpox , hemorrhagic fever, Measles, Mumps, 

20 Parainfluenza, Rabies, the common cold. Polio, leukemia, Rubella, sexually 

U-ansmitted diseases, skin diseases (e.g., Kaposi's, warts), and viremia. A polypeptide 
or polynucleotide of the present invention can be used to treat or detect any of these 
symptoms or diseases. 

Similarly, bacterial or fungal agents that can cause disease or symptoms and that 

25 can be treated or detected by a polynucleotide or polypeptide of the present invention 
include, but not limited to, the following Gram-Negative and Gram-positive bacterial 
families and fungi: Actinomycetales (e.g., Corynebacterium, Mycobacterium, 
Norcardia), Aspergillosis, Bacillaceae (e.g.. Anthrax, Clostridium), Bacteroidaceae, 
Blastomycosis, Bordetella, Borrelia, Brucellosis, Candidiasis, Campylobacter, 

30 Coccidioidomycosis, Cryptococcosis, Dermatocycoses, Enterobacteriaceae (Klebsiella, 
Salmonella, Serratia, Yersinia), Erysipelothrix, Helicobacter, Legionellosis, 
Leptospirosis, Listeria, Mycoplasmatales, Neisseriaceae (e.g., Acinetobacter, 
Gonorrhea, Menigococcal), Pasteurellacea Infections (e.g., Actinobacillus, 
Heamophilus, Pasteurella), Pseudomonas, Rickettsiaceae, Chlamydiaceae, Syphilis, 

35 and Staphylococcal. These bacterial or fungal families can cause the following diseases 
or symptoms, including, but not limited to: bacteremia, endocarditis, eye infections 
(conjunctivitis, tuberculosis, uveitis), gingivitis, opportunistic infections (e.g., AIDS 
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related infections), paronychia, prosthesis-related infections, Reiter's Disease, 
respiratory tract infections, such as Whooping Cough or Empyema, sepsis. Lyme 
Disease, Cat-Scratch Disease, Dysentery, Paratyphoid Fever, food poisoning. 
Typhoid, pneumonia. Gonorrhea, meningitis, Chlamydia, Syphilis, Diphtheria, 
5 Leprosy, Paratuberculosis, Tuberculosis, Lupus, Botulism, gangrene, tetanus, 

impetigo. Rheumatic Fever, Scarlet Fever, sexually transmitted diseases; skin diseases 
(e.g., cellulitis, dermatocycoses), toxemia, urinary tract infections, wound infections. 
A polypeptide or polynucleotide of the present invention can be used to treat or detect 
any of these symptoms or diseases. 

10 Moreover, parasitic agents causing disease or symptoms that can be treated or 

detected by a polynucleotide or polypeptide of the present invention include, but not 
limited to, the following families: Amebiasis, Babesiosis, Coccidiosis, 
Cryptosporidiosis, Dientamoebiasis, Dourine, Ectoparasitic, Giardiasis, Helminthiasis, 
Leishmaniasis, Theileriasis, Toxoplasmosis, Trypanosomiasis, and Trichomonas. 

15 These parasites can cause a variety of diseases or symptoms, including, but not limited 
to: Scabies, Trombiculiasis, eye infections, intestinal disease (e.g., dysentery, 
giardiasis), liver disease, lung disease, opportunistic infections (e.g., ATOS related). 
Malaria, pregnancy complications, and toxoplasmosis. A polypeptide or polynucleotide 
of the present invention can be used to treat or detect any of these symptoms or 

20 diseases. 

Preferably, treatment using a polypeptide or polynucleotide of the present 
invention could either be by administering an effective amount of a polypeptide to tiie 
patient, or by removing cells from the patient, supplying the cells with a polynucleotide 
of die present invention, and returning the engineered cells to the patient (ex vivo 
25 therapy). Moreover, the polypeptide or polynucleotide of the present invention can be 
used as an antigen in a vaccme to raise an immune response against infectious disease. 

Regeneratifti^ 

A polynucleotide or polypeptide of die present invention can be used to 
30 differentiate, proliferate, and atti-act ceUs, leading to die regeneration of tissues. (See, 
Science 276:59-87 (1997).) The regeneration of tissues could be used to repair, 
replace, or protect tissue damaged by congenital defects, ti-auma (wounds, bums, 
incisions, or ulcers), age, disease (e.g. osteoporosis, osteocarthritis, periodontal 
disease, liver failure), surgery, including cosmetic plastic surgery, fibrosis, reperfiision 
35 injury, or systemic cytokine damage. 

Tissues ttiat could be regenerated using the present invention include organs 
(e.g., pancreas, Uver, intestine, kidney, skin, endotiielium), muscle (smooth, skeletal 
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or cardiac), vascular (including vascular endothelium), nervous, hematopoietic, and 
skeletal (bone, cartilage, tendon, and ligament) tissue. Preferably, regeneration occurs 
without or decreased scarring. Regeneration also may include angiogenesis. 

Moreover, a polynucleotide or polypeptide of the present invention may increase 
regeneration of tissues difficult to heal. For example, increased tendon/ligament 
regeneration would quicken recovery time after damage. A polynucleotide or 
polypeptide of the present invention could also be used prophylactically in an effort to 
avoid damage. Specific diseases that could be treated include of tendinitis, carpal tunnel 
syndrome, and other tendon or ligament defects. A further example of tissue 
regeneration of non-healing wounds includes pressure ulcers, ulcers associated with 
vascular insufficiency, surgical, and traumadc wounds. 

Similarly, nerve and brain tissue could also be regenerated by using a 
polynucleotide or polypeptide of the present invention to proliferate and differentiate 
nerve cells. Diseases that could be treated using this method include central and 
peripheral nervous system diseases, neuropathies, or mechanical and traumatic 
disorders (e.g., spinal cord disorders, head trauma, cerebrovascular disease, and 
stoke). Specifically, diseases associated with peripheral nerve injuries, peripheral 
neuropathy (e.g., resulting from chemotherapy or other medical therapies), localized 
neuropathies, and central nervous system diseases (e.g., Alzheimer's disease, 
Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy- 
Drager syndrome), could all be treated using the polynucleotide or polypeptide of the 
present invention. 

Chemotaxis 

A polynucleotide or polypeptide of the present invention may have chemotaxis 
activity. A chemotaxic molecule attracts or mobilizes cells (e.g., monocytes, 
fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial 
cells) to a particular site in the body, such as inflammation, infection, or site of 
hyperproliferation. The mobilized cells can then fight off and/or heal the particular 
trauma or abnormality. 

A polynucleotide or polypeptide of the present invention may increase 
chemotaxic activity of particular cells. These chemotactic molecules can then be used to 
treat inflammation, infection, hyperproliferative disorders, or any immune system 
disorder by increasing the number of cells targeted to a particular location in the body. 
For example, chemotaxic molecules can be used to treat wounds and other trauma to 
tissues by attracting immune cells to the injured location. Chemotactic molecules of the 
present invention can also attract fibroblasts, which can be used to treat wounds. 
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It is also contemplated that a polynucleotide or polypeptide of the present 
invention may inhibit chemotactic activity. These molecules could also be used to treat 
disorders. Thus, a polynucleotide or polypeptide of the present invention could be used 
as an inhibitor of chemotaxis. 

5 

Binding Activity 

A polypeptide of the present invention may be used to screen for molecules that 
bind to the polypeptide or for molecules to which the polypeptide binds. The binding 
of the polypeptide and the molecule may activate (agonist), increase, inhibit 
10 (antagonist), or decrease activity of the polypeptide or the molecule bound. Examples 
of such molecules include antibodies, ohgonucleotides, proteins (e.g.. receptors),or 
small molecules. 

Preferably, the molecule is closely related to the natural ligand of the 
polypeptide, e.g., a fragment of the ligand, or a natural substrate, a ligand, a structural 

15 or functional mimetic. (See, Coligan et al.. Current Protocols in Inmiunology 

l(2):Chapter 5 (1991).) Similarly, the molecule can be closely related to the natural 
receptor to which the polypeptide binds, or at least, a fragment of the receptor capable 
of being bound by the polypeptide (e.g., active site). In either case, the molecule can 
be rationally designed using known techniques. 

20 Preferably, the screening for these molecules involves producing appropriate 

cells which express the polypeptide, either as a secreted protein or on the cell 
membrane. Preferred cells include cells from mammals, yeast, Drosophila, or E. colL 
Cells expressing the polypeptide (or cell membrane containing the expressed 
polypeptide) are then preferably contacted with a test compound potentially containing 

25 the molecule to observe binding, stimulation, or inhibition of activity of either the 
polypeptide or the molecule. 

The assay may simply test binding of a candidate compound to the polypeptide, 
wherein binding is detected by a label, or in an assay involving competition with a 
labeled competitor. Further, the assay may test whether the candidate compound results 

30 in a signal generated by binding to the polypeptide. 

Alternatively, the assay can be carried out using cell-free preparations, 
polypeptide/molecule affixed to a solid support, chemical libraries, or natural product 
mixtures. The assay may also simply comprise the steps of mixing a candidate 
compound with a solution containing a polypeptide, measuring polypeptide/molecule 

35 activity or bindmg, and comparing the polypeptide/molecule activity or binding to a 
standard. 
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Preferably, an ELISA assay can measure polypeptide level or activity in a 
sample (e.g., biological sample) using a monoclonal or polyclonal antibody. The 
antibody can measure polypeptide level or activity by either binding, directly or 
indirectly, to the polypeptide or by competing with the polypeptide for a substrate. 
5 All of these above assays can be used as diagnostic or prognostic markers. The 

molecules discovered using these assays can be used to treat disease or to bring about a 
particular result in a patient (e.g., blood vessel growth) by activating or inhibiting the 
polypeptide/molecule. Moreover, the assays can discover agents which may inhibit or 
enhance the production of the polypeptide from suitably manipulated cells or tissues. 

10 Therefore, the invention includes a method of identifying compounds which 

bind to a polypeptide of the invention comprising the steps of: (a) incubating a 
candidate binding compound with a polypeptide of the invention; and (b) determining if 
binding has occurred. Moreover, the invention includes a method of identifying 
agonists/antagonists comprising the steps of: (a) incubating a candidate compound with 

1 5 a polypeptide of the invention, (b) assaymg a biological activity , and (b) determining if 
a biological activity of the polypeptide has been altered. 

Other Activities 

A polypeptide or polynucleotide of the present invention may also increase or 
20 decrease the differentiation or proliferation of embryonic stem cells, besides, as 

discussed above, hematopoietic lineage. 

A polypeptide or polynucleotide of the present invention may also be used to 

modulate mammalian characteristics, such as body height, weight, hair color, eye color, 

skin, percentage of adipose tissue, pigmentation, size, and shape (e.g., cosmetic 
25 surgery). Similarly, a polypeptide or polynucleotide of the present invention may be 

used to modulate manmialian metabolism affecting catabolism, anabolism, processing, 

utilization, and storage of energy. 

A polypeptide or polynucleotide of the present invention may be used to change 

a mammal's mental state or physical state'by influencing biorhythnas, caricadic 
30 rhythms, depression (including depressive disorders), tendency for violence, tolerance 

for pain, reproductive capabilities (preferably by Activin or Inhibin-like activity), 

hormonal or endocrine levels, appetite, libido, memory, stress, or other cognitive 

qualities. 

A polypeptide or polynucleotide of the present invention may also be used as a 
35 food additive or preservative, such as to increase or decrease storage capabilities, fat 
content, lipid, protein, carbohydrate, vitamins, minerals, cofactors or other nutritional 
components. 
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Other Preferred Embodiments 

Other preferred embodiments of the claimed invention include an isolated 
nucleic acid molecule comprising a nucleotide sequence which is at least 95% identical 
5 to a sequence of at least about 50 contiguous nucleotides in the nucleotide sequence of 
SEQ ID NO:X wherein X is any integer as defined in Table 1 . 

Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about Uie position of the 5' Nucleotide of the 
10 Clone Sequence and ending with the nucleotide at about the position of the 3' 
Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in Table 1 . 

Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the 5' Nucleotide of the 
15 Start Codon and ending with the nucleotide at about the position of the 3' Nucleotide of 
the Clone Sequence as defined for SEQ ID NO:X in Table 1. 

Similarly preferred is a nucleic acid molecule wherein said sequence of 
contiguous nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the 
range of positions beginning with the nucleotide at about the position of the 5' 
20 Nucleotide of the First Amino Acid of the Signal Peptide and ending with the nucleotide 
at about the position of the 3* Nucleotide of the Clone Sequence as defined for SEQ ID 
NO:X in Table 1. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least about 150 contiguous 
25 nucleotides in the nucleotide sequence of SEQ ID NO:X. 

Further preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least about 500 contiguous 
nucleotides in the nucleotide sequence of SEQ ID NO:X. 

A fiuther preferred embodiment is a nucleic acid molecule comprising a 
30 nucleotide sequence which is at least 95% identical to the nucleotide sequence of SEQ 
ID NO:X beginning with the nucleotide at about the position of the 5* Nucleotide of the 
First Amino Acid of the Signal Peptide and ending with the nucleotide at about the 
position of the 3' Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in 
Table 1. 
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A further preferred embodiment is an isolated nucleic acid molecule comprising 
a nucleotide sequence which is at least 95% identical to the complete nucleotide 
sequence of SEQ ID NO:X. 

Also preferred is an isolated nucleic acid molecule which hybridizes under 
5 stringent hybridization conditions to a nucleic acid molecule, wherein said nucleic acid 
molecule which hybridizes does not hybridize under stringent hybridization conditions 
to a nucleic acid molecule having a nucleotide sequence consisting of only A residues or 
of only T residues. 

Also preferred is a composition of matter comprising a DNA molecule which 
10 comprises a human cDNA clone identified by a cDNA Clone Identifier in Table 1 , 
which DNA molecule is contained in the material deposited with the American Type 
Culture Collection and given the ATCC Deposit Number shown in Table 1 for said 
cDNA Clone Identifier. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
1 5 sequence which is at least 95% identical to a sequence of at least 50 contiguous 

nucleotides in the nucleotide sequence of a human cDNA clone identified by a cDNA 
Clone Identifier in Table I, which DNA molecule is contained in the deposit given the 
ATCC Deposit Number shown in Table 1 . 

Also preferred is an isolated nucleic acid molecule, wherein said sequence of at 
20 least 50 contiguous nucleotides is included in the nucleotide sequence of the complete 
open reading frame sequence encoded by said human cDNA clone. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to sequence of at least 150 contiguous 
nucleotides in the nucleotide sequence encoded by said human cDNA clone. 
25 A further preferred embodiment is an isolated nucleic acid molecule comprising 

a nucleotide sequence which is at least 95% identical to sequence of at least 500 
contiguous nucleotides in the nucleotide sequence encoded by said human cDNA clone. 

A further preferred embodiment is an isolated nucleic acid molecule comprising 
a nucleotide sequence which is at least 95% identical to the complete nucleotide 
30 sequence encoded by said human cDN A clone. 

A further preferred embodiment is a method for detecting in a biological sample 
a nucleic acid molecule comprising a nucleotide sequence which is at least 95% identical 
to a sequence of at least 50 contiguous nucleotides in a sequence selected frorh the 
group consisting of: a nucleotide sequence of SEQ ID NO:X wherein X is any integer 
35 as defined in Table 1 ; and a nucleotide sequence encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table 1 and contained fa the deposit with the 
ATCC Deposit Number shown for said cDNA clone in Table 1; which method 
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comprises a step of comparing a nucleotide sequence of at least one nucleic acid 
molecule in said sample with a sequence selected from said group and determining 
whether the sequence of said nucleic acid molecule in said sample is at least 95% 
identical to said selected sequence. 

Also preferred is the above method wherein said step of comparing sequences 
comprises determining the extent of nucleic acid hybridization between nucleic acid 
molecules in said sample and a nucleic acid molecule comprising said sequence selected 
from said group. Similarly, also preferred is the above method wherein said step of 
comparing sequences is performed by comparing the nucleotide sequence determined 
from a nucleic acid molecule in said sample with said sequence selected from said 
group. The nucleic acid molecules can comprise DNA molecules or RNA molecules. 

A further preferred embodiment is a method for identifying the species, tissue or 
cell type of a biological sample which method comprises a step of detecting nucleic acid 
molecules in said sample, if any, comprising a nucleotide sequence that is at least 95% 
identical to a sequence of at least 50 contiguous nucleotides in a sequence selected from 
the group consisting of: a nucleotide sequence of SEQ ID NO:X wherein X is any 
integer as defined in Table 1 ; and a nucleotide sequence encoded by a human cDN A 
clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with 
the ATCC Deposit Number shown for said cDN A clone in Table 1 . 

The method for identifying the species, tissue or cell type of a biological sample 
can comprise a step of detecting nucleic acid molecules comprising a nucleotide 
sequence in a panel of at least two nucleotide sequences, wherein at least one sequence 
in said panel is at least 95% identical to a sequence of at least 50 contiguous nucleotides 
in a sequence selected from said group. 

Also preferred is a method for diagnosing in a subject a pathological condition 
associated with abnormal structure or expression of a gene encoding a secreted protein 
identified in Table 1, which method comprises a step of detecting in a biological sample 
obtained fi-om said subject nucleic acid molecules, if any, comprising a nucleotide 
sequence that is at least 95% identical to a sequence of at least 50 contiguous 
nucleotides in a sequence selected from the group consisting of: a nucleotide sequence 
of SEQ ID NO:X wherein X is any integer as defined in Table 1 ; and a nucleotide 
sequence encoded by a human cDNA clone identified by a cDN A Clone Identifier in 
Table 1 and contained in the deposit witii tiie ATCC Deposit Number shown for said 
cDN A clone in Table 1 . 

The method for diagnosing a pathological condition can comprise a step of 
detecting nucleic acid molecules comprising a nucleotide sequence in a panel of at least 
two nucleotide sequences, wherein at least one sequence in said panel is at least 95% 
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identical to a sequence of at least 50 contiguous nucleotides in a sequence selected from 
said group. 

Also preferred is a composition of matter comprising isolated nucleic acid 

molecules wherein the nucleotide sequences of said nucleic acid molecules comprise a 
5 panel of at least two nucleotide sequences, wherein at least one sequence in said panel is 

at least 95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 

selected from the group consisting of: a nucleotide sequence of SEQ ID NO:X wherein 

X is any integer as defined in Table 1 ; and a nucleotide sequence encoded by a human 

cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained in the 
10 deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1. The 

nucleic acid molecules can comprise DNA molecules or RNA molecules. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 

least 90% identical to a sequence of at least about 10 contiguous amino acids in the 

amino acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1. 
15 Also preferred is a polypeptide, wherein said sequence of contiguous amino 

acids is included in the amino acid sequence of SEQ ID NO:Y in the range of positions 

beginning with the residue at about the position of the First Amino Acid of the Secreted 

Portion and ending with the residue at about the Last Amino Acid of the Open Reading 

Frame as set forth for SEQ ID NO: Y in Table 1 . 
20 Also preferred is an isolated polypeptide comprising an amino acid sequence at 

least 95% identical to a sequence of at least about 30 contiguous amino acids in the 

amino acid sequence of SEQ ID NO: Y. 

Further preferred is an isolated polypeptide comprising an amino acid sequence 

at least 95% identical to a sequence of at least about 100 contiguous amino acids in the 
25 amino acid sequence of SEQ ID NO: Y. 

Further preferred is an isolated polypeptide comprising an amino acid sequence 

at least 95% identical to the complete amino acid sequence of SEQ ID NO: Y. 

Further preferred is an isolated polypeptide comprising an amino acid sequence 

at least 90% identical to a sequence of at least about 10 contiguous amino acids in the 
30 complete amino acid sequence of a secreted protein encoded by a human cDNA clone 

identified by a cDNA Qone Identifier in Table 1 and contained in die deposit vnxh the 

ATCC Deposit Number shown for said cDNA clone in Table 1. 

Also preferred is a polypeptide wherein said sequence of contiguous amino 

acids is mcluded in the amino acid sequence of a secreted portion of die secreted protein 
35 encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and 

contained in the deposit with the ATCC Deposit Number shown for said cDN A clone in 

Table 1. 
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Also preferred is an isolated polypeptide comprising an anuno acid sequence at 
least 95% identical to a sequence of at least about 30 contiguous amino acids in the 
amino acid sequence of the secreted portion of the protein encoded by a human cDNA 
clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with 
5 the ATCC Deposit Number shown for said cDNA clone in Table 1 . 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to a sequence of at least about 100 contiguous amino acids in the 
amino acid sequence of the secreted portion of the protein encoded by a human cDNA 
clone identified by a cDNA Clone Identifier in Table 1 and contained in die deposit with 
10 the ATCC Deposit Number shown for said cDNA clone in Table 1 . 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to the amino acid sequence of the secreted portion of the protein 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table I and 
contained in the deposit with the ATCC Deposit Number shown for said cDN A clone in 
15 Table 1. 

Further preferred is an isolated antibody which binds specifically to a 
polypeptide comprising an amino acid sequence that is at least 90% identical to a 
sequence of at least 10 contiguous amino acids in a sequence selected from the group 
consisting of: an amino acid sequence of SEQ ID NO: Y wherein Y is any integer as 

20 defined in Table 1 ; and a complete amino acid sequence of a protein encoded by a 

human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained in 
the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1. 

Further preferred is a method for detecting in a biological sample a polypeptide 
comprising an amino acid sequence which is at least 90% identical to a sequence of at 

25 least 10 contiguous amino acids in a sequence selected from the group consisting of: an 
amino acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1; 
and a complete amino acid sequence of a protein encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the 
ATCC Deposit Number showh for said cDNA clone in Table 1; which method 

30 comprises a step of comparing an amino acid sequence of at least one polypeptide 
molecule in said sample with a sequence selected from said group and determining 
whether the sequence of said polypeptide molecule in said sample is at least 90% 
identical to said sequence of at least 10 contiguous amino acids. 

Also preferred is the above method wherein said step of comparing an amino 

35 acid sequence of at least one polypeptide molecule in said sample widi a sequence 
selected fi-om said group comprises determining the extent of specific binding of 
polypeptides in said sample to an antibody which binds specifically to a polypeptide 
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comprising an amino acid sequence that is at least 90% identical to a sequence of at least 
10 contiguous amino acids in a sequence selected from the group consisting of: an 
amino acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1; 
and a complete amino acid sequence of a protein encoded by a human cDNA clone 
5 identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the 
ATCC Deposit Number shown for said cDNA clone in Table 1 . 

Also preferred is the above method wherein said step of comparing sequences is 
performed by comparing the amino acid sequence determined from a polypeptide 
molecule in said sample with said sequence selected from said group. 

10 Also preferred is a method for identifying the species, tissue or cell type of a 

biological sample which method comprises a step of detecting polypeptide molecules in 
said sample, if any, comprising an amino acid sequence that is at least 90% identical to 
a sequence of at least 10 contiguous amino acids in a sequence selected from the group 
consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y is any integer as 

15 defined in Table 1 ; and a complete amino acid sequence of a secreted protein encoded 
by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained 
in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1. 

Also preferred is the above method for identifying the species, tissue or cell type 
of a biological sample, which method comprises a step of detecting polypeptide 

20 molecules comprising an amino acid sequence in a panel of at least two amino acid 
sequences, wherein at least one sequence in said panel is at least 90% identical to a 
sequence of at least 10 contiguous amino acids in a sequence selected from the above 
group. 

Also preferred is a method for diagnosing in a subject a pathological condition 
25 associated with abnormal structure or expression of a gene encoding a secreted protein 
identified in Table 1, which method comprises a step of detecting in a biological sample 
obtained from said subject polypeptide molecules comprising an amino acid sequence in 
a panel of at least two amino acid sequences, wherein at least one sequence in said panel 
is at least 90% identical to a sequence of at least 10 contiguous amino acids in a 
30 sequence selected fiom the group consisting of: an amino acid sequence of SEQ ID 
NO:Y wherein Y is any integer as defined in Table 1; and a complete amino acid 
sequence of a secreted protein encoded by a human cDNA clone identified by a cDNA 
Clone Identifier in Table 1 and contained in the deposit with the ATCC Deposit Number 
shown for said cDN A clone in Table 1 . 
35 In any of these methods, the step of detecting said polypeptide molecules 

includes using an antibody. 
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Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a nucleotide sequence encoding a 
polypeptide wherein said polypeptide comprises an amino acid sequence that is at least 
90% identical to a sequence of at least 10 contiguous amino acids in a sequence selected 
from the group consisting of: an amino acid sequence of SEQ ID NO: Y wherein Y is 
any integer as defined in Table 1 ; and a complete amino acid sequence of a secreted 
protein encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 
1 and contained in the deposit with die ATCC Deposit Number shown for said cDNA 
clone in Table 1 . 

Also preferred is an isolated nucleic acid molecule, wherein said nucleotide 
sequence encoding a polypeptide has been optimized for expression of said polypeptide 
in a prokaryotic host. 

Also preferred is an isolated nucleic acid molecule, wherein said polypeptide 
comprises an amino acid sequence selected from the group consisting of: an amino acid 
sequence of SEQ ID NO: Y wherem Y is any integer as defined in Table 1 ; and a 
complete amino acid sequence of a secreted protein encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table 1 and contained in die deposit with the 
ATCC Deposit Number shown for said cDN A clone in Table 1 . 

Further preferred is a method of making a recombinant vector comprising 
inserting any of the above isolated nucleic acid molecule into a vector. Also preferred is 
the recombinant vector produced by this method. Also preferred is a method of making 
a recombinant host cell comprising introducing the vector into a host cell, as well as die 
recombinant host cell produced by tiiis method. 

Also preferred is a method of making an isolated polypeptide comprising 
culturing this recombinant host cell under conditions such tiiat said polypeptide is 
expressed and recovering said polypeptide. Also preferred is this metiiod of making an 
isolated polypeptide, wherein said recombinant host cell is a eukaryotic cell and said 
polypeptide is a secreted portion of a human secreted protein comprising an amino acid 
sequence selected from the group consisting of: an amino acid sequence of SEQ ID 
NO: Y beginning widi the residue at the position of the First Amino Acid of die Secreted 
Portion of SEQ ID NO: Y wherein Y is an integer set forth in Table 1 and said position 
of die First Amino Acid of die Secreted Portion of SEQ ID NO: Y is defined in Table 1 ; 
and an amino acid sequence of a secreted portion of a protein encoded by a human 
cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained in the 
deposit with die ATCC Deposit Number shown for said cDNA clone in Table 1. The 
isolated polypeptide produced by this mediod is also preferred. 
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_ Also preferred is a method of treatment of an individual in need of an increased 
level of a secreted protein activity, which method comprises administering to such an 
individual a pharmaceutical composition comprising an amount of an isolated 
polypeptide, polynucleotide, or antibody of the claimed invention effective to increase 
5 the level of said protein activity in said individual. 

Having generally described the invention, the same will be more readily 
understood by reference to the following examples, which are provided by way of 
illustration and are not intended as limiting. 

10 Examples 

Example 1: Isolation of a Selected cDNA Clone From the Dep ositpH 
Sample 

Each cDN A clone in a cited ATCC deposit is contained in a plasmid vector. 
15 Table 1 identifies the vectors used to construct the cDNA library from which each clone 
was isolated. In many cases, the vector used to construct the library is a phage vector 
from which a plasmid has been excised. The table immediately below correlates the 
related plasmid for each phage vector used in constructing the cDNA library. For 
example, where a particular clone is identified in Table 1 as being isolated in the vector 
20 "Lambda Zap," the corresponding deposited clone is in "pBluescript." 

Vector Used to Construct Library Corresponding Deposited Pla^smiH 

Lambda Zap pBluescript (pBS) 

Uni-Zap XR pBluescript (pBS) 

Zap Express pBK 
25 lafmidBA plafinidBA 

pSportl pSportl 
pCMVSport2.0 pCMVSport2.0 
pCMVSportS.O pCMVSportS.O 
PCR^2.1 pCR®2.1 
30 Vectors Lambda Zap (U.S. Patent Nos. 5,128,256 and 5,286,636), Uni-Zap 

XR (U.S. Patent Nos. 5,128, 256 and 5,286,636), Zap Express (U.S. Patent Nos. 
5,128,256 and 5,286,636), pBluescript (pBS) (Short, J. M. et al.. Nucleic Acids Res. 
16:7583-7600 (1988); Alting-Mees, M. A. and Short, J. M., Nucleic Acids Res. 
17:9494 (1989)) and pBK (Alting-Mees, M. A. et al.. Strategies 5:58-61 (1992)) are 
35 commercially available from Stratagene Cloning Systems, Inc., 1 101 1 N. Torrey Pines 
Road, La Jolla, CA, 92037. pBS contains an ampicillin resistance gene and pBK 
contains a neomycin resistance gene. Both can be transformed into E. coli strain XL-1 
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Blue, also available from Stratagene. pBS comes in 4 forms SK+, SK-, KS+ and KS. 
The S and K refers to the orientation of the polylinker to die T7 and T3 primer 
sequences which flank the polylinker region ("S" is for Sad and "K" is for Kpnl which 
are the first sites on each respective end of the Unker). or refer to the orientation 
of the f 1 origin of replication ("ori"), such that in one orientation, single stranded rescue 
initiated from the f 1 ori generates sense strand DNA and in the other, antisense. 

Vectors pSportl, pCMVSport 2.0 and pCMVSport 3.0, were obtained from 
Life Technologies, Inc., P. O. Box 6009, Gaithersburg, MD 20897. All Sport vectors 
contain an ampicillin resistance gene and may be transformed into E. coli strain 
DHIOB, also available from Life Technologies. (See, for instance, Gruber, C. E., et 
aL, Focus 15:59 (1993).) Vector lafmid BA (Bento Soares, Columbia University, NY) 
contains an ampicillin resistance gene and can be transformed into E. coli strain XL- 1 
Blue. Vector pCR®2,l, which is available from Invitrogen, 1600 Faraday Avenue, 
Carlsbad, CA 92008, contains an ampicillin resistance gene and may be transformed 
into E. coli strain DHIOB, available from Life Technologies. XSee, for instance, Clark, 
J. M., Nuc. Acids Res. 16:9677-9686 (1988) and Mead, D. et al., Bio/Technology 9: 
(1991).) Preferably, a polynucleotide of the present invention does not comprise the 
phage vector sequences identified for the particular clone in Table 1, as well as the 
corresponding plasmid vector sequences designated above. 

The deposited material in the sample assigned the ATCC Deposit Number cited 
in Table 1 for any given cDNA clone also may contain one or more additional plasmids, 
each comprising a cDNA clone different from that given clone. Thus, deposits sharing 
the same ATCC Deposit Number contain at least a plasmid for each cDNA clone 
identified m Table 1. Typically, each ATCC deposit sample cited in Table 1 comprises 
a mixture of approximately equal amounts (by weight) of about 50 plasmid DNAs, each 
containing a different cDNA clone; but such a deposit sample may include plasmids for 
more or less than 50 cDN A clones, up to about 500 cDNA clones. 

Two approaches can be used to isolate a particular clone from the deposited 
sample of plasmid DNAs cited for that clone m Table 1. First, a plasmid is directly 
isolated by screening the clones using a polynucleotide probe corresponding to SEQ ID 
NO:X. 

Particularly, a specific polynucleotide with 30-40 nucleotides is synthesized 
using an Applied Biosystems DNA synthesizer according to the sequence reported. 
The oligonucleotide is labeled, for instance, with ^^P-y- ATP using T4 polynucleotide 
kinase and purified according to routine methods. (E.g., Maniatis et al.. Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring, NY (1982).) 
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The plasmid mixture is transfonned into a suitable host, as indicated above (such as 
XL-1 Blue (Stratagene)) using techniques known to those of skill in the art, such as 
those provided by the vector supplier or in related publications or patents cited above. 
The transformants are plated on 1.5% agar plates (containing the appropriate selection 
5 agent, e.g., ampicillin) to a density of about 150 transformants (colonies) per plate. 
These plates are screened using Nylon membranes according to routine methods for 
bacterial colony screening (e.g., Sambrook et al.. Molecular Cloning: A Laboratory 
Manual, 2nd Edit., (1989), Cold Spring Harbor Laboratory Press, pages 1.93 to 
L 104), or other techniques known to those of skill in the art. 

10 Alternatively, two primers of 17-20 nucleotides derived from both ends of the 

SEQ ID NO:X (i.e., within the region of SEQ ID NO:X bounded by the 5* NT and the 
3' NT of the clone defined in Table 1) are synthesized and used to amplify the desired 
cDNA using the deposited cDNA plasmid as a template. The polymerase chain reaction 
is carried out under routine conditions, for instance, in 25 jil of reaction mixture with 

1 5 0.5 ug of the above cDNA template. A convenient reaction mixture is 1 .5-5 mM 

MgClj, 0.01 % (w/v) gelatin, 20 \jM each of dATP, dCTP, dGTP, dTTP, 25 pmol of 
each primer and 0.25 Unit of Taq polymerase. Thirty five cycles of PGR (denaturation 
at 94°C for 1 min; annealing at 55°C for 1 min; elongation at 72°C for 1 min) are 
performed with a Peridn-Elmer Cetus automated thermal cycler. The amphfied product 

20 is analyzed by agarose gel electrophoresis and the DN A band with expected molecular 
weight is excised and purified. The PGR product is verified to be the selected sequence 
by subcloning and sequencing the DNA product. 

Several methods are available for the identification of the 5' or 3' non-coding 
portions of a gene which may not be present in the deposited clone. These methods 

25 include but are not limited to, filter probing, clone enrichment using specific probes, 
and protocols similar or identical to 5' and 3' "RAGE" protocols which are well known 
in the art. For instance, a method similar to 5' RACE is available for generating the 
missing 5* end of a desired full-length transcript. (Fromont-Racine et al.. Nucleic Acids 
Res. 21(7):1683-1684 (1993).) 

30 Briefly, a specific RNA oligonucleotide is ligated to the 5' ends of a population 

of RNA presumably containing full-length gene RNA transcripts. A primer set 
containing a primer specific to the ligated RNA oligonucleotide and a primer specific to 
a known sequence of the gene of interest is used to PGR amplify the 5' portion of the 
desired full-length gene. This amplified product may then be sequenced and used to 

35 generate the fiill length gene. 
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This above method starts with total RNA isolated from the desired source, 
although poly-A+ RNA can be used. The RNA preparation can then be treated with 
phosphatase if necessary to eliminate 5* phosphate groups on degraded or damaged 
RNA which may interfere with the later RNA ligase step. The phosphatase should then 
5 be inactivated and the RNA treated with tobacco acid pyrophosphatase in order to 
remove the cap structure present at the 5* ends of messenger RNAs. This reaction 
leaves a 5' phosphate group at the 5* end of the cap cleaved RNA which can then be 
ligated to an RNA oligonucleotide using T4 RNA ligase. 

This modified RNA preparation is used as a template for first strand cDNA 
10 synthesis using a gene specific oligonucleotide. The first strand synthesis reaction is 
used as a template for PGR amplification of the desired 5' end using a primer specific to 
the ligated RNA oligonucleotide and a primer specific to the known sequence of the 
gene of interest. The resultant product is then sequenced and analyzed to confirm that 
the 5* end sequence belongs to the desired gene. 

15 

Example 2; Isolation o f Genomic Clones Corresponding to a 
Polynucleotide 

A human genomic PI library (Genomic Systems, Inc.) is screened by PGR 
using primers selected for the cDNA sequence correspondmg to SEQ ID NO:X., 
20 according to the method described in Example 1. (See also, Sambrook.) 

Example 3: Tissue Distribution of Polypeptide 

Tissue distribution of mRN A expression of polynucleotides of the present 
invention is determined using protocols for Northern blot analysis, described by, 

25 among others, Sambrook et al. For example, a cDNA probe produced by the method 
described in Example 1 is labeled with P^^ using the rediprime™ DNA labehng system 
(Amersham Life Science), according to manufacturer's instructions. After labeling, the 
probe is purified using CHROMA SPIN- 100™ column (Clontech Laboratories, Inc.), 
according to manufacturer's protocol number PT1200-1. The purified labeled probe is 

30 then used to examine various human tissues for mRNA expression. 

Multiple Tissue Northern (MTN) blots contaming various human tissues (H) or 
human immune system tissues (IM) (Clontech) are examined with the labeled probe 
using ExpressHyb™ hybridization solution (Clontech) according to manufacturer's 
protocol number PT 1 1 90- 1 . Following hybridization and washing, the blots are 

35 mounted and exposed to fibn at -70*'C overnight, and the films developed according to 
standard procedures. 
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Example 4: Chromosomal Mapping of the Polynucleotides 

An oligonucleotide primer set is designed according to the sequence at the 5* 
endofSEQIDNO:X. This primer preferably spans about 100 nucleotides. This 
5 primer set is then used in a polymerase chain reaction under the following set of 

conditions : 30 seconds, 95X; 1 minute, 56°C; 1 minute, TO^'C. This cycle is repeated 
32 times followed by one 5 minute cycle at TO'^C. Human, mouse, and hamster DNA 
is used as template in addition to a somatic cell hybrid panel containing individual 
chromosomes or chromosome fragments (Bios, Inc). The reactions is analyzed on 
10 either 8% polyacrylamide gels or 3.5 % agarose gels. Chromosome mapping is 

determined by die presence of an approximately 100 bp PGR fragment in the particular 
somatic cell hybrid. 

Example 5i Bacterial Expression of a Polypeptide 

15 A polynucleotide encoding a polypeptide of the present invention is ampUfied 

using PGR oligonucleotide primers corresponding to the 5' and 3* ends of the DNA 
sequence, as outiined in Example 1, to synthesize insertion fragments. The primers 
used to amplify the cDNA insert shoxdd preferably contain restriction sites, such as 
BamHI and Xbal, at the 5' end of the primers in order to clone the amplified product 

20 into the expression vector. For example, BamHI and Xbal correspond to the restriction 
enzyme sites on the bacterial expression vector pQE-9. (Qiagen, Inc., Ghatswortii, • 
GA). This plasmid vector encodes antibiotic resistance (Amp^), a bacterial origin of 
replication (ori), an IPTG-regulatable promoter/operator (P/O), a ribosome binding site 
(RES), a 6-histidine tag (6-His), and restriction enzyme cloning sites. 

25 The pQE-9 vector is digested with BamHI and Xbal and the amplified fragment 

is ligated into the pQE-9 vector maintaining the reading frame initiated at the bacterial 
RBS. The ligation mixture is tiien used to transform die E. coli strain Ml 5/rep4 
(Qiagen, Inc.) which contains multiple copies of die plasmid pREP4, which expresses 
the lad repressor and also confers kanamycin resistance (Kan^. Transformants are 

30 identified by their ability to grow on LB plates and ampicillin/kanamycin resistant 
colonies are selected. Plasmid DNA is isolated and confirmed by restriction analysis. 

Clones containing the desired constructs are grown overnight (0/N) in liquid 
culture in LB media supplemented witii both Amp (100 ug/ml) and Kan (25 ug/ml). 
The O/N culture is used to inoculate a large culture at a ratio of 1: 100 to 1:250. The 

35 cells are grown to an optical density 600 (O.D.^ of between 0.4 and 0.6. IPTG 
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Gsopropyl-B-D-thiogalacto pyranoside) is then added to a final concentration of 1 mM. 
IPTG induces by inactivating the lad repressor, clearing the P/O leading to increased 
gene expression. 

Cells are grown for an extra 3 to 4 hours. Cells are then harvested by 
5 centrifugation (20 mins at 6000Xg). The cell pellet is solubilized in the chaotropic 
agent 6 Molar Guanidine HCl by stirring for 3-4 hours at 4^C. The cell debris is 
removed by centrifugation, and the supernatant containing the polypeptide is loaded 
onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin column (available from 
QIAGEN, Inc., supra). Proteins with a 6 x His tag bind to the Ni-NTA resin with high 

10 affinity and can be purified in a simple one-step procedure (for details see: The 
QIAexpressionist (1995) QIAGEN, Inc., supra). 

Briefly, the supernatant is loaded onto the column in 6 M guanidine-HCl, pH 8, 
the column is first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then washed 
with 10 volumes of 6 M guanidine-HCl pH 6, and finally the polypeptide is eluted with 

15 6 M guanidine^HCl, pH 5. 

The purified protein is then renatured by dialyzing it against phosphate-buffered 
saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCl. Alternatively, the 
protein can be successfully refolded while immobilized on the Ni-NTA column. The 
recommended conditions are as follows: renature using a linear 6M- IM urea gradient in 

20 500 mM NaCl, 20% glycerol, 20 mM TrisMCl pH 7.4, containing protease inhibitors. 
The renaturation should be performed over a period of 1.5 hours or more. After 
renaturation the proteins are eluted by the addition of 250 mM immidazole. Immidazole 
is removed by a final dialyzing step against PBS or 50 mM sodium acetate pH 6 buffer 
plus 200 mM NaCl. The purified protein is stored at 4** C or frozen at -80"^ C. 

25 In addition to the above expression vector, the present invention further includes 

an expression vector comprising phage operator and promoter elements operatively 
linked to a polynucleotide of the present invention, called pHE4a. (ATCC Accession 
Number 209645, deposited on February 25, 1998.) This vector contains: 1) a 
neomycinphosphotransferase gene as a selection marker, 2) an E. coli origin of 

30 replication, 3) a T5 phage promoter sequence, 4) two lac operator sequences, 5) a 

Shine-Delgamo sequence, and 6) the lactose operon repressor gene (laclq). The origin 
of replication (oriC) is derived from pUC19 (LTI, Gaithersburg, MD). The promoter 
sequence and operator sequences are made synthetically. 

DNA can be inserted into the pHEa by restricting the vector with Ndel and 

35 Xbal, BamHI, Xhol, or Asp7 1 8, running the restricted product on a gel, and isolating 
the larger fragment (the stuffer fragment should be about 3 10 base pairs). The DNA 
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insert is generated according to the PGR protocol described in Example 1 , using PGR 
primers having restriction sites for Ndel (5' primer) and Xbal, BamHI, Xhol, or 
Asp718 (3' primer). ,The PGR insert is gel purified and restricted with compatible 
enzymes. The insert and vector are ligated according to standard protocols. 
5 The engineered vector could easily be substituted in the above protocol to 

express protein in a bacterial system. 

Example 6: Purification of a Poivpeptide from an Inclusion Body 

The following alternative method can be used to purify a polypeptide expressed 
10 in £* coll when it is present in the form of inclusion bodies. Unless otherwise specified, 
all of the following steps are conducted at 4- lO^C. 

Upon completion of the production phase of the £. coli fermentation, the cell 
culture is cooled to 4-lOX and the cells harvested by continuous centrifugation at 
15,000 ipm (Heraeus Sepatech). On the basis of the expected yield of protein per unit 
15 weight of cell paste and the amount of purified protein required, an appropriate amount 
of cell paste, by weight, is suspended in a buffer solution containing 100 mM Tris, 50 
mM EDTA, pH 7,4. The cells are dispersed to a homogeneous suspension using a 
high shear mixer. 

The cells are then lysed by passing the solution through a microfluidizer 
20 (Microfuidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is 
then mixed with NaCl solution to a final concentration of 0.5 M NaGl, followed by 
centrifugation at 7000 xg for 15 min. The resultant pellet is washed again using 0.5M 
NaGl, 100 mM Tris, 50 mM EDTA, pH 7.4. 

The resulting washed inclusion bodies are solubihzed with 1 .5 M guanidine 
25 hydrochloride (GuHGl) for 2-4 hours. After 7000 xg centrifugation for 15 min., the 
pellet is discarded and the polypeptide containing supematant is incubated at 4°G 
overnight to allow further GuHCl extraction. 

Following high speed centrifugation (30,000 xg) to remove insoluble particles, 
the GuHGl solubilized protein is refolded by quickly mixing the GuHGl extract with 20 
30 volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by 

vigorous stirring. The refolded diluted protein solution is kept at 4*'G without mixing 
for 12 hours prior to further purification steps. 

To clarify the refolded polypeptide solution, a previously prepared tangential 
filtration unit equipped with 0.16 ^un membrane filter with appropriate surface area 
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(e.g.. Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The 
filtered sample is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive 
Biosystems). The colunm is washed with 40 mM sodium acetate, pH 6.0 and eluted 
with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same buffer, in a 
5 stepwise manner. The absorbance at 280 nm of the effluent is continuously monitored. 
Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the polypeptide are then pooled and mixed with 4 volumes 
of water. The diluted sample is then loaded onto a previously prepared set of tandem 
columns of suong anion (Poros HQ-50, Perseptive Biosystems) and weak anion 
10 (Poros CM-20, Perseptive Biosystems) exchange resins. The columns are equilibrated 
with 40 mM sodium acetate, pH 6.0. Both columns are washed with 40 mM sodium 
acetate, pH 6.0, 200 mM NaCl. The CM-20 column is then eluted using a 10 column 
volume linear gradient ranging from 0.2 M NaCl, 50 mM sodium acetate, pH 6.0 to 1 .0 
M NaCl, 50 mM sodium acetate, pH 6.5. Fractions are collected under constant A^^ 
15 monitoring of the effluent. Fractions containing the polypeptide (determined, for 
instance, by 16% SDS-PAGE) are then pooled. 

The resultant polypeptide should exhibit grcaterthan 95% purity after die above 
refolding and purification steps. No major contaminant bands should be observed from 

Commassie blue stained 16% SDS-PAGE gel when 5 ^g of purified protein is loaded. 

20 The purified protein can also be tested for endotoxin/LPS contamination, and typically 
the LPS content is less than 0.1 ng/ml according to LAL assays. 

Example 7; Cloning and Expression of a Polypeptide in a Baculovirus 
Expression System 

25 In this example, the plasmid shuttle vector pA2 is used to insert a polynucleotide 

into a baculovirus to express a polypeptide. This expression vector contains the strong 
polyhedrin promoter of the Autographa califomica nuclear polyhedrosis virus 
( AcMNPV) followed by convenient restriction sites such as BamHI, Xba I and 
Asp718. The polyadenylation site of the simian virus 40 ("SV40") is used for efficient 

30 polyadenylation. For easy selection of recombinant virus, the plasmid contains the 

beta-galactosidase gene from E, coli under control of a weak Drosophila promoter in the 
same orientation, followed by the polyadenylation signal of the polyhedrin gene. The 
inserted genes are flanked on both sides by viral sequences for cell-mediated 
homologous recombination with wild-type vual DNA to generate a viable virus that 

35 express the cloned polynucleotide. 
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Many other baculovirus vectors can be used in place of the vector above, such 
as pAc373, pVL941, and pAcIMl, as one skilled in the art would readily appreciate, as 
long as the construct provides appropriately located signals for transcription, 
translation, secretion and the like, including a signal peptide and an in-frame AUG as 
5 required. Such vectors are described, for instance, in Luckow et al., Virology 170:31- 
39 (1989). 

Specifically, the cDNA sequence contained in the deposited clone, including the 
AUG initiation codon and the naturally associated leader sequence identified in Table 1, 
is amplified using the PGR protocol described in Example 1. If the naturally occurring 

10 signal sequence is used to produce the secreted protein, the pA2 vector does not need a 
second signal peptide. Alternatively, the vector can be modified (pA2 GP) to include a 
baculovirus leader sequence, using the standard methods described in Summers et al., 
"A Manual of Methods for Baculovirus Vectors and Insect Cell Culture Procedures," 
Texas Agricultural Experimental Station Bulletin No. 1555 (1987), 

15 The amplified fragment is isolated from a 1% agarose gel using a conunercially 

available kit ("Geneclean," BIO 101 Inc., La JoUa, Ca.). The fragment then is digested 
with appropriate restriction enzymes and again purified on a 1% agarose gel. 

The plasmid is digested with the corresponding restriction enzymes and 
optionally, can be dephosphorylated using calf intestinal phosphatase, using routine 

20 procedures known in the art. The DNA is then isolated from a 1% agarose gel using a 
commercially available kit ("Geneclean" BIO 101 Inc., La JoUa, Ca.). 

The fragment and the dephosphorylated plasmid are ligated together with T4 
DNA ligase. E. coli HB 101 or other suitable E, coli hosts such as XL- 1 Blue 
(Stratagene Cloning Systems, La Jolla, CA) cells are transformed with the ligation 

25 mixture and spread on culture plates. Bacteria containing the plasmid are identified by 
digesting DNA from individual colonies and analyzing the digestion product by gel 
electrophoresis. The sequence of the cloned firagment is confirmed by DNA 
sequencing. 

Five ^g of a plasmid containing the polynucleotide is co-transfected with 1 .0 fig 
30 of a commercially available linearized baculovirus DNA ("BaculoGold™ baculovirus 
DNA", Pharmingen, San Diego, CA), usuig the lipofection method described by 
Feigner et al., Proc. Natl. Acad. Sci. USA 84:7413-7417 (1987). One fig of 
BaculoGold™ virus DNA and 5 jig of the plasmid are mfated m a sterile well of a 
microtiter plate containing 50 |Lil of serum-free Grace's medium (Life Technologies 
35 Inc., Gaithersburg, MD). Afterwards, 10 jU Lipofectin plus 90 \i\ Grace's medium are 
added, mixed and incubated for 15 minutes at room temperature. Then the transfection 
mixture is added drop-wise to Sf9 insect cells (ATCC CRL 171 1) seeded in a 35 mm 
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tissue culture plate with 1 ml Grace's medium without serum. The plate is then 
incubated for 5 hours at IT C. The transfection solution is then removed from the plate 
and 1 ml of Grace's insect medium supplemented with 10% fetal calf serum is added. 
Cultivation is then continued at 2T C for four days. 
5 After four days the supernatant is collected and a plaque assay is performed, as 

described by Summers and Smith, supra. An agarose gel with "Blue Gal" (Life 
Technologies Inc., Gaithersburg) is used to allow easy identification and isolation of 
gal-expressing clones, which produce blue-stained plaques. (A detailed description of a 
"plaque assay" of this type can also be found in the user's guide for insect cell culture 

10 and baculovirology distributed by Life Technologies Inc., Gaithersburg, page 9-10.) 
After appropriate incubation, blue stained plaques are picked with the tip of a 
micropipettor (e.g., Eppendorf). The agar containing the recombinant viruses is then 
resuspended in a microcentrifuge tube containing 200 |il of Grace's medium and the 
suspension containing the recombinant baculovirus is used to infect Sf9 cells seeded in 

15 35 mm dishes. Four days later the supematants of these culture dishes are harvested 
and then they are stored at 4° C. 

To verify the expression of the polypeptide, Sf9 cells are grown in Grace's 

medium supplemented v^th 10% heat-inactivated FBS. The cells are infected with the 

recombinant baculovirus containing the polynucleotide at a multiplicity of infection 

20 ("MOI") of about 2. If radiolabeled proteins are desired, 6 hours later the medium is 
removed and is replaced with SF900 II medium minus methionine and cysteme 
(available from Life Technologies Inc., Rockville, MD). After 42 hours, 5 fuCi of ^^S- 
methionine and 5 ^iCi ^^S-cysteine (available from Amersham) are added. The cells are 
further incubated for 16 hours and then are harvested by centrifugation. The proteins 

25 in the supernatant as well as the intracellular proteins are analyzed by SDS-PAGE 
followed by autoradiography (if radiolabeled). 

Microsequencing of the amino acid sequence of the amino terminus of piuified 
protein may be used to determine the amino terminal sequence of the produced 
protein. 



30 



Examples; Expression of a Polypeptide in Mammalian Cells 

The polypeptide of the present invention can be expressed in a marhmalian cell. 
A typical mammalian expression vector contains a promoter element, which mediates 
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the initiation of transcription of mRNA, a protein coding sequence, and signals required 
for the termination of transcription and polyadenylation of the transcript. Additional 
elements include enhancers, Kozak sequences and intervening sequences flanked by 
donor and acceptor sites for RNA splicing. Highly efficient transcription is achieved 
5 with the early and late promoters from S V40, the long terminal repeats (LTRs) from 
Retroviruses, e.g., RSV, HTLVI, HIVI and the early promoter of the cytomegalovirus 
(CMV). However, cellular elements can also be used (e.g., the human actin promoter). 

Suitable expression vectors for use in practicing the present invention include, 
for example, vectors such as pSVL and pMSG (Pharmacia, Uppsala, Sweden), 
10 pRSVcat (ATCC 37152), pSV2dhfr (ATCC 37146), pBC12MI (ATCC 67109), 
pCMVSport 2.0, and pCMVSport 3.0. Mammalian host cells that could be used 
include, human Hela, 293, H9 and Jurkat cells, mouse NIH3T3 and C127 cells, Cos 1, 
Cos 7 and CVlJquaU QCl-3 cells, mouse L cells and Chinese hamster ovary (CHO) 
cells. 

15 Alternatively, the polypeptide can be expressed in stable ceU lines containing the 

polynucleotide integrated into a chromosome. The co-transfection with a selectable 
marker such as dhfr, gpt, neomycin, hygromycin allows the identification and isolation 
of the transfected cells. 

The transfected gene can also be amplified to express large amounts of the 

20 encoded protein. The DHFR (dihydrofolate reductase) marker is useful in developing 
cell lines that carry several hundred or even several thousand copies of the gene of 
interest. (See, e.g., Alt, R W., et al., J. Biol. Chem. 253:1357-1370 (1978); Hamlin, 
J. L. and Ma, C, Biochem. et Biophys. Acta, 1097:107-143 (1990); Page, M. J. and 
Sydenham, M. A., Biotechnology 9:64-68 (1991).) Another useful selection marker is 

25 the enzyme glutamine synthase (GS) (Murphy et al., Biochem J. 227:277-279 ( 1 99 1 ); 
Bebbington et al., Bio/Technology 10:169-175 (1992). Using these markers, the 
mammalian cells are grown in selective medium and the cells with the highest resistance 
are selected. These cell lines contain the amplified gene(s) integrated into a 
chromosome. Chinese hamster ovary (CHO) and NSO cells are often used for the 

30 production of proteins. 

Derivatives of the plasmid pSV2-dhfr (ATCC Accession No. 37146), the 
expression vectors pC4 (ATCC Accession No. 209646) and pC6 (ATCC Accession 
No.209647) contain the strong promoter (LTR) of the Rous Sarcoma Virus (Cullen et 
al.. Molecular and Cellular Biology, 438-447 (March, 1985)) plus a fragment of the 

35 CMV-enhancer (Boshart et al.. Cell 41:521-530 (1985).) Multiple cloning sites, e.g., 
with the restriction enzyme cleavage sites BamHI, Xbal and Asp718, facilitate the 
cloning of the gene of interest. The vectors also contain the 3* intron, the 
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polyadenylation and termination signal of the rat preproinsulin gene, and the mouse 
DHFR gene under control of the SV40 early promoter. 

Specifically, the plasmid pC6, for example, is digested with appropriate 
restriction enzymes and then dephosphorylated using calf iiitestinal phosphates by 
5 procedures known in the art. The vector is then isolated from a 1% agarose gel. 

A polynucleotide of the present invention is amplified according to the protocol 
outlined in Example 1 . If the naturally occurring signal sequence is used to produce the 
secreted protein, the vector does not need a second signal peptide. Alternatively, if the 
naturally occurring signal sequence is not used, the vector can be modified to include a 

10 heterologous signal sequence. (See, e.g., WO 96/34891.) 

The amplified fragment is isolated from a 1% agarose gel using a commercially 
available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The fragment then is digested 
with appropriate restriction enzymes and again purified on a 1% agarose gel. 

The amplified fragment is then digested with the same restriction enzyme and 

15 purified on a 1% agarose gel. The isolated fragment and the dephosphorylated vector 
are then ligated with T4DNA ligase. E. coliHBm orXL-l Blue cells are then 
transformed and bacteria are identified that contain the firagment inserted into plasmid 
pC6 using, for instance, restriction enzyme analysis. 

Chinese hamster ovary cells lacking an active DHFR gene is used for 

20 transfection. Five ^ig of the expression plasmid pC6 is cotransfected with 0.5 |ig of the 
plasmid pSVneo using lipofectin (Feigner et al., supra). The plasmid pSV2-rieo 
contains a dominant selectable marker, the neo gene from Tn5 encoding an enzyme that 
confers resistance to a group of antibiotics including G418. The cells are seeded in 
alpha minus MEM supplemented with 1 mg/ml G418. After 2 days, the cells are 

25 trypsinized and seeded in hybridoma cloning plates (Greiner, Germany) in alpha minus 
MEM supplemented with 10, 25, or 50 ng/ml of metothrexate plus 1 mg/ml G418. 
After about 10-14 days single clones are trypsinized and then seeded in 6-well petri 
dishes or 10 ml flasks using different concentrations of methotrexate (50 nM, 100 nM, 
200 nM, 400 nM, 800 nM). Clones growing at the highest concentrations of 

30 methotrexate are then transferred to new 6-well plates containing even higher 

concentrations of methotrexate (1 ^iM, 2 |jM, 5 pM, 10 mM, 20 mM). The same 
procedure is repeated imtil clones are obtained which grow at a concentration of 100 - 
200 fiM. Expression of the desired gene product is analyzed, for instance, by SDS- 
PAGE and Western blot or by reversed phase HPLC analysis. 
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Example 9; Protein Fusions 

The polypeptides of the present invention are preferably fiised to other proteins. 
These fusion proteins can be used for a variety of applications. For example, fusion of 
the present polypeptides to His-tag, HA-tag, protein A. IgG domains, and maltose 
5 binding protein facilitates purification. (See Example 5; see also EP A 394,827; 

Traunecker, et al.. Nature 331:84-86 (1988).) Similarly, fusion to IgG-t, IgG-3, and 
albumin increases the halflife time in vivo. Nuclear localization signals fused to the 
polypeptides of the present invention can target the protein to a specific subcellular 
localization, while covalent heterodimer or homodimers can increase or decrease the 

10 activity of a fusion protein. Fusion proteins can also create chimeric molecules having 
more than one function. Finally, fusion proteins can increase solubility and/or stability 
of the fused protein compared to the non-fused protein. All of the types of fusion 
proteins described above can be made by modifying the following protocol, which 
outlines the fusion of a polypeptide to an IgG molecule, or the protocol described in 

15 Examples. 

Briefly, the human Fc portion of the IgG molecule can be PGR amplified, using 
primers that span the 5* and 3' ends of the sequence described below. These primers 
also should have convenient restriction enzyme sites that will facilitate cloning into an 
expression vector, preferably a manmialian expression vector. 

20 For example, if pC4 (Accession No. 209646) is used, the human Fc portion can 

be ligated into the BamHI cloning site. Note that the 3' BamHI site should be 
destroyed. Next, the vector containing the human Fc portion is re-restricted with 
BamHI, linearizing the vector, and a polynucleotide of the present invention, isolated 
by the PGR protocol described in Example 1 , is ligated into this BamHI site. Note that 

25 , the polynucleotide is cloned without a stop codon, otherwise a fusion protein will not 
be produced. 

If the naturally occurring signal sequence is used to produce the secreted 
protein, pC4 does not need a second signal peptide. Alternatively, if the naturally 
occurring signal sequence is not used, the vector can be modified to include a 
30 heterologous signal sequence. (See, e.g., WO 96/3489 1 .) 

Human IgG Fc region: 

GGGATCCGGAGCCCAAATCITCTGACAAAACTCACACATGCCCACCGTGCC 
CAGCACCTGAATTCGAGGGTGCACCGTCAGTCTTCCTCTTCCCCCCAAAACC 
35 CAAGGACACCCTCATGATCTCCCGGACTCCTGAGGTCACATGCGTGGTGGT 
GGACGTAAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACG 
GCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAAC 
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AGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCT^ 
AATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAACrc^ 
ATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGT 
GTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCT 
5 GACCTGCCTGGTCAAAGGCTTCTATCCAAGCGACATCGCCGTGGAGTGGGA 
GAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGG 
ACTCCGACGGCTCCTTOTCCTCTACAGCAAGCrCACCGTGGAC^ 
GGTGGCAGCAGGGGAACGTCITCTCATGCTCCGTGATGCATGAGGCTCT^ 

ACAACCACTACACGCAGAAGAGCCTCrCCCTGTCTCCGGGTAAATGAGTGC 
10 GACGGCCGCGACTCTAGAGGAT (SEQ ID NO: 1) 

Example 10; Production of an Antibody from a Polypep tide 

The antibodies of the present invention can be prepared by a variety of methods. 
(See, Current Protocols, Chapter 2.) For example, cells expressing a polypeptide of 

1 5 the present invention is administered to an animal to induce the production of sera 
containing polyclonal antibodies. In a preferred method, a preparation of the secreted 
protein is prepared and purified to render it substantially free of natural contaminants. 
Such a preparation is then introduced into an animal in order to produce polyclonal 
antisera of greater specific activity. 

20 In the most preferred method, die antibodies of die present invention are 

monoclonal antibodies (or protein binding fragments thereof). Such monoclonal 
antibodies can be prepared using hybridoma technology. (Kohler et al.. Nature 
256:495 (1975); Kohler et al., Eur. J. Immunol. 6:51 1 (1976); Kohler et aL, Eur. J. 
Immunol. 6:292 (1976); Hammerling et al., in: Monoclonal Antibodies and T-Cell 

25 Hybridoiiias, Elsevier, N.Y., pp. 563-681 (1981).) In general, such procedures 
involve immunizing an animal (preferably a mouse) with polypeptide or, more 
preferably, witfi a secreted polypeptide-expressing cell. Such cells may be cultured in 
any suitable tissue culture medium; however, it is preferable to culture cells in Earle's 
modified Eagle's medium supplemented witii 10% fetal bovine serum (inactivated at 

30 about 56''C), and supplemented with about 10 g/1 of nonessential amino acids, about 
1,000 U/ml of penicillin, and about 100 |lg/ml of streptomycin. 

The splenocytes of such mice are extracted and fused with a suitable myeloma 
cell line. Any suitable myeloma cell line may be employed in accordance with the 
present invention; however, it is preferable to employ the parent myeloma cell line 

35 (SP20), available from the ATCC. After ftision, die resulting hybridoma ceUs are 
selectively maintained in HAT medium, and tiien cloned by limiting dilution as 
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described by Wands et al. (Gastroenterology 80:225-232 (1981).) The hybridoma cells 
obtained through such a selection are then assayed to identify clones which secrete 
antibodies capable of binding the polypeptide. 

Alternatively, additional antibodies capable of binding to the polypeptide can be 
5 produced in a two-step procedure using anti-idiotypic antibodies. Such a method 
makes use of the fact that antibodies are themselves antigens, and therefore, it is 
possible to obtain an antibody which binds to a second antibody. In accordance with 
this method, protein specific antibodies are used to immunize an animal, preferably a 
mouse. The splenocytes of such an animal are then used to produce hybridoma cells, 
10 and the hybridoma cells are screened to identify clones which produce an antibody 

whose ability to bind to the protein-specific antibody can be blocked by the polypeptide. 
Such antibodies comprise anti-idiotypic antibodies to the protein-specific antibody and 
can be used to immunize an animal to induce formation of further protein-specific 
antibodies. 

1 5 It will be appreciated that Fab and F(ab')2 and other fragments of the antibodies 

of the present invention may be used according to the methods disclosed herein. Such 
fragments are typically produced by proteolytic cleavage, using enzymes such as papain 
(to produce Fab fragments) or pepsin (to produce F(ab')2 fragments). Alternatively, 
secreted protein-binding fragments can be produced through the application of 

20 recombinant DNA technology or through synthetic chemistry. 

For in vivo use of antibodies in humans, it may be preferable to use 
"humanized" chimeric monoclonal antibodies. Such antibodies can be produced using 
genetic constructs derived from hybridoma cells producing the monoclonal antibodies 
described above. Methods for producing chimeric antibodies are known in the art. 

25 (See, for review, Morrison, Science 229: 1202 (1985); Oi et al., BioTechniques 4:214 
(1986); Cabilly et al., U.S. Patent No. 4,816,567; Taniguchi et aL, EP 171496; 
Morrison et al., EP 173494; Neuberger et al., WO 8601533; Robinson et al., WO 
8702671; Boulianne et al.. Nature 312:643 (1984); Neuberger et al.. Nature 314:268 
(1985).) 

30 

Example 11: P roduction Of Secreted Protein For High-Throughpnt 
Screening Assavs 

The following protocol produces a supernatant containing a polypeptide to be 
tested. This supernatant can then be used in the Screening Assays described in 
35 Examples 13-20. 

First, dilute Poly-D-Lysine (644 587 Boehringer-Mannheim) stock solution 
(Img/ml in PBS) 1:20 in PBS (w/o calcium or magnesium 17-5 16F Biowhittaker) for a 



wo 98/54963 



PCT/US98/11422 



working solution of 50ug/nil. Add 200 ul of this solution to each well (24 well plates) 
and incubate at RT for 20 minutes. Be sure to distribute the solution over each well 
(note: a 12-channel pipetter may be used with tips on every other channel). Aspirate off 
the Poly D"Lysine solution and rinse with Iml PBS (Phosphate Buffered Saline). The 
5 PBS should remain in the well until just prior to plating the cells and plates may be 
poly-lysine coated in advance for up to two weeks. 

Plate 293T cells (do not carry cells past P+20) at 2 x 10^ cells/well in .5ml 
DMEM(Dulbecco's Modified Eagle Medium)(with 4.5 G/L glucose and L-glutamine 
(12-604F Biowhittaker))/10% heat inactivated FBS(14-503F Biowhittaker)/lx 
10 Penstrep( 17-602E Biowhittaker). Let the cells grow overnight. 

The next day, mix together in a sterile solution basin: 300 ul Lipofectamine 
(18324-012 Gibco/BRL) and 5ml Optimem I (31985070 Gibco/BRL)/96-well plate. 
With a small volume multi-channel pipetter, aliquot approximately 2ug of an expression 
vector containing a polynucleotide insert, produced by the methods described in 
1 5 Examples 8 or 9, into an appropriately labeled 96-well round bottom plate. With a 
multi-channel pipetter, add 50ul of the Lipofectamine/Optimem I mixture to each well. 
Pipette up and down gently to mix. Incubate at RT 15-45 minutes. After about 20 
minutes, use a multi-channel pipetter to add ISOul Optimem I to each well. As a 
control, one plate of vector DNA lacking an insert should be transfected with each set of 
20 transfections. 

Preferably, the transfection should be performed by tag-teaming the following 
tasks. By tag-teaming, hands on time is cut in half, and the cells do not spend too 
much time on PBS. First, person A aspirates off the media from four 24-well plates of 
cells, and then person B rinses each well with .5-lml PBS. Person A then aspirates off 
25 PBS rinse, and person B, using al2-channel pipetter with tips on every other channel, 
adds the 200ul of DNA/Lipofectamine/Optimem I complex to the odd wells first, then to 
the even wells, to each row on the 24-well plates. Incubate at 3TC for 6 hours. 

While cells are incubating, prepare appropriate media, either 1%BSA in DMEM 
with Ix penstrep, or CHO-5 media (1 16.6 mg/L of CaC12 (anhyd); 0.00130 mg/L 

30 CuSO,-5H20; 0.050 mg/L of Fe(NO3)3-9H20; 0.417 mgO. of FeS04-7H20; 31 1.80 
mg/L of Kcl; 28.64 mg/L of MgCl^; 48.84 mg/L of MgSO^; 6995.50 mg/L of NaCl; 
2400.0 mg/L of N^CO,; 62.50 mg/L of NaH^PO^-HjO; 71.02 mg/L of Na^HPCM; 
.4320 mg/L of ZnS04-7H20; .002 mg/L of Arachidonic Acid ; 1 .022 mg/L of 
Cholesterol; .070 mg/L of DL-alpha-Tocopherol-Acetate; 0.0520 mg/L of Linoleic 

35 Acid; 0.010 mg/L of Linolenic Acid; 0.010 mg/L of Myristic Acid; 0.010 mg/L of Oleic 
Acid; 0.010 mg/L of Pahnitric Acid; 0.010 mg/L of Palmitic Acid; 100 mg/L of 
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Pluronic F-68; 0.010 mg/L of Stearic Acid; 2.20 mg/L of Tween 80; 455 1 mg/L of D- 
Glucose; 130.85 mg/ml of L- Alanine; 147.50 mg/ml of L-Arginine-HCL; 7.50 mg/ml 
of L-Asparagine-HjO; 6.65 mg/ml of L-Aspartic Acid; 29.56 mg/ml of L-Cystine- 
2HCL-H2O; 31.29 mg/ml of L-C:ystine-2HCL; 7.35 mg/ml of L-Glutamic Acid; 365.0 
5 mg/ml of L-Glutamine; 1 8.75 mg/ml of Glycine; 52.48 mg/ml of L-Histidine-HCL- 
HjO; 106.97 mg/ml of L-Isoleucine; 111.45 mg/ml of L-Leucine; 163.75 mg/ml of L- 
Lysine HCL; 32.34 mg/ml of L-Methionine; 68.48 mg/ml of L-Phenylalainine; 40.0 
mg/ml of L-Proline; 26.25 mg/ml of L-Serine; 101.05 mg/ml of L-Threonine; 19.22 
mg/ml of L-Tryptophan; 91.79 mg/ml of L-Tryrosine-2Na-2H20; 99.65 mg/ml of L- 

10 Valine; 0.0035 mg/L of Biotin; 3.24 mg/L of D-Ca Pantothenate; 1 1 .78 mg/L of 
Choline Chloride; 4.65 mg/L of Folic Acid; 15.60 mg/L of i-Inositol; 3.02 mg/L of 
Niacinamide; 3.00 mg/L of Pyridoxal HCL; 0.03 1 mg/L of Pyridoxine HCL; 0.3 19 
mg/L of Riboflavin; 3.17 mg/L of Thiamine HCL; 0.365 mg/L of Thymidine; and 
0.680 mg/L of Vitamin B.j; 25 mM of HEPES Buffer, 2.39 mg/L of Na Hypoxanthine; 

15 0. 1 05 mg/L of Lipoic Acid; 0.08 1 mg/L of Sodium Putrescine-2HCL; 55.0 mg/L of 
Sodium Pyruvate; 0.0067 mg/L of Sodium Selenite; 20uM of Ethanolamine; 0.122 
mg/L of Ferric Citrate; 41.70 mg/L of Methyl-B-Cyclodextrin complexed with Linoleic 
Acid; 33.33 mg/L of Methyl-B-Cyclodextrin complexed witfi Oleic Acid; and 10 mg/L 
of Methyl-B-Cyclodextrin complexed with Retinal) with 2nun glutamine and Ix 

20 penstrep. (BSA (81-068-3 Bayer) lOOgm dissolved in IL DMEM for a 10% BSA stock 
solution). Filter the media and collect 50 ul for endotoxin assay in 15ml polystyrene 
conical. 

The transfection reaction is terminated, preferably by tag-teaming, at the end of 
the incubation period. Person A aspirates off the transfection media, while person B 

25 adds 1 .5ml appropriate media to each well. Incubate at 37°C for 45 or 72 hours 
depending on the media used: 1 %BSA for 45 hours or CHO-5 for 72 hours. 

On day four, using a 300ul multichannel pipetter, aliquot 600ul in one 1ml deep 
well plate and the remaining supernatant into a 2ml deep well. The supematants from 
each well can then be used in the assays described in Examples 13-20. 

30 It is specifically understood that when activity is obtained in any of the assays 

described below using a supernatant, the activity originates from either the polypeptide 
directly (e.g., as a secreted protein) or by the polypeptide inducing expression of other 
proteins, which are then secreted into the supernatant. Thus, the invention further 
provides a method of identifying the protein in the supematant characterized by an 

35 activity in a particular assay. 
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Example 12; Construction of GAS Reporter Construct 

One signal transduction pathway involved in the differentiation and proliferation 
of cells is called the Jaks-STATs pathway. Activated proteins in the Jaks-STATs 
pathway bind to gamma activation site "GAS" elements or interferon-sensitive 
5 responsive element ("ISRE"), located in the promoter of many genes. The binding of a 
protein to these elements alter the expression of the associated gene. 

GAS and ISRE elements are recognized by a class of transcription factors called 
Signal Transducers and Activators of Transcription, or "STATs." There are six 
members of the STATs family. Statl and Stat3 are present in many cell types, as is 

10 Stat2 (as response to IFN-alpha is widespread). Stat4 is more restricted and is not in 
many cell types though it has been found in T helper class I, cells after treatment with 
IL-12. Stats was originally called manmiary growth factor, but has been found at 
higher concentrations in other cells including myeloid cells. It can be activated in tissue 
culture cells by many cytokines. 

15 The STATs are activated to translocate from the cytoplasm to the nucleus upon 

tyrosine phosphorylation by a set of kinases known as the Janus Kinase ("Jaks") 
family. Jaks represent a distinct family of soluble tyrosine kinases and include Tyk2, 
Jakl, Jak2, and Jak3. These kinases display significant sequence similarity and are 
generally catalytically inactive in resting cells. 

20 The Jaks are activated by a wide range of receptors summarized in the Table 

below. (Adapted from review by Schidler and Darnell, Ann. Rev. Biochem. 64:621-51 
(1995).) A cytokine receptor family, capable of activating Jaks, is divided into two 
groups: (a) Class 1 includes receptors for IL-2, IL-3, IL-4, IL-6, IL-7, IL-9, IL-1 1, IL- 
12, IL-15, Epo, PRL, GH, G-CSF, GM-CSF, LIF, CNTF, and thrombopoietin; and 

25 (b) Class 2 includes IFN-a, IFN-g, and IL-10. The Class 1 receptors share a 

conserved cysteine motif (a set of four conserved cysteines and one tryptophan) and a 
WSXWS motif (a membrane proxial region encoding Trp-Ser-Xxx-Trp-Ser (SEQ ID 
NO:2)). 

Thus, on binding of a ligand to a receptor, Jaks are activated, which in turn 
30 activate STATs, which then translocate and bind to GAS elements. This entire process 
is encompassed in the Jaks-STATs signal transduction pathway. 

Therefore, activation of the Jaks-STATs pathway, reflected by the binding of 
the GAS or the ISRE element, can be used to indicate proteins involved in the 
proliferation and differentiation of cells. For example, growth factors and cytokines are 
35 known to activate the Jaks-STATs pathway. (See Table below.) Thus, by using GAS 
elements linked to reporter molecules, activators of the Jaks-STATs pathway can be 
identified. 
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To construct a synthetic GAS containing promoter element, which is used in the 
Biological Assays described in Examples 13-14, a PGR based strategy is employed to 
generate a GAS-SV40 promoter sequence. The 5' primer contains four tandem copies 
of the GAS binding site found in the IRFl promoter and previously demonstrated to 
5 bind STATs upon induction with a range of cytokines (Rothman et al., Inmiunity 

1:457-468 (1994).), although other GAS or ISRE elements can be used instead. The 5' 
primer also contains I8bp of sequence complementary to the SV40 early promoter 
sequence and is flanked with an Xhol site. The sequence of the 5' primer is: 

5':GCGCCrCGAGATTTCCCCGAAATCTAGATTTCCCCGAAATGATTTCCCCG 
10 AAATGATTTCCCCGAAATATCTGCCATCTCAATTAG:3' (SEQIDN0:3) 

The downstream primer is complementary to the S V40 promoter and is flanked 
with a Hind in site: 5':GCGGCAAGCTTTTTGCAAAGCCTAGGC:3' (SEQ ID 
NO:4) 

PGR amplification is performed using the SV40 promoter template present in 
15 the B-gal:promoter plasmid obtained from Clontech. The resulting PGR fragment is 
digested with Xhol/Hind III and subcloned into BLSK2-. (Stratagene.) Sequencing 
with forward and reverse primers confirms tiiat the insert contains the following 
sequence: 

5':Cr£GAeATTrCGGGGAAATCrAGATTrGGGGGAAATGATTrGGGGGA^ 

20 ATTTCCCGGAAATATCTGCCATCTGAATTAGTGAGGAAGGATAGTGGGGGGC 

GTAACTCGGCCCATGCCGCCGCTAACTGCGGCCAGTTCCGCCCATTGTGGGG 

GCGATGGCTGACTAATTTTTTTTATTTATGGAGAGGCCGAGGCGGCCTGGGG 

GTCTGAGCTATTGGAGAAGTAGTGAGGAGGGTTTTTTGGAGGGGTAGGGTTT 
TGGAAAAAGCn:3' (SEQIDNO:5) 

25 With this GAS promoter element linked to the SV40 promoter, a GAS:SEAP2 

reporter construct is next engineered. Here, die reporter molecule is a secreted alkaline 
phosphatase, or "SEAP." Clearly, however, any reporter molecule can be mstead of 
SEAP, in this or in any of the other Examples. Well known reporter molecules that can 
be used instead of SEAP include chloramphenicol acetyltransferase (GAT), luciferase, 

30 alkaline phosphatase, B-galactosidase, green fluorescent protein (GFP), or any protein . 
detectable by an antibody. 

The above sequence confirmed synthetic GAS-SV40 pronioter element is 
subcloned into die pSEAP-Promoter vector obtained from Glontech using Hindm and 
Xhol, effectively replacing the SV40 promoter witii the amplified GAS:SV40 promoter 

35 element, to create the GAS-SEAP vector. However, this vector does not contain a 
neomycin resistance gene, and tiierefore, is not preferred for mammalian expression 
systems. 
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Thus, in order to generate mammalian stable cell lines expressing the GAS- 
SEAP reporter, the GAS-SEAP cassette is removed from the GAS-SEAP vector using 
Sail and NotI, and inserted into a backbone vector containing the neomycin resistance 
gene, such as pGFP-1 (Clontech), using these restriction sites in the multiple cloning 
5 site, to create the GAS-SEAP/Neo vector. Once this vector is transfected into 

mammalian cells, this vector can then be used as a reporter molecule for GAS binding 
as described in Examples 13-14. 

Other constructs can be made using the above description and replacing GAS 
with a different promoter sequence. For example, construction of reporter molecules 

10 containing NFK-B and EGR promoter sequences are described in Examples 15 and 16. 
However, many other pronioters can be substituted using the protocols described in 
these Examples. For instance, SRE, IL-2, NFAT, or Osteocalcin promoters can be 
substituted, alone or in combination (e.g., GAS/NF-KB/EGR, GAS/NF-KB, II- 
2/NFAT, or NF-KB/GAS). Similarly, other cell lines can be used to test reporter 

15 construct activity, such as HELA (epithelial), HUVEC (endothelial), Reh (B-cell), 
Saos-2 (osteoblast), HUVAC (aortic), or Cardiomyocyte. 

Example 13: High-Throughput Screening Assav for T-cell Activity 

The following protocol is used to assess T-cell activity by identifying factors, 

20 such as growth factors and cytokines, that may proliferate or differentiate T-cells. T- 
cell activity is assessed using the GAS/SEAP/Neo construct produced in Example 12. 
Thus, factors that increase SEAP activity indicate the ability to activate the Jaks-STATS 
signal transduction pathway. The T-cell used in this assay is Jurkat T-cells (ATCC 
Accession No. TIB-152), although Molt-3 cells (ATCC Accession No. CRL-1552) and 

25 Molt-4 cells (ATCC Accession No. CRL-1582) cells can also be used. 

Jurkat T-cells are lymphoblastic CD4-h Thl helper cells. In order to generate 
stable cell lines, approximately 2 milhon Jurkat cells are transfected with the GAS- 
SEAP/neo vector using DMRIE-C (Life Technologies)(transfection procedure 
described below). The transfected cells are seeded to a density of approximately 

30 20,000 cells per well and transfectants resistant to 1 mg/ml genticin selected. Resistant 
colonies are expanded and then tested for their response to increasing concentrations of 
interferon ganuna. The dose response of a selected clone is demonstrated. 

Specifically, the following protocol will yield sufficient cells for 75 wells 
containing 200 ul of cells. Thus, it is either scaled up, or performed in multiple to 

35 generate sufficient cells for multiple 96 well plates. Jurkat cells arc maintained in RPMI 
+ 10% serum with l%Pen-Strep. Combine 2.5 mis of OPTI-MEM (Life Technologies) 
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with 10 ug of plasmid DNA in a T25 flask. Add 2.5 ml OPTI-MEM containing 50 ul 
of DMRDE-C and incubate at room temperature for 15-45 mins. 

During the incubation period, count cell concentration, spin down the required 
number of cells (10^ per transfection), and resuspend in OPTI-MEM to a final 
5 concentration of 10^ cells/ml. Then add 1ml of 1 x 10^ cells in OPTI-MEM to T25 flask 
and incubate at 3TC for 6 hrs. After the incubation, add 10 ml of RPMI + 15% serum. 

The Jurkat:GAS-SEAP stable reporter lines are maintained in RPMI + 10% 
serum, 1 mg/ml Genticin, and 1% Pen-Strep. These cells are treated with supematants 
containing a polypeptide as produced by the protocol described in Example 11. 
10 On the day of treatment with the supernatant, the cells should be washed and 

resuspended in fresh RPMI + 10% serum to a density of 500,000 cells per ml. The 
exact number of cells required will depend on the number of supematants being 
screened. For one 96 well plate, approximately 10 million cells (for 10 plates, 100 
million cells) are required. 
15 Transfer the cells to a triangular reservoir boat, in order to dispense the cells into 

a 96 well dish, using a 12 channel pipette. Using a 12 channel pipette, transfer 200 ul 
of cells into each well (therefore addmg 100, 000 cells per well). 

After all the plates have been seeded, 50 ul of the supematants are transferred 
directly from the 96 well plate containing the supematants into each well usmg a 12 
20 channel pipette. In addition, a dose of exogenous interferon gamma (0. 1 , 1 .0, 10 ng) 
is added to wells H9, HIO, and HI 1 to serve as additional positive controls for the 
assay. 

The 96 well dishes containing Jurkat cells treated with supematants are placed in 
an incubator for 48 hrs (note: this time is variable between 48-72 hrs). 35 ul samples 
25 from each well are then U-ansferred to an opaque 96 well plate using a 12 channel 

pipette. The opaque plates should be covered (using sellophene covers) and stored at - 
20^0 until SEAP assays are perfomfied according to Example 17. The plates 

containing the remaining treated cells are placed at 4^C and serve as a source of material 
for repeating the assay on a specific well if desired. 
30 As a positive control, 100 Unit/ml interferon ganmia can be used which is 

knovra to activate Jurkat T cells. Over 30 fold induction is typically observed in the 
positive control wells. 
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Example 14; High- Throughput Screening Assay Identifying Myeloid 
Activity 

The following protocol is used to assess myeloid activity by identifying factors, 
such as growth factors and cytokines, that may proliferate or differentiate myeloid cells. 
5 Myeloid cell activity is assessed using the GAS/SEAP/Neo construct produced in 

Example 12. Thus, factors that increase SEAP activity indicate the ability to activate the 
JakS'STATS signal transduction pathway. The myeloid cell used in this assay is U937, 
a pre-monocy te cell line, although TF- 1 , HL60, or KG 1 can be used. 

To transiently transfect U937 cells with the GAS/SEAP/Neo construct produced 
10 in Example 12, a DEAE-Dextran method (Kharbanda et. ah, 1994, Cell Growth & 
Differentiation, 5:259-265) is used. First, harvest 2xl0e7 U937 cells and wash with 
PBS. The U937 cells are usually grown in RPMI 1640 medium containing 10% heat- 
inactivated fetal bovine serum (FBS) supplemented with 100 units/ml peniciUin and 100 
mg/ml streptomycin. 

15 Next, suspend the cells in 1 ml of 20 mM Tris-HCl (pH 7.4) buffer containing 

0.5 mg/ml DEAE-Dextran, 8 ug GAS-SEAP2 plasmid DNA, 140 mM NaCI, 5 mM 
KCl, 375 uM Na2HP04.7H20, 1 mM MgCl2, and 675 uM CaCl2. Incubate at 37^0 
for 45 min. 

Wash the cells with RPMI 1640 medium containing 10% FBS and then 
20 resuspend in 10 ml complete medium and incubate at 37^0 for 36 hr. 

The GAS-SEAP/U937 stable cells are obtained by growing the cells in 400 
ug/ml G418. The G418-free medium is used for routine growth but every one to two 
months, the cells should be re-grown in 400 ug/ml G418 for couple of passages. 

These cells are tested by harvesting 1x10 cells (this is enough for ten 96-well 
25 plates assay) and wash with PBS. Suspend the cells in 200 ml above described growth 
medium, with a final density of 5x10^ cells/ml. Plate 200 ul cells per well in the 96- 
well plate (or 1x10^ cells/well). 

. Add 50 ul of the supernatant prepared by the protocol described in Example 1 1 . 
Incubate at 37^0 for 48 to 72 hr. As a positive control, 100 Unit/ml interferon gamma 
30 can be used which is known to activate U937 cells. Over 30 fold induction is typically 
observed in the positive control wells. SEAP assay the supernatant according to the 
protocol described in Example 17. 



wo 98/54963 



PCTAJS98/11422 



247 



Example 15; High-Throughput Screening Agssav Identifying Neuronal 

When cells undergo differentiation and proliferation, a group of genes are 
activated through many different signal transduction pathways. One of these genes, 
5 EGRl (early growth response gene 1), is induced in various tissues and cell types upon 
activation. The promoter of EGRl is responsible for such induction. Using the EGRl 
promoter linked to reporter molecules, activation of cells can be assessed. 

Particularly, the following protocol is used to assess neuronal activity in PC 12 
cell lines. PC 12 cells (rat phenochromocytoma cells) are known to proliferate and/or 
10 differentiate by activation with a number of mitogens, such as TPA (tetradecanoyl 

phorbol acetate), NGF (nerve growth factor), and EGF (epidermal growth factor). The 
EGRl gene expression is activated during this treatment. Thus, by stably transfecting 
PCI 2 cells with a construct containing an EGR promoter linked to SEAP reporter, 
activation of PCI 2 cells can be assessed. 
1 5 The EGR/SEAP reporter constiuct can be assembled by the following protocol. 

The EGR-1 promoter sequence (-633 to +l)(Sakamoto K et al.. Oncogene 6:867-871 
(1991)) can be PCR amplified from human genomic DNA using die following primers: 

5* GCGCTCGAGGGATGACAGCGATAGAACCCCGG -3* (SEQ ID NO:6) 

5' GCGAAGCTTCGCGACTCCCCGGATCCGCCTC-3' (SEQ ID NO:7) 
20 Using the GAS:SEAP/Neo vector produced in Example 12, EGRl amplified 

product can then be inserted into this vector. Linearize the GAS:SEAP/Neo vector 
using resu-iction enzymes Xhol/Hindlll, removing the GAS/SV40 stuffer. Restrict the 
EGRl amplified product with these same enzymes. Ligate the vector and the EGRl 
promoter. 

25 To prepare 96 well-plates for cell culture, two mis of a coating solution ( 1 :30 

dilution of collagen type I (Upstate Biotech Inc. Cat#08-1 15) in 30% ethanol (filter 
sterilized)) is added per one 10 cm plate or 50 ml per well of the 96-well plate, and 
allowed to air dry for 2 hr. 

PCI 2 cells are routinely grown in RPMM640 medium (Bio Whittaker) 

30 containing 10% horse serum (JRH BIOSCIENCES, Cat. # 12449-78P), 5% heat- 
inactivated fetal bovine serum (FBS) supplemented with 100 units/ml penicillin and 100 
ug/ml streptomycin on a precoated 10 cm tissue culture dish. One to four split is done 
every three to four days. Cells are removed from the plates by scraping and 
resuspended with pipetting up and down for more than 15 times. 

35 Transfect the EGR/SEAP/Neo construct into PC 1 2 using the Lipofectamine 

protocol described in Example 11. EGR-SEAP/PC12 stable cells are obtained by 
growing die cells in 300 ug/ml G418. The G418-free medium is used for routine 
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growth but every one to two months, the cells should be re-grown in 300 ug/ml G418 
for couple of passages. 

To assay for neuronal activity, a 10 cm plate with cells around 70 to 80% 
confluent is screened by removing the old medium. Wash the cells once with PBS 
5 (Phosphate buffered saline). Then starve the cells in low serum medium (RPMI-1640 
containing 1 % horse serum and 0.5% FBS with antibiotics) overnight. 

The next morning, remove the medium and wash the cells with PBS. Scrape 
off the cells from the plate, suspend the cells well in 2 ml low serum medium. Count 

the cell number and add more low serum medium to reach final cell density as SxlO^ 
10 cells/ml. 

Add 200 ul of the cell suspension to each well of 96- well plate (equivalent to 
lxl05 cells/well). Add 50 ul supernatant produced by Example 1 1 , 37^C for 48 to 72 
hr. As a positive control, a growth factor known to activate PC 12 cells through EGR 
can be used, such as 50 ng/ul of Neuronal Growth Factor (NGF). Over fifty-fold 
15 induction of SEAP is typically seen in the positive control wells. SEAP assay the 
supernatant according to Example 17. 

Example 16; High*Thr oughput Screening Assay for T>cell Activity 

NF-kB (Nuclear Factor kB) is a transcription factor activated by a wide variety 

20 of agents including the inflammatory cytokines IL- 1 and TNF, CD30 and CD40, 
lymphotoxin-alpha and lymphotoxin-beta, by exposure to LPS or thrombin, and by 
expression of certain viral gene products. As a transcription factor, NF-kB regulates 
the expression of genes involved in immune cell activation, control of apoptosis (NF- 
Kfi appears to shield cells from apoptosis), B and T-cell development, anti-viral and 

25 antimicrobial responses, and multiple stress responses. 

In non-stimulated conditions, NF- kB is retained m the cytoplasm with I-kB 
(Inhibitor kB). However, upon stimulation, I- kB is phosphorylated and degraded, 
causing NF- kB to shuttle to the nucleus, thereby activating transcription of target 

genes. Target genes activated by NF- kB include IL-2, IL-6, GM-CSF, ICAM-1 and 
30 class IMHC. 

Due to its central role and ability to respond to a range of stimuli, reporter 
constructs utilizing the NF-kB promoter element are used to screen the supematants 
produced in Example 1 1 . Activators or inhibitors of NF-kB would be useful in treating 
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diseases. For example, inhibitors of NF-kB could be used to treat those diseases 
related to the acute or chronic activation of NF-kB, such as rheumatoid arthritis. 

To . construct a vector containing the NF-kB promoter element, a PGR based 
strategy is employed. The upstream primer contains four tandem copies of the NF-kB 
binding site (GGGGACTTTCCC) (SEQ ID N0:8), 18 bp of sequence complementary 
to the 5' end of the S V40 early promoter sequence, and is flanked with an Xhol site: 

S^GCGGCCTCGAGGGGACTTTCCCGGGGACTTTCCGGGGACTTTCCGGGAC 
TTTCCATCCTGCCATCTCAATTAGrS' (SEQIDN0:9) 

The downstream primer is complementary to the 3' end of the S V40 promoter 
and is flanked with a Hind III site: 

5':GCGGCAAGCrTTTTGCAAAGCCTAGGC:3' (SEQ ID N0:4) 

PGR amplification is performed using the SV40 promoter template present in 
the pB-gal:promoter plasmid obtained from Glontech. The resulting PGR fragment is 
digested with Xhol and Hind III and subcloned into BLSK2-. (Stratagene) 
Sequencing with the T7 and T3 primers confirms the insert contains the following 
sequence: 

5*:CTCGAGGGGACrTTCCCGGGGACITrCCGGGGACTTTCCGGGACTTTCC 

ATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTAACTCCGGCCA 

TCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGACT 

AATTTTTTTTATITATGCAGAGGCCGAGGCCGCCTCGGCCTCTGAGCTATTC 

CAGAAGTAGTGAGGAGGCrTTTrTGGAGGCCrAGGCrTTTGCAAAAAGOT 
3' (SEQ ID NO: 10) 

Next, replace the SV40 minimal promoter element present in the pSEAP2- 

promoter plasmid (Glontech) with this NF-KB/SV40 fragment using Xhol and Hindm. 

However, this vector does not contain a neomycin resistance gene, and therefore, is not 
preferred for mammalian expression systeins. 

In order to generate stable mammalian cell lines, the NF-KB/SV40/SEAP 
cassette is removed from the above NF-kB/SEAP vector using restriction enzymes Sail 
and NotI, and inserted into a vector containing neomycin resistance. Particularly, the 
NF-KB/SV40/SEAP cassette was inserted into pGFP-1 (Glontech), replacing the GFP. 
gene, after restricting pGFP-1 with Sail and Notl. 
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Once NF-kB/S V40/SEAP/Neo vector is created, stable Jurkat T-cells are 
created and maintained according to. the protocol described in Example 13. Similarly, 
the method for assaying supematants with these stable Jurkat T-cells is also described 
in Example 13. As a positive control, exogenous TNF alpha (0.1,1, 10 ng) is added to . 
5 wells H9, H 10, and H 11 , with a 5- 1 0 fold activation typically observed. 

Example 17: Assav for SEAP Activity 

As a reporter molecule for the assays described in Examples 13-16, SEAP 
activity is assayed using the Tropbc Phospho-light Kit (Cat. BP-400) according to the 
10 following general procedure. The Tropix Phospho-light Kit supplies the Dilution, 
Assay, and Reaction Buffers used below. 

Prime a dispenser with the 2.5x Dilution Buffer and dispense 15 ^1 of 2.5x 
dilution buffer into Optiplates containing 35 |il of a supernatant. Seal the plates with a 

plastic sealer and incubate at 65oC for 30 min. Separate the Optiplates to avoid uneven 
15 heating. 

Cool the samples to room temperature for 15 minutes. Empty the dispenser and 
prime with the Assay Buffer. Add 50 |il Assay Buffer and incubate at room 
temperature 5 min. Empty the dispenser and prime with the Reaction Buffer (see the 
table below). Add 50 fil Reaction Buffer and incubate at room temperature for 20 
20 minutes. Since the intensity of the chemiluminescent signal is time dependent, and it 
takes about 10 minutes to read 5 plates on luminometer, one should treat 5 plates at each 
time and start the second set 10 minutes later. 

Read the relative light unit in the luminometer. Set H12 as blank, and print the 
results. An increase in chemiluminescence indicates reporter activity. 

25 

Reaction Buffer Formulation: 



#of plates Rxn buffer diluent (mi) CSPD (ml) 

10 60 3 

n 65 3.25 

12 70 3.5 

13 75 3.75 

14 80 4 

15 85 4.25 

16 90 4.5 

17 95 4.75 

18 100 5 

19 105 5.25 

20 no 5.5 

21 115 5.75 

22 J20 6 
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Example 18: High-Throughput Screen ing Assay Identifying Changes in 
Small Molecu le Concentration and Membrane Permeability 

Binding of a ligand to a receptor is known to alter intracellular levels of small 
5 molecules, such as calcium, potassium, sodium, and pH, as well as alter membrane 
potential* These alterations can be measured in an assay to identify supematants which 
bind to receptors of a particular cell. Although the following protocol describes an 
assay for calcium, this protocol can easily be modified to detect changes in potassium, 
sodium, pH, membrane potential, or any other small molecule which is detectable by a 

10 fluorescent probe. 

The following assay uses Fluorometric Imaging Plate Reader ("FLIPR") to 
measure changes in fluorescent molecules (Molecular Probes) that bind small 
molecules. Clearly, any fluorescent molecule detecting a small molecule can be used 
instead of the calcium fluorescent molecule, fluo-3, used here. 

15 For adherent cells, seed the cells at 10,000 -20,000 cells/well in a Co-star black 

96-well plate with clear bottom. The plate is incubated in a CO^ incubator for 20 hours. 
The adherent cells are washed two times in Biotek washer with 200 ul of HBSS 
(Hank's Balanced Salt Solution) leaving 100 ul of buffer after the final wash. 
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A Stock solution of 1 mg/ml fluo-3 is made in 10% pluronic acid DMSO. To 
load the cells with fluo-3, 50 ul of 12 ug/ml fluo-3 is added to each well. The plate is 
mcubated at 3TC in a COj incubator for 60 min. The plate is washed four times in the 
Biotek washer with HBSS leaving 100 ul of buffer. 
5 For non-adherent cells, the cells are spun down from culture media. Cells are 

re-suspended to 2-5x10^ cells/ml with HBSS in a 50-ml conical tube. 4 ul of 1 mg/ml 
fluo-3 solution in 10% pluronic acid DMSO is added to each ml of cell suspension. 
The tube is then placed in a 3TC water bath for 30-60 min. The cells are washed twice 
with HBSS, resuspended to 1x10^ cells/ml, and dispensed into a microplate, 100 
10 ul/well. The plate is centrifuged at 1000 rpm for 5 mdn. The plate is then washed once 
in Denley CellWash with 200 ul, followed by an aspiration step to 100 ul final volume. 

For a non-cell based assay, each well contains a fluorescent molecule, such as 
fluo-3. The supernatant is added to the well, and a change in fluorescence is detected. 

To measuire the fluorescence of intracellular calcium, the FLIPR is set for the 
15 following parameters: (1) System gain is 300-800 mW; (2) Exposure time is 0.4 

second; (3) Camera F/stop is F/2; (4) Excitation is 488 nm; (5) Emission is 530 nm; and 
(6) Sample addition is 50 uL Increased emission at 530 nm indicates an extracellular 

signaling event which has resulted in an increase in the intracellular Ca++ 
concentration. 

20 

Example 19; High-Throughput Screening Assav Iden tifying Tyrosine 
Kinase Activity 

The Protein Tyrosine Kinases (PTK) represent a diverse group of 
transmembrane and cytoplasmic kinases. Within the Receptor Protein Tyrosine Kinase 

25 RPTK) group are receptors for a range of mitogenic and metabolic growth factors 
including the PDGF, FGF, EGF, NGF, HGF and Insulin receptor subfamilies. In 
addition there are a large family of RPTKs for which the corresponding ligand is 
unknown. Ligands for RPTKs include mainly secreted small proteins, but also 
membrane-bound and extracellular matrix proteins. 

30 Activation of RPTK by ligands involves ligand-mediated receptor dimerization, 

resultmg in transphosphorylation of the receptor subunits and activation of the 
cytoplasmic tyrosine kinases. The cytoplasmic tyrosine kinases include receptor 
associated tyrosine kinases of the src-family (e.g., src, yes, Ick, lyn, fyn) and non- 
receptor linked and cytosolic protein tyrosine kinases, such as the Jak family, members 

3i5 of which mediate signal transduction triggered by the cytokine superf amily of receptors 
(e.g., the Interleukins, Interferons, GM-CSF, and Leptin). 
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Because of the wide range of known factors capable of stimulating tyrosine 
kinase activity, the identification of novel human secreted proteins capable of activating 
tyrosine kinase signal transduction pathways are of interest. Therefore, the following 
protocol is designed to identify those novel human secreted proteins capable of 
5 activating the tyrosine kinase signal transduction pathways. 

Seed target cells (e.g., primary keratinocytes) at a density of approximately 
25,000 cells per well in a 96 well Loprodyne Silent Screen Plates purchased from 
Nalge Nunc (Naperville, IL). The plates are sterilized with two 30 minute rinses with 
100% ethanol, rinsed with water and dried overnight. Some plates are coated for 2 hr 

10 with 100 ml of cell culture grade type I collagen (50 mg/ml), gelatin (2%) or polylysine 
(50 mg/ml), all of which can be purchased from Sigma Chemicals (St. Louis. MO) or 
10% Matrigel purchased from Becton Dickinson (Bedford,MA), or calf serum, rinsed 
with PBS and stored at 4^0. Cell growth on these plates is assayed by seeding 5,000 
cells/well in growth medium and indirect quantitation of cell number through use of 

15 alamarBlue as described by the manufacturer Alamar Biosciences, Inc. (Sacramento, 
CA) after 48 hr. Falcon plate covers #3071 from Becton Dickinson (Bedford,MA) are 
used to cover die Loprodyne Silent Screen Plates. Falcon Microtest m cell culture 
plates can also be used in some proliferation experiments. 

To prepare extracts, A431 cells are seeded onto the nylon membranes of 

20 Loprodyne plates (20,000/200ml/well) and cultured overnight in complete medium. 
Cells are quiesced by incubation in serum-frce basal medium for 24 hr. After 5-20 
minutes treatment with EGF (60ng/ml) or 50 ul of the supernatant produced in Example 
1 1 , the medium was removed and 1 00 ml of extraction buffer ((20 mM HEPES pH 
7.5, 0.15 M NaCl, 1% Triton X-100, 0.1% SDS, 2 mM Na3V04, 2 mM Na4P207 

25 and a cocktail of protease inhibitors (# 1836170) obtained from Boeheringer Mannheim 
(Indianapolis, IN) is added to each well and the plate is shaken on a rotating shaker for 
5 minutes at 4^C. The plate is then placed in a vacuum transfer manifold and the extract 
filtered through the 0.45 mm membrane bottoms of each well using house vacuum. 
Exu-acts are collected in a 96-well catch/assay plate in the bottom of the vacuum 

30 manifold and inmiediately placed on ice. To obtain extracts clarified by centrifiigation. 
the content of each well, after detergent solubilization for 5 minutes, is removed and 
cenuifuged for 1 5 minutes at 4^C at 1 6,000 x g. 

Test the filtered extracts for levels of tyrosine kinase activity. Although many 
methods of detecting tyrosine kinase activity are known, one method is described here. 

35 Generally, the tyrosine kinase activity of a supernatant is evaluated by 

determining its ability to phosphorylate a tyrosine residue on a specific substrate (a 
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biotinylated peptide). Biotinylated peptides that can be used for this purpose include 
PSKl (corresponding to aniino acids 6-20 of the cell division kinase cdc2-p34) and 
PSK2 (corresponding to amino acids 1-17 of gastrin). Both peptides are substrates for 
a range of tyrosine kinases and are available from Boehringer Maiinheim, 

5 The tyrosine kinase reaction is set up by adding the following components in 

order. First, add lOul of 5uM Biotinylated Peptide, then lOul ATP/Mg2+ (5mM 
ATP/50mM MgCl2), then lOul of 5x Assay Buffer (40mM imidazole hydrochloride, 
pH7.3, 40 mM beta-glycerophosphate, ImM EGTA, lOOmM MgCl2, 5 mM MnCl2, 
0.5 mg/ml BSA), tiien 5ul of Sodium Vanadaie(lmM), and then 5ul of water. Mix the 

10 components gentiy and preincubate the reaction mix at SO^C for 2 min. Initial the 
reaction by adding lOul of the control enzyme or the filtered supernatant. 

The tyrosine kinase assay reaction is then terminated by adding 10 ul of 120mm 
EDTA and place the reactions on ice. 

Tyrosine kinase activity is determined by transferring 50 ul aliquot of reaction 

1 5 mixture to a microtiter plate (MTP) module and incubating at 37^0 for 20 min. This 
allows the streptavadin coated 96 well plate to associate with the biotinylated peptide. 
Wash the MTP module with 300ul/well of PBS four times. Next add 75 ul of anti- 
phospotyrosine antibody conjugated to horse radish peroxidase(anti-P-Tyr- 

POD(0.5u/ml)) to each well and incubate at 37^ for one hour. Wash the well as 
20 above. 

Next add lOOul of peroxidase substrate solution (Boehringer Mannheim) and 
incubate at room temperature for at least 5 mins (up to 30 min). Measure the 
absorbance of the sample at 405 nm by using ELIS A reader. The level of bound 
peroxidase activity is quantitated using an ELIS A reader and reflects the level of 
25 tyrosine kinase activity. 

Example 20: High-Th roughput Screening Assay Identifying 
Phosphorylation Activity 

As a potential alternative and/or compliment to the assay of protein tyrosine 
30 kinase activity described in Example 19, an assay which detects activation 

(phosphorylation) of major intracellular signal transduction mtermediates can also be 
used. For example, as described below one particular assay can detect tyrosine 
phosphorylation of the Erk-1 and Erk-2 kinases. However, phosphorylation of other 
molecules, such as Raf , JNK, p38 MAP, Map kinase kinase (MEK), MEK kinase, 
35 Src, Muscle specific kinase (MuSK), IRAK, Tec, and Janus, as well as any other 
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phosphoscrine, phosphotyrosine, or phosphothreonine molecule, can be detected by 
substituting these molecules for Erk-1 or Erk-2 in the following assay. 

Specifically, assay plates are made by coating the wells of a 96-well ELIS A 
plate with 0.1ml of protein G (lug/ml) for 2 hr at room temp, (RT). The plates are then 
5 rinsed with PBS and blocked with 3% BSA/PBS for 1 hr at RT. The protein G plates 
are then treated with 2 commercial monoclonal antibodies (lOOng/well) against Erk-1 
and Erk-2 (1 hr at RT) (Santa Cruz Biotechnology). (To detect other molecules, this 
step can easily be modified by substituting a monoclonal antibody detecting any of the 

above described molecules.) After 3-5 rinses with PBS, the plates are stored at 4^C 
10 until use. 

A431 cells are seeded at 20,000/well in a 96- well Loprodyne filterplate and 
cultured overnight in growth medium. The cells are then starved for 48 hr in basal 
medium (DMEM) and then treated with EOF (6ng/well) or 50 ul of the supematants 
obtained in Example 1 1 for 5-20 minutes. The cells are then solubilized and exu-acts 

15 filtered direcdy into the assay plate. 

After incubation with the extract for 1 hr at RT, the wells are again rinsed. As a 
positive control, a commercial preparation of MAP kinasfc (lOng/well) is used in place 
of A43 1 extract. Plates are then treated with a commercial polyclonal (rabbit) antibody 
( 1 ug/ml) which specifically recognizes the phosphorylated epitope of the Erk- 1 and 

20 Erk-2 kinases (1 hr at RT). This antibody is biotinylated by standard procedures. The 
bound polyclonal antibody is then quantitated by successive incubations with 
Europium-streptavidin and Europium fluorescence enhancing reagent in the Wallac 
DELFIA instrument (time-resolved fluorescence). An increased fluorescent signal over 
background indicates a phosphorylation! 

25 

Example 21: Method of Determining Alterations in a Gene 
Corresponding to a Polynucleotide 

RNA isolated from entire families or individual patients presenting with a 
phenotype of interest (such as a disease) is be isolated. cDNA is then generated from 
30 these RNA samples using protocols known in the art. (See, Sambrook.) The cDNA is 
then used as a template for PGR, employing primers surrounding regions of interest in 
SEQ ID Nb:X. Suggested PGR conditions consist of 35 cycles at 95°C for 30 

seconds; 60-120 seconds at 52-58°C; and 60-120 seconds at 70°C, using buffer 
solutions described in Sidransky, D., et al.. Science 252:706 (1991). 
35 PGR products are then sequenced using primers labeled at their 5' end with T4 

polynucleotide kinase, employing SequiTherm Polymerase. (Epicentre Technologies). 
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The intron-exon borders of selected exons is also determined and genomic PGR 
products analyzed to confirm the results. PGR products harboring suspected mutations 
is then cloned and sequenced to validate the results of the direct sequencing. 

PGR products is cloned into T-tailed vectors as described in Holton, T.A. and 
5 Graham, M.W., Nucleic Acids Research, 19: 1 156 (1991) and sequenced with T7 
polymerase (United States Biochemical). Affected individuals are identified by 
mutations not present in unaffected individuals. 

Genomic rearrangements are also observed as a method of determining 
alterations in a gene corresponding to a polynucleotide. Genomic clones isolated 

10 according to Example 2 are nick-translated with digoxigenindeoxy-uridine 5 - 

triphosphate (Boehringer Manheim), and FISH performed as described in Johnson, 
Gg. et al.. Methods Gell Biol. 35:73-99 (1991). Hybridization with the labeled probe is 
canried out using a vast excess of human cot-1 DNA for specific hybridization to the 
corresponding genomic locus. 

1 5 Ghromosomes are counterstained with 4,6-diamino-2-phenylidole and 

propidium iodide, producing a combination of G- and R-bands. Aligned images for 
precise mapping are obtained using a triple-band filter set (Ghroma Technology, 
Brattleboro, VT) in combination with a cooled charge-coupled device camera 
(Photometries, Tucson, AZ) and variable excitation wavelength filters. (Johnson, Cv. 

20 et al.. Genet. Anal. Tech. Appl., 8:75 (1991).) Image collection, analysis and 

chromosomal fractional length measurements are performed using the ISee Graphical 
Program System. (Inovision Gorporation, Durham, NG.) Ghromosome alterations of 
the genomic region hybridized by the probe are identified as insertions, deletions, and 
translocations. These alterations are used as a diagnostic marker for an associated 

25 disease. 

Example 22; Method of Detecting A bnormal Levels of a Polypeptide in a 
Biological Sample 

A polypeptide of the present invention can be detected in a biological sample, 
30 and if an increased or decreased level of the polypeptide is detected, this polypeptide is 
a marker for a particular phenotype. Methods of detection are numerous, and thus, it is 
understood that one skilled in the art can modify the foUowmg assay to fit their 
particular needs. 

For example, antibody-sandwich ELISAs are used to detect polypeptides in a 
35 sample, preferably a biological sample. Wells of a microtiter plate are coated with 

specific antibodies, at a final concentration of 0.2 to 10 ug/ml. The antibodies are either 
monoclonal or polyclonal and are produced by the mediod described in Example 10. 



/ 
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The wells are blocked so that non-specific binding of the polypeptide to the well is 
reduced. 

The coated wells are then incubated for > 2 hours at RT with a sample 
containing the polypeptide. Preferably, serial dilutions of the sample should be used to 
5 validate results. The plates are then washed three times with deionized or distilled water 
to remove unbounded polypeptide. 

Next, 50 ul of specific antibody-alkaline phosphatase conjugate, at a 
concentration of 25-400 ng, is added and incubated for 2 hours at room temperature. 
The plates are again washed three times with deionized or distilled water to remove 
10 unbounded conjugate. 

Add 75 ul of 4-methylumbelliferyl phosphate (MUP) or p-nitrophenyl 
phosphate (NPP) substrate solution to each well and incubate 1 hour at room 
temperature. Measure the reaction by a microtiter plate reader. Prepare a standard 
curve, using serial dilutions of a control sample, and plot polypeptide concentration on 
1 5 the X-axis (log scale) and fluorescence or absorbance of the Y-axis (Imear scale). 

Interpolate the concentration of the polypeptide in the sample using the standard curve. 

Example 23: Formulating a Polypeptide 

The secreted polypeptide composition will be formulated and dosed in a fashion 

20 consistent with good medical practice, taking into account the clinical condition of the 
individual patient (especially the side effects of treatment with the secreted polypeptide 
alone), the site of delivery, the method of administration, the scheduling of 
administration, and other factors known to practitioners. The "effective amount" for 
purposes herein is thus determined by such considerations. 

25 . As a general proposition, the total pharmaceutically effective amount of secreted 

polypeptide administered parenterally per dose will be in the range of about 1 jig/kg/day 
to 10 mg/kg/day of patient body weight, although, as noted above, this will be subject 
to therapeutic discretion. More preferably, this dose is at least 0.01 mg/kg/day, and 
most preferably for humans between about 0.01 and 1 mg/kg/day for the hormone. If 

30 given continuously, the secreted polypeptide is typically administered at a dose rate of 
about 1 ^g/kg/hour to about 50 ^ig/kg/hour, either by 1-4 injections per day or by 
continuous subcutaneous infusions, for example, using a mini-pump. An intravenous 
bag solution may also be employed. The length of treatment needed to observe changes 
and the interval following treatment for responses to occur appears to vary depending 

35 on the desired effect. 

Pharmaceutical compositions containing the secreted protein of the invention are 
admmistered orally, rectally, parenterally, intracistemally, intravaginally. 
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iniraperitoneally, topically (as by powders, ointments, gels, drops or transdermal 
patch), bucally, or as an oral or nasal spray. "Pharmaceutically acceptable carrier" refers 
to a non-toxic solid, semisolid or liquid filler, diluent, encapsulating material or 
formulation auxiliary of any type. The term "parenteral" as used herein refers to modes 
5 of administration which include intravenous, intramuscular, intraperitoneal, intrastemal, 
subcutaneous and intraarticular injection and infusion. 

The secreted polypeptide is also suitably administered by sustained-release 
systems. Suitable examples of sustained-release compositions include semi-permeable 
polymer matrices in the form of shaped articles, e.g., films, or mirocapsules. 

10 Sustained-release matrices include polylactides (U.S. Pat. No. 3,773,919, EP 58,481), 
copolymers of L-glutamic acid and gamma-ethyl-L-glutamate (Sidman, U. et al., 
Biopolymers 22:547-556 (1983)), poly (2- hydroxyethyl methacrylate) (R. Langer et 
al., J. Biomed. Mater. Res. 15:167-277 (1981), and R. Langer, Chem. Tech. 12:98- 
105 (1982)), ethylene vinyl acetate (R. Langer et al.) or poly-D- (-)-3-hydroxybutyric 

15 acid (EP 133,988). Sustained-release compositions also include liposomally entrapped 
polypeptides. Liposomes containing the secreted polypeptide are prepared by methods 
known per se: DE 3,218,121; Epstein et al., Proc. Natl. Acad. Sci. USA 82:3688-3692 
(1985); Hwang et al., Proc. NaU. Acad. ScL USA 77:4030-4034 (1980); EP 52,322; 
EP 36,676; EP 88,046; EP 143,949; EP 142,641; Japanese Pat. Appl. 83-118008; 

20 U.S. Pat. Nos. 4.485,045 and 4,544,545; and EP 102,324. Ordinarily, the liposomes 
are of the small (about 200-800 Angstroms) unilamellar type in which the lipid content 
is greater than about 30 mol. percent cholesterol, the selected proportion being adjusted 
for the optimal secreted polypeptide therapy. 

For parenteral administration, in one embodiment, the secreted polypeptide is 

25 formulated generally by mixing it at the desired degree of purity, in a unit dosage 

injectable form (solution, suspension, or emulsion), with a pharmaceutically acceptable 
carrier, .i.e., one that is non-toxic to recipients at the dosages and concenu-ations 
employed and is compatible with other ingredients of the formulation. For example, the 
formulation preferably does not include oxidizing agents and other compounds that are 

30 known to be deleterious to polypeptides. 

Generally, the formulations are prepared by contacting the polypeptide 
uniformly and intimately with liquid carriers or finely divided solid carriers or both. 
Then, if necessary, the product is shaped into the desired formulation. Preferably the 
carrier is a parenteral carrier, more preferably a solution that is isotonic with the blood 

35 of the recipient. Examples of such carrier vehicles include water, saline. Ringer's 
solution, and dextrose solution. Non-aqueous vehicles such as fixed oils and ethyl 
oleate are also useful herein, as well as liposomes. 
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The carrier suitably contains minor amounts of additives such as substances that 
enhance isotonicity and chemical stability. Such materials are non-toxic to recipients at 
the dosages and concentrations employed, and include buffers such as phosphate, 
citrate, succinate, acetic acid, and other organic acids or their salts; antioxidants such as 
5 ascorbic acid; low molecular weight (less than about ten residues) polypeptides, e.g., 
polyarginine or tripeptides; proteins, such as serum albumin, gelatin, or • 
immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids, 
such as glycine, glutamic acid, aspartic acid, or arginine; monosaccharides, 
disaccharides, and other carbohydrates including cellulose or its derivatives, glucose, 

10 manose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or 
sorbitol; counterions such as sodium; and/or nonionic surfactants such as polysorbates, 
poloxamers, or PEG. 

The secreted polypeptide is typically formulated in such vehicles at a 
concentration of about 0.1 mg/raJ to 100 mg/ml, preferably 1-10 mg/ml, at a pH of 

15 about 3 to 8. It will be understood that the use of certain of the foregoing excipients, 
carriers, or stabilizers will result in the formation of polypeptide salts. 

Any polypeptide to be used for therapeutic administration can be sterile. 
Sterility is readily accomplished by filtration through sterile filtration membranes (e.g., 
0.2 micron membranes). Therapeutic polypeptide compositions generally are placed 

20 into a container having a sterile access port, for example, an intravenous solution bag or 
vial having a stopper pierceable by a hypodermic injection needle. 

Polypeptides ordinarily will be stored in unit or multi-dose containers, for 
example, sealed ampoules or vials, as an aqueous solution or as a lyophilized 
formulation for reconstitution. As an example of a lyophilized formulation, 10-ml vials 

25 are filled with 5 ml of sterile-filtered 1 % (w/v) aqueous polypeptide solution, and the 
resulting mixture is lyophilized. The infusion solution is prepared by reconstituting the 
lyophilized polypeptide using bacteriostatic Water-for-Injection. 

The invention also provides a pharmaceutical pack or kit comprising one or 
more containers filled with one or more of the ingredients of the pharmaceutical 

30 compositions of the invention. Associated with such container(s) can be a notice in the 
form prescribed by a governmental agency regulating the manufacture, use or sale of 
pharmaceuticals or biological products, which notice reflects approval by the agency of 
manufacture, use or sale for human administration. In addition, the polypeptides of the 
present invention may be employed in conjunction with other therapeutic compounds. 

35 
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Example 24: Method of Treating Decreased Levels of the Polypep tide 

It will be appreciated that conditions caused by a decrease in the standard or 
normal expression level of a secreted protein in an individual can be treated by 
administering the polypeptide of the present invention, preferably in the secreted form. 
5 Thus, the invention also provides a method of treatment of an individual in need of an 
increased level of the polypeptide comprising administering to such an individual a 
pharmaceutical composition comprising an amount of the polypeptide to increase the 
activity level of the polypeptide in such an individual. 

For example, a patient with decreased levels of a polypeptide receives a daily 
10 dose 0.1-100 ug/kg of the polypeptide for six consecutive days. Preferably, the 

polypeptide is in the secreted form. The exact details of the dosing scheme, based on 
administration and formulation, are provided in Example 23. 

Example 25: Method of Treating Increased Levels of the Polypep tide 

15 Antisense technology is used to inhibit production of a polypeptide of the 

present invention. This technology is one example of a method of decreasing levels of 
a polypeptide, preferably a secreted form, due to a variety of etiologies, such as cancer. 

For example, a patient diagnosed with abnormaUy increased levels of a 
polypeptide is administered intravenously antisense polynucleotides at 0.5, 1.0, 1.5. 

20 2.0 and 3.0 mg/kg day for 21 days. This treatment is repeated after a 7-day rest period 
if the treatment was well tolerated. The formulation of the antisense polynucleotide is 
provided in Example 23. 

Example 26; Method of Treatment Using Gene Therapy 
25 One method of gene therapy transplants fibroblasts, which are capable of 

expressing a polypeptide, onto a patient. Generally, fibroblasts are obtained from a 
subject by skin biopsy. The resulting tissue is placed in tissue-culture medium and 
separated into small pieces. Small chunks of the tissue are placed on a wet surface of a 
tissue culture flask, approximately ten pieces are placed in each flask. The flask is 
30 turned upside down, closed tight and left at room temperature over night. After 24 

hours at room temperature, the flask is inverted and the chunks of tissue remain fixed to 
the bottom of the flask and fresh media (e.g.. Ham's F12 media, with 10% FBS, 

penicillin and streptomycm) is added The flasks are then incubated at 37°C for 

approximately one week. 
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At this time, firesh media is added and subsequently changed every several days. 
After an additional two weeks in culture, a monolayer of fibroblasts emerge. The 
monolayer is trypsinized and scaled into larger flasks. 

pMV-7 (Kirschmeier, P.T. et al, DNA, 7:219-25 (1988)), flanked by the long 
5 terminal repeats of the Moloney murine sarcoma virus, is digested with EcoRI and 
Hindlll and subsequently treated with calf intestinal phosphatase. The linear vector is 
fractionated on agarose gel and purified, using glass beads. 

The cDNA encoding a polypeptide of the present invention can be amplified 
using PCR primers which correspond to the 5' and 3' end sequences respectively as set 
10 fordi in Example 1. Preferably, the 5' primer contains an EcoRI site and the 3' primer 
includes a Hindlll site. Equal quantities of the Moloney murine sarcoma virus linear 
backbone and the amplified EcoRI and Hindlll fragment are added together, in the 
presence of T4 DNA ligase. The resulting mixture is maintained under conditions 
appropriate for ligation of the two fragments. The ligation mixuire is then used to 
1 5 transform bacteria HB 101 , which arc then plated onto agar containing kanamycin for 
the purpose of confirming that the vector has the gene of interest properly inserted. 

The amphotropic pA3 17 or GP+aml2 packaging cells are grown in tissue 
culture to confluent density in Dulbecco's Modified Eagles Medium (DMEM) with 10% 
calf serum (CS), penicillin and streptomycin. The MSV vector containing the gene is 
20 then added to the media and the packaging cells transduced with the vector. The 
packaging cells now produce infectious viral particles containing the gene (the 
packaging cells are now referred to as producer cells). 

Fresh media is added to the transduced producer cells, and subsequently, the 
media is harvested fi-om a 10 cm plate of confluent producer cells. The spent media, 
25 containing the infectious viral particles, is filtered through a millipore filter to remove 
detached producer cells and this media is then used to infect fibroblast cells. Media is 
removed from a sub-confluent plate of fibroblasts and quickly replaced with the media 
from the producer cells. This media is removed and replaced with fresh media. If the 
titer of virus is high, then virtually all fibroblasts will be infected and no selection is 
30 required. If the titer is very low, then it is necessary to use a retroviral vector that has a 
selectable marker, such as neo or his. Once the fibroblasts have been efficiently 
infected, the fibroblasts are analyzed to determine whether protein is produced. 

The engineered fibroblasts are then transplanted onto the host, either alone or 
after having been grown to confluence on cytodex 3 raicrocarrier beads. 

35 
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Example 27: Method of Treatment Using Gene Therapy - In Vivo 

Another aspect of the present invention is using in vivo gene therapy 
methods to treat disorders, diseases and conditions. The gene therapy method 
relates to the mtroduction of naked nucleic acid (DNA, RN A, and antisense 
5 DNA or RNA) sequences into an animal to increase or decrease the expression 
of the polypeptide of the present invention. A polynucleotide of the present 
invention may be operatively linked to a promoter or any other genetic elements 
necessary for the expression of the encoded polypeptide by the target tissue.. 
Such gene therapy and delivery techniques and methods are known in the art, 
10 see, for example, WO90/1 1092, W098/1 1779; U.S. Patent NO. 5693622, 
5705151, 5580859; Tabata H. et al. (1997) Cardiovasc. Res. 35(3):470-479, 
Chao J et al, (1997) Pharmacol. Res. 35(6):5 17-522, Wolff LA. (1997) 
Neuromuscul. Disord. 7(5):314-318, Schwartz B. et al. (1996) Gene Ther. 
3(5):405-411, Tsurumi Y. et al. (1996) Circulation 94(12):3281-3290 
15 (incorporated herein by reference). 

The polynucleotide constructs of the present invention may be dehvered 
by any method that delivers injectable materials to the cells of an animal, such 
as, injection into the interstitial space of tissues (heart, muscle, skin, lung, liver, 
intestine and the like). These polynucleotide constructs can be dehvered in a 
20 pharmaceutically acceptable liquid or aqueous carrier. 

The term "naked" polynucleotide, DNA or RNA, refers to sequences 
that are free from any delivery vehicle that acts to assist, promote, or facihtate 
entry into the cell, including viral sequences, viral particles, liposome 
formulations, lipofectin or precipitating agents and the like. However, the 
25 polynucleotides may also be delivered in liposome formulations (such as those 
taught in Feigner P.L. et al. (1995) Ann. NY Acad. Sci. 772:126-139 and 
Abdallah B. et al. (1995) Biol. Cell 85(1): 1-7) which can be prepared by 
methods well known to those skilled in the art. 

The polynucleotide vector constructs of the present invention used in 
30 the gene therapy method are preferably constructs that will not integrate into the 
host genome nor will they contain sequences that allow for replication. Any 
strong promoter known to those skilled in the art can be used for driving the 
expression of DNA. Unlike other gene therapies techniques, one major 
advantage of introducing naked nucleic acid sequences into target cells is the 
35 transitory nature of the polynucleotide synthesis in the cells. Studies have 
shown that non-replicating DNA sequences can be introduced into cells to 
provide production of the desired polypieptide for periods of up to six months. 
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The polynucleotide construct of the present invention can be delivered to 
the interstitial space of tissues within the an animal, including of muscle, skin, 
brain, lung, liver, spleen, bone marrow, thymus, heart, lymph, blood, bone, 
cartilage, pancreas, kidney, gall bladder, stomach, intestine, testis, ovary, 

5 uterus, rectum, nervous system, eye, gland, and connective tissue. Interstitial 
space of the tissues comprises the intercellular fluid, mucopolysaccharide matrix 
among the reticular fibers of organ tissues, elastic fibers in the walls of vessels or 
chambers, collagen fibers of fibrous tissues, or that same matrix within 
connective tissue ensheathing muscle cells or in the lacunae of bone. It is 

10 similarly the space occupied by the plasma of the circulation and the lymph fluid 
of the lymphatic channels. Delivery to the interstitial space of muscle tissue is 
preferred for the reasons discussed below. They may be conveniently delivered 
by injection into the tissues comprising these cells. They are preferably delivered 
to and expressed in persistent, non-dividing cells which are differentiated, 

15 although delivery and expression may be achieved in non-differentiated or less 
completely differentiated cells, such as, for example, stem cells of blood or skin 
fibroblasts. In vivo muscle cells are particularly competent in their ability to take 
up and express polynucleotides. 

For the naked polynucleotide injection, an effective dosage amount of 

20 DNA or RNA will be in the range of from about 0.05 g/kg body weight to about 
50 mg/kg body weight. Preferably the dosage will be from about 0.005 mg/kg 
to about 20 mg/kg and more preferably from about 0.05 mg/kg to about 5 mg/kg. 
Of course, as the artisan of ordinary skill will appreciate, this dosage will vary 
according to the tissue site of injection. The appropriate and effective dosage of 

25 nucleic acid sequence can readily be determined by those of ordinary skill in the 
art and may depend on the condition being treated and the route of 
administration. The preferred route of administration is by the parenteral route of 
injection into the interstitial space of tissues. However, otiier parenteral routes 
may also be used, such as, inhalation of an aerosol formulation particularly for 

30 delivery to lungs or bronchial tissues, throat or mucous membranes of the nose. 
In addition, naked polynucleotide constructs can be delivered to arteries during 
angioplasty by the catheter used in the procedure. 

The dose response effects of injected polynucleotide in muscle in vivo is 
determined as follows. Suitable template DNA for production of mRNA coding 

35 for the polypeptide of die present invention is prepared m accordance with a 
standard recombinant DNA methodology. The template DNA, which may be 
either circular or linear, is either used as naked DNA or complexed widi 
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liposomes. The quadriceps muscles of mice are then injected with various 
amounts of the template DNA. 

Five to six week old female and male Balb/C mice are anesthetized by 
intraperitoneal injection with 0.3 ml of 2.5% Avertin. A 1.5 cm incision is made 

5 on the anterior thigh, and the quadriceps muscle is direcdy visualized. The 

template DNA is injected in 0.1 ml of carrier in a 1 cc syringe through a 27 gauge 
needle over one minute, approximately 0.5 cm from the distal insertion site of the 
muscle into the knee and about 0.2 cm deep. A suture is placed over the 
injection site for future localization, and the skin is closed with stainless steel 

10 clips. 

After an appropriate incubation time (e.g., 7 days) muscle extracts are prepared 
by excising the entire quadriceps. Every fifth 15 um cross-section of the individual 
quadriceps muscles is histochemically stained for protein expression. A time course for 
protein expression may be done in a similar fashion except that quadriceps from 

15 different mice are harvested at different times. Persistence of DNA in muscle following 
injection may be determined by Southern blot analysis after preparing total cellular DNA 
and HIRT supematants from injected and control mice. • The results of the above 
experimentation in mice can be use to extrapolate proper dosages and other treatment 
parameters in humans and other animals using naked DNA of the present invention. 

20 It will be clear that the invention may be practiced otherwise than as particularly 

described in tiie foregoing description and examples. Numerous modifications and 
variations of the present invention are possible in light of the above teachings and, 
therefore, are within tiie scQpe of the appended claims. 

The entire disclosure of each document cited (including patents, patent 

25 applications, journal articles, abstracts, laboratory manuals, books, or other 

disclosures) in tiie Background of the Invention, Detailed Description, and Examples is 
hereby incorporated herein by reference. 
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Sequence Listing 

(1) GENERAL INFORMATIC»I : 

(i) APPLICAOT: Human Genome Sciences, Inc., et al. 

(ii) TITLE OF INVENTION: 207 Human Secreted Proteins 

(iii) NUMBER OF SEQUENCES: 800 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Hunan Genome Sciences, Inc. 

(B) STREET: 9410 Key West Avenue 

(C) CITY: Rockville 

(D) STATE: Maryland 

(E) COUNTRY: USA 

(F) ZIP: 20850 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.50 inch, 1.4Mb storage 

(B) COMPUTER: HP Vectra 486/33 

(C) OPERATING SYSTEM: MSDOS version 6.2 

(D) SOFTWARE: ASCII Text 

(vi) . CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

. (C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 
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15 



(viii) ATTORNEY/AGEOT INFORMATION: 

(A) NAME: Kenley K. Hoover 

(B) REGISTRATION NUMBER: 40,302 

(C) REFERENCE/DOCKET NUMBER: PZ007PCT 

(vi) TELECOMMUNICATION INFORMATICS: 
(A) TELEPHONE: (301) 309-8504 
{B) TELEFAX: (301) 309-8439 



20 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 733 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 1: 

30 

GGGATCCGGA GCCCAAATCT TCTGACAAAA CTCACACATG CCCACCGTGC CCAGCACCTG 60 

AATTCGAGGG TGCACCGTCA GTCTTCCTCT TCCCCCCAAA ACCCAAGGAC ACCCTCATGA 120 

35 TCTCCCQGAC TCCTCAQGTC ACATC3CGTGG TGGTGGACGT AAGCCACGAA GACCCTGftGG 180 

TCAAGTTCAA CTCGTACGTG GACGGCGTGG AGGTGCATAA TGCCAAGACA AAGCCGCGGG 240 

AGGAQCAGTA CAACAGCACG TACCGTGTGG TCAGCGTCCT CACCGTCCTG CACCAGGACT 300 

40 

GGCTGAATGG CAAGGAGTAC AAGTGCAAGG TCTOCAACAA AGCCCTCCCA ACCCCCATCG 360 

AGAAAACCAT CTCCAAAGCC AAAGGGCAGC CCCGAGAACC ACAGGTGTAC ACCCTGCCCC 420 

45 CATCCCGGGA TGAGCTCACC AAGAACCAGG TCAGCCTGAC CTGCCTGGTC AAAGGCTTCT 480 

ATCCAAGCGA CATCGCCGTG GAGTGGGAGA GCAATGGGCA GCCGGAGAAC AACTACAAGA 540 

CCACQCCTCC CGTGCTGGAC TCCGACGGCT CCTTCTTCCT CTACAGCAAG CTCACCGTGG 600 

50 

ACAAGAGCAG GTGGCAGCAG GGGAACGTCT TCTCATGCTC CGTGATGCAT GAGGCTCTGC 660 

ACAACCACTA CACGCAGAAG AGCCTCTCCC TGTCTCCGGG TAAATGAGTG CGACGGCCGC 720 

55 GACTCTAGAG GAT '^33 



60 



(2) INFORMATION FOR SEQ ID NO: 2: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Trp Ser Xaa Trp Ser 
10 1 5 



15 (2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
25 GCGCCTCGAG ATTTCCCCGA AATCTAGATT TCCCCGAAAT GATTTCCCCG AAATGATTTC 60 
CCCGAAATAT CTGCCATCTC AATTAG 86 

30 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 (xi) SEQUENCE DESCRIPTIC»I : SEQ ID NO: 4: 

GCGGCAAGCT TTTTGCAAAG CCTAGGC 27 

45 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
50 . (A) LENGTH: 271 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CTCGAGATTT CCCCGAAATC TAGATTTCCC CGAAATGATT TCCCCGAAAT GATTTCCCCG 60 



AAATATCTGC CATCTCAATT AGTCAGC^AC CATAGTCCCG CCCCTAACTC CGCCCATCCC 120 
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GCCCCTAACT CCGCCCAGTT CCGCXTCATTC TCCGCCCTAT QGCTGACTAA TTTTTTTTAT 
TTATGCAGAG GCCGAGGCCG CCTCGGCCTC TGAGCTATTC CAGAAGTAGT GAGGAGGCTT 
TTTTGGAGGC CTAGGCTTTT GCAAAAAGCT T 

(2) INFORMATION FOR SEQ ID NOt 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LETCTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible. 

(D) TOPOIiOGY: linear 

<xi) SEQUENCE DESCRIPTION: SBQ ID NO: 6: 
GCGCTCGAGG GATGACAGCG ATAGAACCCC GG 



(2) INFORMATICai FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: . 



GCGAAGCTTC GCGACTCCCC GGATCCGCCT C 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GGG6ACTTTC CC 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 base pairs 

(B) TYPE: nucleic acid 
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10 



35 



45 



55 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GCGGCCTCGA GGGGACTTTC CCGGGGACTT TCCGC3GGACT TTCCGGGACT TTCCATCCTG 60 
CCATCTCAAT TAG '^^ 



(2) INFORMATION FOR SEQ ID NO: 10: 



15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 256 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CTCGAGGGGA CTTTCCCGGG GACTTTCCGG GGACTTTCCG GGACTTTCCA TCTGCCATCT 60 
25 CAATTAC?rCA GCAACCATAG TCCCGCCCCT AACTCCGCCC ATCCCQCCCC TAACTCCGCC 120 
CAGTTCCGCC CATTCTCCGC CCCATGGCTG ACTAATTTTT TTTATTTATG CAGAGGCCGA 180 
GGCCGCCTCG GCCTCTGAGC TATTCCAGAA GTACJTGAGGA GGCTTTTTTG GAGGCCTAGG 240 

30 

CTTTTGCAAA AAGCTT 256 



(2) INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 2526 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTIC»J: SEQ ID NO: 11: 

GACAGGCTAT CCGAGAATCT GAGAGCTGQG CCCGGCAATT CCTCCAGYTA CCCTTGTGAC 60 
CTAAGTCCAG TCACACATTT CCCAAAGTTT CTCTTTGTCA TAACCCTGGT CTGGCTGGTT . 120 

50 TTGRGGRCTT GAGAATOSGT CAGGGACTCC AGGCCAAGTC CAACAGAGAC CCCAAACCCA 180 

CCACACACCA GCAGCCACAA CCTCACCACC AACAAAGAGG ACTTTTGTGG GGCCACAAGT 240 

AAGAQGTCAT TTCIGGAATG GACTCAGACC TTTAAACAGG AGAGTTGAGC ACTTCCAGKS 300 

AGTTTTTAAG CAAGGCATGG GGAACAGGGA ATAGAACCTT TCAAAGAGGT TGCCCAGAGA 360 

AAAGCTQGGC CTCTIGCATT CQGCTTCCTT QGAGCAGCCT CTTCTGGCAG AAAGCCATCA 420 

60 GGTGCTCAAT CATCTTCTCC TGGCCAAGGC TCTGACCATG CTTAGTACTG GAATAGAGGT 480 
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GGCCAGGCCC CCAGCGACTC TTCTTCGCCT GATGTTTGTC CTCACAGGCA TGCCACC3TGG 540 

CCTGAGATGA TTCAGAACAA ATCATGCTAA CTTTGAATCC ATCCACSCCAC TTGCAAATGA 600 

^ TAATCAGAAG TCAGCTTGTT CACTGTTAGA AAGAAACTAA CAAAAGAGAA CCCAGAGCAA 660 

TCTAGAATCT TTCAGTGCTT GGCTTTCCAA GGATACTGCG GAGACTCTGG CXIMGCTGAT 720 

10 GAMCTTCTGA ARTGTCACTG GCACCATATG CAACAAGAAC CACCATTCAC TGAGTAGCTA 780 

ATCGGTTTGG GGCCTCGGAC ATTCCATCTG AGGTCCTTCC TGAACATCTC ACTCCACAGC 840 

AGAGGACCGG TTCCAGCTTA CCCAGAACCA CTCCTCCAGG AGAGCTGGAT GTTTTGCGTG 900 

CAACACCTTG AGCACTGACT GCTATTGTTC AAAAAAAGCC TTTQCTGCAT TCGGAGGACT 960 

GCCCCGTGCC CTCAGGTGAC TTCCTAACTA TGTGGTTTCA TTAGCGAATT TATTTTTTGT 1020 

20 GCTGGGTCGA CATTTGTATT TTGTTAGGTT GCTGTTTAAG CTCAAGTTTG CTGTGCTCTC 1080 

TGCAGCTACA AAACATCTTG GCATATTTAA GAKTGGCTTT TATAAATAGC TTTATTCTGA 1140 

TATTAATCAG ATTCCCAACT TTACTGAGAA TTAAGGACTG GGGTACTTTA AAGAAATGCA 1200 

25 

AATAGCAATT GAAGAAOCAC TGCTGCAGGT GGTAGCCCTG GCTAGACTGA ATTACACTAG 1260 

AAATCAGCCA GAAGGAAGCG TCCTTQGGAT CCXrAGATCAC TCTTTTTTTT TTTTTITrTA 1320 

30 AAAGGGGCAG CCXXTTGATG GCKaVTCTCT CTGAATAACA GTTACGTCTT CATATC6ATA 1380 

CCAGATGCCT TCTTCATCAT GCX^CTGAAG CCACTCACCA CCTTCAAGAA CATGCCAACC 1440 

9 

TCTGTCAGAT TCACTTACCC ACAAACAAGG AGGCACGTTT GGCACAAAGT GTrGTCCTCC 1500 

35 

AGGTCCAAGT GGACTCTACA GAGTCCTTGA CCTCAACACA CTGGATTCCA GGTGGACTGG 1560 

ACCAAGAGCA GGCAAAGACA CGGGAACTGA AAAACTCCAC AGGGTTTGGA GAATAGAAAT 1620 

40 GAAAAGCCAC CJTCATATAAC TCAAGAATAA AIKXHOTTTT QGAAATTTTA AAATTATCAT 1680 

CGAAGGTGCyr GAAACTATTT CPGGCCCPAA TGAAAGGAAA TCGOCAGTTG GGGATGAAAT 1740 

CACAGAGCCT <?ICTTTTATG ATATQGTTGG ATGnCCCACTG ATGAAATTTT AAAGGAGTTT 1800 

45 

CATTTTTAAA AGTGCGCATG ATTCTACATA TGAGAATTCT TTAGGCXIAAG AAACTGTCCT 1860 

TCGCTCAGAG GTOITCGGAA TTAAAGCAGA GAGAAGCCAT TCGTCATGCT TAGAACCAAG 1920 

50 GATGGTCATG TACACAAAGA CCATCGAGAC QGCCATTCTT GTTTACAAAA CACTTACCAA 1980 

GAAAGCACTT TOTAGQGGAA CTTTAGTAAG TTCTTCTCAT TTCATTATGT TTCTTCCAAG 2040 

GAAACAGGAG AGACTORATT AATAATTCTC TCTTTCCTCT TAAGCACTTT TAAAATAATA 2100 

55 

AAGTACATCT TCAAATTIGG GQGGGCATCT CTGATTTAAA AAAAGAAAAA QGCTGCTTGA 2160 

TOTATOrrAT GCAGAGACAC TCTGCXnCTG GTGGCTGCAG AGCAATACCC AAGCCTCATT 2220 

60 TCGAAGGCTC AACATITCGA ATTGCACTTT AATTGATTAA TCCTCAATTC ATCTGGCCTT 2280 
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10 



ACGGGATCGT GGGTCTOGGA CCCCAA!ITCA TTCTTATCTG CCAAAGAATT ATCTAGAAGC 
ACATCAAATA CCAGCACCCC ACCTGCACAA TGQGGGTQGA AAACTTTTGT ATCCCTAAGC 
ATATTATTTT ATAGTGTCTG CCATGCCATG TGGAAATACT TTATTTTTAA CCTCAGGATT 
TAAATAAAGT AAACACTATG ACATTTAAAA AAAAAAAAAA AAAACTCGAG GQGGGCCCQG 
TACCCA 



2340 
2400 
2460 
2520 
2526 



15 (2) INFOBMATION FOR SEQ ID NO: 12: 



20 



25 



30 



35 



40 



45 



50 



55 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1131 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CACrcCACCA GCTTTGTTAT CTGTAAAATG ATGATAATAC CAACACCTTC TTCTTQGGGT 
ACTGAAGATG AGAGAACATG ATATGTGTAA AGTGCCTTCC ACAATACCCA GAACATAGCA 
AACATCTAAT GAATGTAGTA ATAGTAATTA TTTTATTTTC TTTTGATTCA GTTQGGACTA 
TGTTGAGCTG TAACAGAATA CCCAAAATAA CTGTTTTAAA CAAATTAAAG TTTWSTTGTG 
AAGTTriGTT ACGAATTCAG ACAATCCAGG GCTTTTATAG ATGCACCAGG ATCAGCAGGT 
ACAAAGGCAT CTTTCCTCAT TTCTGCCAGT CTCAATGCAT GGGTTGCAAT CCAGARTCCA 
RGATGGCAGT TCCAGCCCTG GTT^CGCCCA TATTAGCACA CAGAAAGAAA GAGAAAGGGA 
TCTGCCTCTT CACTTTAATC ATAGCTCCCA CTAGATGCAC CCACTACTTC TGCTGATACT 
CCATTAGCTA ATCCTTOCTT ACATGGTCAC ACTTAGTTTC CAGAGAGACA TGTCTGGACA 
GTCATGTGCT CAATTAATAT CCAAGTGTCC AATTACTGAG AAAAAAAGAA ACTAGCACCT 
TTGCTTGGTr GCATTCCTCT TAGCATAAGC CACATTCTTT TTATGAAGTT GTCCTCAGTT- 
ACTTOGATGC CTCAGTTCTC CTTTCAWTTA GAAAWGCYCC TKGGACAYCC TGAAWCTGAC 
TTCTTTTGTC ATCAGCACCA TCACTACCAC TGCCYTCTTC AAAGCCACCA CGTTCTGTCC 
CCAGGATGCSrr TGCAACAACC ACCATAGGGA CTTTTTQCCT TCTACTTCCA CACAATAGNC 
CAGAlCTAAGC TTTTCAAAAT GTAGGTCAGA TCATGTCTCT CTCTTCCTCT TCAAAACCCT 
CCCGATGGCT TTTCATATTA CTCAAAAGAA AACCTAAAAC TTTGCTGTGA GATCTATGTG 
ACCCGGCTTA TTCTTCCTCT TACTTTATCT CTGTATTGCT CTTCCTCACT CTACTCCAGC 
CATCCCACCT CCTTOCTGCT TGTCCTATAC TCCTAAAAGA AGTTCAGTCT TCCCTTATGA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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TATTTGCACT TAAAATAGAA AAAAAAAAAA AAAAAAAACT CGAGGGGGGC C 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 941 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiable 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GGCACGAGTA GCATTTCATT TAATCTGCAG GTATATTCTC CCAACAGTTT ATTGTCATGT 
GATGTCCTCA GCCAAGATTG TRAGGCAGAG AGGAGCTGTC CCAACCTACT ATACCACCGA 
GGCTCGAGAG ATCATATTTT TGGTATTAAA CTGGAGTCTC TCCATCCTTC ACATTGTTGA 
TGTCCTCTGT AGCAAACCGG AAAAGTCAGT GACAGAAGAT GCCGCTAGCG GTTTGAGCCA 
GAGAATOACA GCTCTGGTTT GGAGAAAAGG GCCGGATGGT GGCTCTAGAA AGCCCATCCT 
TCTCCTCTTC TTTTTTCTCC CCCTTATATT GTGCTTTCAT TCATTCATTC ATTCATCAAA 
CATTTCTTGA GCACCTATTA TGTGTCAAGC TCTGTGCTAG CCTCTGGAAA ACCTGCCCTC 
ATCTAGCTCA CTGTGGAGTA GGAGAAACAA TGACTACACT ATGATAAGCA CGGGTTGTCA 
GQGTCTCACA GAGCAGTGGC CCCTCATCCA GACCGATGAG GTCAAAGAAG GCATCCAGGC 
GAGGATGGTC TCAGAGCTAA CTGAAGAATG AGAGGGAGCT GCACCASCAG GGGTTGGAAC 
TGAAGGTGGC AGTGCCTGGA GTCTTGATTC CAGCAGAGGG AGAQCAGTCT GTGAAAAGGC 
ACCAAGGGTG GGAGAGQGCA GAGCACATGG AGGAACTTCA GGTAGTTCTG GATGGCSCTG 
GGGCAAAQCT AGAGAQGTAA GAAGAATCTA CAAATGTTCC TCGAGTTACA TGAACTTCCA 
TCCCAATAAA CCCATTQGAA AOGAAAAATT TAAGTCAGAA GTGCATTTAA GGCTQGTCCG 
ACTAGAATGA TTTTTACAAC GAATTGATCA CAACCAGTTA CAGATGTCTT TGTTCCTTCT 
CCACTCCCAC TGCTTCACCT GACTAGCCTT TAAAAAAAAA A 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
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CNAGGGATAA CCCCAAAGNT GQGAAATAAA CCCTCAATTA AAGGGGGAAC CAAAAAGCIG 60 

GGAAGTTCCC CCCCGCGGTG GCGGCCNGNT CTAQGAACTA C3TGGAATCCC CCGQGGCTGC 120 

5 AGGGAATTCG GCACGGAGTG GGAATGTTGT TTGTATGATA CTATTTCCAC AAViATGCATT 180 

GAGACTTCGT KTGTGGCCTA GGACATGGTC AATTCTTTYT AAATATTCCG TGAATTTCTT 240 

TAGTGCATAT TCTCCGATOG GGGCTGTGGG GACAGAGTTC TAAATATGCT CATTAGATTA 300 

10 

AATCTCTTCA TTCTCTTCCT CACATCTTCT ATATCCTTAT TAATCTGTCA ATCTCTTCAA 360 

GAGAGGTGTT ATTAAAATCT CTCACTGTAT GTGTCACTTT GCCCTTAAAA TTCTGATGAT 420 

15 TTGCTTTATA AATCGTTATA ACCATTTTCC AGGAAGAACA TTAAAGAACT TTCCATTGGC 480 

ATTATCCAGT TTCCCTCAAA ATACTGGTTT TTTTTATTTT GGCTNCTAAG CAGCTATGAA 540 

TCCAGTTTCT CAGAAGCCCT TGTCTCAAGG CATTTGTTTC CAGATTACCT TCTTAGCATC 600 

20 

CACACTATGG GCTATTTTAG AAAAACAAAA AAAGTATCAA AATCATATAG CTATGATTTT 660 

CXrrGTCCTTG AAGGAQCCTT AAAGCTCATC TAGTCCAGCC AGTATTTGTT CATCCAAATT 720 

25 CTGCCAAGAA ATCTCTATTG TCAAGATATT CTTTACCATC TTTGGGACAT TCTCATTATT 780 

AGAAACAAAT CCTAAGAAGA AATTCTGCCA TAKACAACCC ATCCGTTCfTT TAAAAAAAAA 340 



30 



(2) INFORMATION FOR SEQ ID NO: 15: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:^ 1018 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDED^E:SS: double 

40 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CTGTAATTTT TAATTTTCAT ATACCGTGCT TTGATTCTAA nTTATTTTT TGAGTTCTCT 60 

45 

GAAGCTTACA TATACAGAGT GCTTCAGGAA TGATCATTTT GTTATTATTC ATGCTTCTTA 120 

ACAATCTTGT TTTAGTCCAA GAAGATAATT jGCCAGAGAAA GAATACAGTG CAGGAAAGAA 180 

50 GARGCTGGAG CCAGTGGTGA AGARGGATTG AGARGACAGA CATTGTGGGA ATGAAATCAT 240 

GAATAATCGT GTTTTTGAAT TGTCCAAAAA CTTCTACAAA CCATGAAATG TTGGAGTTTA 300 

AATCTAATTG TTGAAAAATT CCCCACATTC CTTGTATCCC TTAGGTIGflG CATAATTCCA 360 

55 

CATCCGTOGA CTGATX5CACT TCCCAAGAGG QQGCCTCATT AACTCTTCCG AGGCAGCAGC 420 

AGCAAGGGCA CCCCCTCCTT TCCCCCCACA CCCCAYTTCT CATGGCTCTT CTTTCTCTCA 480 

60 TCTCATGCTT AGGTTAGAAA AGGGCACAAG GTAAGGAAGC CCTTOGGAAT AGGCTGAATC 540 
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TOSCTATCTA ATTTGGTGCC AAATACTTAA TGTGCTTGAA TTTAAAAACA GCAAACATGT 600 

AGAAAGCn-AA TTATAATTAT GAGGCCAGTT CTTTAAQCTA GCTTTTrTTC CCCTCTCAAA 660 

5 

CAGCATATTG GCTTGGATGT CAGCAGGAGA AASTGTTTTT TGCAATACAC ATAATGCATA 720 

TATGGTCCTG TTAGCAATCT ATAGAAAATA GATATTGCTC ATTAAGGTAA ATATTTTTGT 780 

10 TGATGAATGA TCTGGAATGG TCTGGACTTG TTGTGTGAAC AGGAAATTGC TCTGTAGGCT 840 

TTCACTTCTG AGGTAAAGAG TGAGGCTGGT AAGATTAATT AAAGTAAATA CTGTGACAAT 900 

AGGATGTCAA AACCAAAAAC GTGTTTCTGA AACTCAAGGA ATTAATGACA CATAGGGAAG 960 

rrTTTCCCAT ATTAAGCATA GAGTAGGAGA GGCAAGTCAA GAATAAAAAA AAAAAAAA 1018 
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(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 661 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEIX^SS: double 

(D) TOPOLOGY: linear 



55 

(2) INFORMATION FOR SEQ ID NO: 17: 
60 (i) SEQUENCE CHARACTERISTICS: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

TTTAAGAAAT TAGTCAATCC CCGGNTGCAG GGAATTCGGC ACGAGGAGGA GGCCGTCAGC 60 

TCGCAGGAGC GCAQGATGGC AGCTGYTCCC CCGGGTTGCA CCCCCCCAGY TCTGCTGGAC 120 

35 ATAAGYTCGT TAACAGAGAG CCTGGGAGCT GGGCAGCCTG TACCTGTGGA GTGCCGGCAC 180 

CGCCTGGAGG TQGCTQGGCC AAGQAAGGGG CCTCTGAGCC CAGCATGGAT GCCTGCCTAT 240 

GCCTGCCAGC GCCCTACGCC CCTCACACAC CACAACACTG GCCTMTCCGA GCTGCTQGAG 300 

CATGGAGTCT GTCAGGAGGT GGAGAGAGTT CGGCGCTCAG AGAGGTACCA GACCATGAAG 360 

GTCCGCAGGG CAGGGCTCGG ACCTACCCCA GGAATGTCCT GCCCTGGGAA TGACAACACA 420 

45 GTCCACACCA TGCACGGGGA GGCAAACAGG GGCAGCTGAC CCAGCCCAGG GGTCAGANGA 480 

GGTCTIGCCG AGGAACTQGC AGCTAAGCTG ATACCTGATA TGCACWAGKC AGCCARGYGG 540 
AGACAGGCAA GGAAGAAGCT TGTTTTGAGG ACAGAATTTT CTAGATCACT CAGCACCATC 
TOGCTTITCG GGCTTTTTGT TTTATTTTGT TTTTGAGACG GGGTCTCGCT CTGTCGCCCA 
N 
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<A) LENGTH: 553 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEC3NESS : doxible 

(D) TOPOLOGY: linear 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GGCACAGGGC TATTTGCCCC TCTCTCCACA TGACAGAACT GCTCTAAGTT TCTTTGCTGC ■ 60 

10 TCTTCTCAGC TGTCAGACGG CTTGCTGCTT GTTTTCCACA CCACCATGTC TATTCTTTGC 120 
TX^CTTOAC TCTCCCTGTT TTTTTCCTTT TGTATTTCTT CTGGCTCTTG TCCCTTTTCC 180 
CACGTCTCWC AGCTTTCCTT TATTGCCACT TTCAGTCAGA GCAGTCCTGT (SCTTCTGGTG 240 
CCGGCATACA ATACTTACTT GAGTTTCTTG CSCTTTTCTTG ACTGTGCATC TCTTACTTCA 300 
ACATAGGAAT AGCCTGTCAT AGAATTTCTC CAGTTCCAGG C3CTCAAGAGG QAGAGTQCCA 360 
20 GAAAATTCAG ACTGTTTTCC CTGTCTTGGA TTGAATTCAT AAAGCAAAAC CAGTGTTTGT 420 
GTCAGGGTTT GCTGTGTCAT GCCTATAC3GT TGTTTGGGTG CAAACCTATA GAATCCAGCC 480 
TGCGAAAAGA AAGRAACCAG AGAATANCAG CATCAGAACA ATGCTTGACA TCATTTCTCA 540 
ATCAAGCAGT CCA 
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(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 869 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



553 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GGCACGAGCT GCCAACACTG AGGTCTTCGT GGCTTCTCAC ATCTAGATGT ATCCCTCTCA 60 

AATCTATCCT CTATCCAGGC ACCAGATTGA GGTATCTAAA ATGTCAACTT TCCAGTTACT 120 

45 CCTTCTTATA CTAGCCCAAT CAACTTACAA GATAAAGTCC AAGCCCCTTC ATATGACAAA 180 

CCACACCCTG CITAACTCTC CAGGTTTGAA TCCTTCATCT CCTACTTTAA ACTTTAAAAC 240 

CCAGCAGCAC GAAAGTCTCT CCTATGCATG TTGCCATATG CGTTCTCTCC ATCATGCATT 300 

TGCCTGAGCA AGATGTCTTG AjGTTAACATC TTATTCTTTA AGACTCATTG TGGTQGTAGA 360 

CAGCCTTTAA TAACGGATCC TTGGCCAGGC ACAGTGACTC ACACCTGTAA TCCCAGAACT 420 

55 TTGAAAGGCC AAAGAAGGAA GAAAGCTTGA GGCCAGTAGT TTGAGACCAG CCTGGGAAAC 480 

AGAGAGATAT CCCATCTCTA CCAAAAATTT AAAAAAATAT TAGCAGGGAG TAGTGGCATG 540 
CACAAGTGGT CCCAGCTCCA TGGGAGASTG AGGTAGGAAC ATCACTTGAG CCCAGGAAGT 

60 
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CAAGGCTCCA GTCAACCATG ATCAGAACAT TGCAMCCAG CTTGGGTAAC AGAGTGAGAC 660, 
CTTAGGTCAG AAAAATGAAT AAATAAGCAT AAAATTTTAA AAACTTAGCC AGGCATGGTG 720 
5 GCACACATXrr GTCGTCCCTG CTACTTAGGA GGCTGAGGTG AGAGGATCCT TGAGCCCAGG 
AGOTCAACAC TACAGTGAGC TATGATTGTG CCACTAAACT CCAACCTGGG TGAAAAAGCA 
AAACCCTGCC AAAAAAAAAA AAAAAAACT 

10 



780 
840 
869 
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(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 959 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
20 (D) TOPOLOGY: linear 



25 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GGCGAGCCGA GATCGTGCCA TTGCACTCCA GCCIGGGCAA CAAGAGTGAA ACTCTGTCTC 60 

AAAAAAAAAA AATTATAATA CTATATGCCA TAAAATGACA TTTCATATTT AAAGAGTTTT 120 

TTAAAACTCT TCTATTCACA TGCCATAATT TGAAACOCTA TTTCACTGAA TGAGAATGGT 180 

30 ATCTOITCTC CTCATTTTTT CATTTTTATC CTTAACAATT TCCACCACAG CCAGTGCATA 240 

TAATGGCAAT GACACCCAGG GATGGAATGA TAAGTTCCAT CRCMSCTCAG TCAAGACGCA 300 

GACTTCATCT GGCCCCAACA ACAGTCAATA ATGGAGTCTC CAAAATAAAG CTCTATAQGA 360 

AAGGTAAATA CCCGCTGCAC AAGAAACCAC AGCATCTAGG TTCTAACCCC ATCTCTATGA 420 

AGAGCTTCCT GGGAGAGTTT TGACATTOAA CAATCTGTCT GATKGCCAAT TTTYTTCTTC 480 

40 TATAAAATGA TAATOTTKGA YTCAAAGATC CAAAGTCAAT TCATGGTCTA AAACTTAATG 540 

ATTTirrTAG GTTTTGKGAC ATTTCACTGT ACACTGTAGT AATTTATATC TTATTTTCCC 600 

ACTAATTTAG AAAAATATYT AAATGATCCT TAATTQGCAA TQGGTCCTAA GAATrTTGTT 660 

TTAAATCCCT GTTACCCAAA AGAGCCCrPT TTTGTATCTC GCAGTAOTTA CAAGGATCTT 720 

TCTAAATCTT AAAAAAAAAA AAAAAAGAAA GAAAGAAAAG AAAAGAAAAA AAGTCAGCCG 780 

50 GGCGTGOTGG CTCATGCCTG TAATCCCAGC ACTTTGGGAC CAAGGTGGAC AGATCACGAG 840 
GTCAGGAGAT GGAGACCATC COSGCCAACA TGGAGAAACC CTGTCTCTAC TAAAAAAAAA 



900 



AAAAACTCGA GQQGGGCCCG GTACCCAATN CGCCGGCTAG TGGTCGTAAA ACAATCAAA 959 



60 



(2) INFORMATION FOR SEQ ID NO: 20: 



wo 98/54963 



277 



PCTAJS98/11422 



10 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1446 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

CGGQGCAGQG CTGTGTGGCA CCXXXTAGGGA GCGGGCCCAC CTGAGTCACT TTATTGGGTT 60 

CAGTCAACAC 'ITiVr mC TC CCTGTTTICT CTTCTGTGGG ATGATCTCAG ATGCAGGGGC 120 

TGGTTTKSGG GTTTTCCTGC TTGTGCCAAG GGCTGGACAC TGCTGGGGGG CTGGAAAGCC 180 

15 CCTCCCTTCC TGTCCTTCTG TGGCCTCCAT CCCCTCATGG GTGCTGCCAT CCTTCCTGGA 240 

GAGAGGGAGG TGAAAGCTGG TGTGAGCCCA GTGGGTTCCC GCCCACTCAC CCAGGAGCTG 300 

GCTGGGCCAG GACCGGGAGA GGGAGCACTG CTGCCCTCCT GGCCCTGCTC CTTCCGCAGT 360 

TAGGGGTO6A CCGAGCCTCG CTTTCCCCAC TGTTCTGGAG GGAAGGGGAA GGAGGGGGTC 420 

TTCAGGCTOG AGCCAGGCTG GGGGTGCTGG GTGGAGAGAT GAGATTTAGG QGGTGCCTCA 480 

25 TCGGGTOGGC AGGCCTGGGG TGAAATRAGA AAGGCCCAGA ACGTGCAGGT CTGCGGAGGG 540 

GAAGTOTCCT GAGTGAAGGA GQGGACCCCC ATCCTGGQGG ATGCTGGGAG TGAGTGAGTG 600 

AGATGGCTGA GTCAGGGTTA TGGGGAGCCT GAGGTTTTAT GGGCCTGTGT ATCCCCTTCT 660 

CCCGGCCCCA GCCTGCCTCC CTCCTGCCCG CCTGGCCCAC AGGTCTCCCT CTGGTCCCTG 720 

TCCCTCTCGT QGTTGGGGAT GGAGCGGCAG CAAGGGGTGT AATGGGGCTG QGTTCTGTCT 780 

35 TCTACAQGCC ACCCCGAGGT CCTCAGTGGT TGCCTGGGGA GCCGGACGGG GCTCCTGAGG 840 

GGTACAGGTT GGGTCGGCCC TCCCTGAGGG TCTQGGGTCA GGCTTTGGCT CTGCTGCCTC 900 

OXy^GTCACCA AGTCACCTCC CTCTGAAAAT CCAGTCCCTT CTTTGGATGT CCTTGTGAGT 960 

CACTCTCGGC CTCGCTGTCG TCCCTCCTCA GCTTCTTGTT CCTGGGACAA GGGTCAAGCC 1020 

AGGATGGGCC CAGGCCTGGG ATCCCCCACC CCAGGACCCC CAGGCCCCCT CCCCTGCTGC 1080 

45 TTTGCGGGGG GCAGGGCAGA AATGGACTCC TTTTGGGTCC CCGAGGTGGG GTCCCCTCCC 1140 

AGCCCTOCAT CCTCCGTGCC STAGACCTGC TCCCCAGAGG AGGGGCCTTG ACCCACAGGA 1200 

CGKJIGGTGG CGCCTOGCAC TCAGGGACCC CCAGCTQCCC CAGCCCTGGT CTCTGGCGCA 1260 

TCTCTTCCCT CTTOICCCGA AGATCTGCGC CTCTAGTGCC TTTTGAGGGG TTCCCATCAT 1320 

CCCTCCCTGA TATTGTATTG AAAATATTAT GCACACTGTT CATGCTTCTA CTAATCAATA 1380 

55 AACGCTTTAT TTAAAGCCAA AAAAAAAAAA AAAAAACTCG AGGGGGGGCC CGTACCCAAT 1440 

TCGCCA 1^^^ 

60 
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(2) INFORMATION FOR SBQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 1471 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

CAAAAAATAA TAATCATAAT TTAAAATAAA TAAGTAACTA ATAAAAAGAT TTTATATCCC 60 

AGTCTTATGA TGTTGGTTGG CAAGGCTAGA TAAAAAGATG TTAGAATGAA AGAACATATT 120 

15 

TTTAGTGATA TGTAAATGAA GGATTCTACA ATAGTCATAT ATTTTTATAT GAATGAATGT 180 

TGGGTTGGGC TGGAGAQGTA TGTGTGTGTA AATATAAAGG TCTCACATTC AGAGTATAGC 240 

20 TCTGAAATAA TGGAACTCAT GTCTACAATT CAACATGCAT CTGTATAGTT ACATCTCATG 300 

TAAATATACA CAGACATATT TTGCAGCCAG TAATTGACAG TTAATGTCCA AAACAGGTGA 360 

TTGATAGGTA ACAGAAATTA GATAACCACC AATTTTGCCC AAGAGAAAGA CTAGAAGGAC 420 

25 

TAAAAGCAGT TGAATGTATG GTACTGACAT TGTCATAAGC AGTCTGATAA CCAGTTTATT 480 

GAAACGTCTO CATTAACAGA GAATTTAATT TTAAACCCAT AATTTCTCCT ATCCATTAAA 540 

30 ATATTATAAT TGTTAGTAGT ATGAAACCAA CAGGAAATGT TTTTTAATCA TTTAGTGAGG 600 

TGATTCATTT GTTTCATGGG CAAACACTAT CCAQGAAAAG CCTTGCTTGC CTGTTTCCCA 660 

AAGAGCTCTA AGAAATAGAA TCAAGTGTAA AATGGTTCAG ACCATTCAGG ATTTCTTGTC 720 

35 

ACTCTTCTCA ACCCCGATCT TCCTGTTATT ACTGATGTTT GAA2^CCTGT CATTAGCCCC 780 

GGCCTCGTTA AAGCCCCTCA GAGTCACCTC TCATTCATAG CAATAGAATT CAACCCCAAG 840 
40 TOGTTCATGG TGTCCCCAGC ACAGCCGAGA GACCTGATCT CTGGATTCAG TGCTTTTAGC 900 
TCTTCGAGTT TACCCTAAGA TACCTTCGQG CAATATTTTT AACCAACCCA AAAGCTCTTC 960 

AGGTCATTTC TCAAGAGGAC AAGGTGAATC TrGQCTTGGA ACACCATTTT TGGQCTCTTG X020 

45 

CTACTCAATG AATCAGAAAG GAATTTTTTC TCAAGAGCAT TAGAAAGTAA AGGAGATGTT 1080 

AAAATAAGTT CTTGAAGTAT GTTTTATATT TATCTAAAAC ACTGATTTTA AAAGTTTACA 1140 

50 TTCAAATCTG TATTCAAAAG AAGTACTGAT TTGTAATTAT TATAGTrTGT GTGTATCATC 1200 

CCCTTTTAAC OGTGCCTAAC AACTGTACTT AAATTTTGTT TTCCTAGTGT AACAAATGTT 1260 

TCCCATAAGA TTTTCTAGAG CCAAATAATG GGAGTGAAAA ATTCCTTAAG TGTTATATAA 1320 

55 

GAAAATATAT TAGAAAATCA GCTTTGGATT ATACGATTTC TAAAATATAC TAATACAGAA 1380 

TCCTCAGTAA TATGTTTTGA ATIGGATTTT TTCTCAGAAC TGTTACATAA TAAATAATAC 1440 

60 ATCAACCAGA AAAAAAAAAA AAAAAAATTN C 1*71 
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5 (2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1402 base pairs 

(B) TYPE: nucleic acid 
10 iC) . STRANDEDNESS : double 

(D) T0P0IXX3Y: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
15 AGGGACCTCT TGCCTGAQGA GATGCCCATT TCTGTCCTGG RTTACCCTCA CTGCGTQGTG 60 
CATCAGCTGC CAGAGCTGAC GGCGGAGAGT TTGGAAGCAG GTGACAGTAA CCAATTTTGC 120 
TGGAGGAACC TCTTTTCTTG TATCAATCTG CTTCGGATCT TGAACAAGCT GACAAAGTGG 
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AAGCATTCAA GGACAATCAT GCTGGTGGTG TTCAAGTCAG CCCCCATCIT GAAGOGGGCC 240 



300 



CTAAAGGTGA AACAAGCCAT GATGCAGCTC TATGTGCTGA AGCTGCTCAA GGTACAGACC 
25 AAATACTTGG QGCGGCAGTC GCGAAAGAGC AACATQAAGA CCATGTCTGC CATCTACCAG 360 
AAGGTGCGGC ATCGGCTGAA CGACGACTGG GCATACGGCA ATGATCTTGA TQCCCGGCCT 420 
TGGGACTTCC AGGCAGAGGA GTGTGCCCTT CGTGCCAACA TTGAACGCTT CAACGCCCGG 480 
CQCTATGACC GGGCCCACAG CAACCCTGAC TTCCTGCCAG TGGACAACTG CCTGCAGAGT 540 
CyrcCTCGGCC AACGGGTGGA CCTCCCTGAG GACTTTCAGA TGAACTATGA CCTCTQGTTA 
35 GAAAGGGAGG TCTTCTCCAA GCCCATTTCC TGGGAAGAGC TGCTGCAGTG AGGCTGTTQG 



600 
660 



TTAGGGGACT GAAATCGAGA GAAAAGATGA TCTGAAGGTA CCTGTQGGAC TGTCCTAGTT 720 

CATTGCTGCA GTCCTCCCAT CCCCCACCAG GTGGCAGCAC AGCCCCACTG TGTCTTCCGC 780 

A£?rCTCTCCT GGGCTTCGGT GAGCCCAGCT TGACCTCCCC TTGGTTCCCA GGGTCCTGCT 840 

COGAAGCACT CATCTCTGCC TGAGATCCAT TCTTCCTTTA MTTCCCCCAM CCTCCTCTCT 900 

45 TGGATATGGT TQGTTTTGGC TCATTTCACA ATCAGCCCAA GGYTGGGAAA GCTGGAATGG 960 

GATGGGAACC CCTCCGCCGT GCATCTRAAT TTCAGGGGTC ATGCTGATGC CTCTCGAGAC 1020 

ATACAAATCC TTCCCTTTOT CAGCTTQCAA AGGAGGAGAG 1TTAGGATTA GGGCCAGGGC 1080 

50 

CAGAAACTCG OTATCITCGT TGTGCTCTGG GGTGGGQGTG GGGTGTTTCT GATGTTATTC 1140 

CAGCCTCCTC CTACATTATA TCCAGAAGTA ATTQCQGAGG CTCCTTCAGC TGCCTCAGCA 1200 

55 CTTTGArrTT GGACAGGGAC AAGGTAGGAA GAGAAGCTTC CCTTAACCAG AGGQGCCATT 1260 

TTTCCTTTTG GCTTTCGAGG GCCTGTAAAT ATCTATATAT AATTCTGTGT GTATTCTGTG 1320 

TCATCTIGGG CTTTTTAATC TGATTGTGTA TTCTGTTTAC ATTAAAAAGA AGCAAAAATA 1380 

60 
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ATAAAAAAAA AAAAAAAAAA CT 1402 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHAKACTERISTICS : 

(A) LENGTH: 1047 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GGCACAQGGG ACTACAGGCA CCCACGACCA TACCCAGCTA ATTTTTGTAT TTTTTTGTAG 60 

AGATOGGGTT TCACGATGTC GCCCAGGCTG GTCTTGAACT CCTGGGCTTG AGCGATCTTC 120 

20 CCATCTTTCC ATCTTGGCGT CCTAAAGTGC TGGGACTGCA GGCATGAGCC ACCATGCCCA 180 

GCCAAGATTC TTATTGATTA CCATGTTQCT TCAAGAAGCC AAGCCAGTTT CCAATATTCC 240 

CCATTTGCTG GAGTCTTGGT ACTTTGGGTA GAAGCAACTG GTAAATTGTT AATTGGAACA 300 

NTTGGTQGTG TAGATAACCA CGTATGGCCA AACCTAGAGC ATCTAGGCTC ACAATTACTA 360 

TCCTGACTTG ATAACAAGTG TTCTGATATT AACCTGAAAA TQGGAATAAT GCCAAATCTG 420 

30 TCTAACTTAA CAlCTATATA CACAGTGGGG AGAACTGAAG TTATTAAACC TGGAATCTCT 480 

GTGATCAAGG CTAACAGTAG TTATCTAAGA AGCAAAGGAC CTACAATTCT TAGACTTGGA 540 

GTCATATTCT TTAAGGACGT GTTCTGAAAC TATATCAAGC ATCTGGTTTC CACGTATTTC 600 

TCCCTCAGAA ATTATGAAGT ACAAGTAAAA ATGAAGGTAC AGGGTAAGAC ACATGCTGCT 660 

TTCTTCCTCT OXSAGTGGAGA CAGTTTTCCA GCCATCTTAA CCCCTTWACA CAAAACAATT 720 

40 TGTGITTrAT AGCAAATAAG TGACTCAACA TAATTTCAAT ATGATGITTA TCCACCAGTA 780 

CTTTCCTTTC AGCTTCTAGT CCCATAARTG GTTTGTGAAG TCATCGGTTA CATTAGCCAA 840 

GATAGGCCTA GACTTCAAGT CTAGAATGTT TTTCCCACTA TATGCCAAAG TAGAATGTGG 900 

CyrATCTCAGG GTCATTTTTG TTGTTCAATT TCCCACCTGT ACAGTTGTTA TGATTCACTT 960 

TCCTTATGTG TCTAATAAAT CTTGTTCCAT GAAATGATCA AAAAAAAAAA AAAAAAAACT 1020 

50 CGAGGGGGGG CCCGGTACCC AAATCGC 1047 

55 (2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 990 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTIOI: SBQ ID NO: 24: 

5 TTGGAAAGGG TCTAGCTCTT TCTCATTCAC CAACTATATT AGAAGCACTT GAGGGAAATT 60 

TACCACTCCA AATCCAAAGC AATGAACAGT CTTTTCTGGA TGATTTTATT GCCTGTGTCC 120 
CAGGATCAAG TGGTGGAAGG CTTGCAAGGT GGCTTCAGCC AGATTCATAT GCGGATCCTC - 180 

10 

AGAAAACATC TTTGATCCTG GAATAAGGAT GATATTCGTT GTCGTTGGCC TACCACCATA 240 

ACTGTTCAAA CAAAAGACCA GTATGGGGAT GTGGTACATG TTCCCAATAT GAAGGTAATT 300 

15 ATAACTGGAT TAAATTAGCA GACATCTATA TACTGGCTGC AATGACTGAT AAAATTTTAG 360 

AAATGCCAAG TGCTGAGRGT CCATTTGTTC TACCCTCTTT ATATAAAGGG TGATGCTGAA 420 

AGTTTGTTTA AATGACTTGT TTATATTAAT TAGTCCCCAA GTGTCCAAGT TACACCTGTT 480 

20 

TTTTTTGTGA GTlTOrTCTT TACATTTTGC TACCTGTTAC GGGGACTCAA AGGAGGGATA 540 
AGAAAGTATC CATCTAAAGA GTGCTAGACA CATACAGTGA AGCCCCTCAA TATGTATTGA • 600 

25 TTCAATAAAT GCATGAAAGA ATACATTTTT AAATTTTGTG TATAGTTTTG AAAGACTCAA 660 

GTACGTTCTG TGTTTGGTAT TACTGAAACC ACATTTTAAA AATAACACTC ATTAAGTTAG 720 

AAATATATCA GTTTAGATTG TAAAAGAATG AGGAATTGAA ATAGTTGTAT ACCATATTGA 780 

30 

TGAATATAGA GTTTTTAGGA TACCTCTTAC CTGAAATATT AATAATAATG TTTNCAGAGC 840 

ATATTATACA TAATTATTTG TGATTTAATC TGTTAATATG AATATCTCAT TTAAAACTTT 900 

35 TATTTCTGAA AAAATTATAT TGAATAAAAT TTTATATAGG CAGTCCCCAG CCCTTTCCTC 960 

CTTCAAAGTT GTCTTATAGA GTGATTGGTT 990 



40 



(2) INFORMATION FOR SBQ ID NO: 25: 



(i) SEQUENCE CHT^RACTERISTICS: 
45 (A) LENGTH: 1208 base pairs 

<B) TYPE: nucleic acid 

(C) STRANI^DNESS: double 

(D) TOPOLOGY: linear 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

TAATCGCTAC TATAGGGAAA GCTGGTCGCT GCAGGTACCG GTCCGGAATT CCGGGTCGAC 60 

CCACGCGTCC GAGCGAAATG GCGCCTCCGG CCCCCGGCCC GGOCTCCGGC GGCTCCGGGG 120 

55 

AGGTAGACGA GCTGTTCGAC GTAAAGAACG CCTTCTACAT CGGCAGCTAC CAGCAGTGCA 180 

TAAAOGAGGC GCASGGGTGA AQCTRTCAAG CCCAGAGAGA GACGTGGAGA GGGACGTCriT 240 

60 CCTGTATAGA GCGTACCTGG CGCAGAGGAA GTTCGG T GTG GTCCTGGATG AGATCAAGCC 300 
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CrcCrTCGGCC CCTGAGCTCC AGGCCGTGCG CATGTTTGCT GACTACCTCG CCCACX5AGAG 360 

TCGGAGGGAC AGCATOGTGG CCX3AGCTGGA CCGAGAGATG AGCAGGAGCK TGGACGTGAC 420 

5 

CAACACCACC TTCCTGCTCA TGGCTGCCTC CATCTATCTC CACGACCAGA ACCCX3GATGC 480 

CGCCCTCCCT GCGCTGCACC AGGQGGACAG CCTGGAGTGC ACAGCCATGA CAGTGCAGAT 540 

10 CCTGCTGAAG CTGGACCGCC TGGACCTCGC CCQGAAGGAG CTGAAGAGAA TGCAGGACCT 600 

GGACGAGGAT GCCACCCTCA CCCAGCTCGC CACTGCCTGG GTCAGCCTGG CCACGGGTGG 660 

TGAGAAGCTG CAGGATGCCT ACTACATCTT CCAGGAGATG GCTGACAAGT GCTCGCCCAC 720 

CCTGCTCCTG CTCAATOGGC P^GCCGGCCTG CCACATQGCC CAGGGCCGCT GGGAGGCCGC 780 

TGAGGGCXTTG CTCCAGGAGG CGCTAGACAA QGATAGTGQC TACCCRGAGA CGCTGGTCAA 840 

20 CCTCATCGTC CTCTCCXy^ ACCTKGGCAA GCCCCCTGAG GTGACAAACC GATACCTGTC 900 

CCAGCTCAAG GATCCCCACA GGTCCCATCC CTTCATCAAG GAGTACCAGG CCAAGGAGAA 960 

CGACTTIGAC AGGCIGGTGC TACAGTACGC TCCCAGCGCT GAGGCTGGCC CAGAGCTGTC 1020 

AGGACCATGA AGCCAGGACA GAGGCCAGGA GCCAGCCCTG CAGCCCTCCC CACCCGGCAT 1080 

CCACCTGCAT CCCTCTQGGG CAGGAGCCCA CCCCCAGCAC CCCCATCTGT TAATAAATAT 1140 

30 CTCAACTCCA RGGTGTICCA CCTGAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1200 
AAAAAAAA 
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(2) INFORMATION FOR SEQ ID NO: 26: 



1208 



(i) SEQUENCE CHARACTERISTICS: 
40 <A) LENGTH: 1922 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS : double 

(D) TOPOLOGY: linear 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

carccTCCGCT ACTGAGCAGC GCCATGGAGG ACTCTGAAGC ACTGGGCTTC GAACACATGG 60 

GCCTCGATCC CCGGCTCCTT CAQGCTGTCA CCGATCTGGG CTGGTCGCGA CCTACGCTGA 120 

'TCCAGGAGAA GGCCATCCCA CTQGCCCTAG AAGGGAAGGA CCTCCTGGCT CGQGCCCGCA 180 

CGGGCTCCGG GAAGACQGCC GCTTATGCTA TTCCGATGCT GCAGCTCyrTG CTCCATAGGA 240 

55 AGGCGACAGG TCCGGTGGTA GAACAGQCAG TGAGAGGCCT TGTTCTTGTT CCTACCAAGG 300 

AGCTGGCACG GCAAGCACAG TCCATGATTC AGCAGCTGGC TACCTACTGT GCTCGGGATG 360 

TCCGAGTCGC CAATOTCTCA GCTGCTGAAG ACTCAGTCTC TCAGAGAGCT GTGCTGATGG 420 

60 
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AGAAGCCAGA TCTGGTAGTA GQGACCCCAT CTCGCATATT AAGCCACTTG CAGCAAGACA 480 

GCCTGAAACT TCGTCACTCC CTGGAGCTTT TGGTQGTGGA CGAAGCTGAC CTTCTTTTTT 540 

CCTTTCGCTT TGAAGAAGAG CTCAAGAGTC TCCTCTGTCA CTTGCCCCGG ATTTACCAGG 600 

CTTTTCTCAT GTCAGCTACT TTTAACGAGG ACGTACAAGC ACTCAAGGAG CTGATATTAC 660 
ATAACCCGGT TACCCTTAAG TTACAGGAGT CCCAGCTGCC TGGGCCAGAC CAGTTACAGC . 720 

ACnnCAGGT GGTCTCTGAG ACTGAGGAAG ACAAATTCCT CXTGCTGTAT GCCCTGCTCA 780 

AGCTCTCATT GATTCGGGGC AAGTCTCTGC TCTTTGTCAA CACTCTAGAA CGGAGTTACC 840 

15 GGCTACGCCT GTTCTTGGAA CAGTTCAGCA TCCCCACCTG TGTGCTCAAT GGAGAGCTTC 900 

CACTGCGCTC CAGGTGCCAC ATCATCTCAC AGTTCAACCA AGGCTTCTAC GACTGTGTCA 960 

TAGCAACTGA TCCTGAAGTC CTGGGGGCCC CAGTCAAGGG CAAGCGTCGG GGCCGAQGGC 1020 

CNAAAGGGGA CAAGGCCTCT GATOCGGAAG CAGGTGTGGC CCGGGGCATA GACTTCCACC 1080 

ATCTOTCTGC TGTCCTCAAC TTTGATCTTC CCCCAACCCC TGAGGCCTAC ATCCATCGAG 1140 

25 CIGGCAGGAC AGCACGOGCT AACAACXX7W5 QCATAGTCTT AACCTTTGTG CTTCCCACGG 1200 

AGCAGTTCTA CTTAGGCAAG ATTGAGGAGC TTCTCAGTGG AGAGAACASG GGCCXXATTC 1260 

TGCTCCCCTA CXIAGTTCCGG ATGGAGGAGA TCGAGGGCTT CXXOTATCGC TGCAGGGATG 1320 

CCATGCGCTC AGTGACTAAG CAGGCCATTC GGGAGGCAAG ATTGAAGGAG ATCAAGGAAG 1380 

AGCTTCTCCA TTCTGAGAAG CTTAAGACAT ACTITGAAGA CAACCCTAGG GACCTCCAGC 1440 

35 TCCTCCQGCA TGACCTACCT TTGCACCCCG CAGTGGTGAA GCCCCACXMG QGCCATGTTC 1500 

CTGACTACCT GGTTCCTCCT GCTCTCCGTG GCCTGGTRCG CXXTCACAAG AAGCGGAAGA 1560 

AGCTGTCTTC CTCTTGTAGG AAGGCTAAGA GAGCAAAGTC CCAGAACXCA CTGCGCAGCT 1620 

TCAAGCACAA AGGAAAGAAA TTCAGACCCA CAGCCAAGCC. C?rCCTGAGGT TGTTGGGCCT 1680 

CTCTGGAGCT GAGCACATTG TGGAGCACAG GCTTACACCC TTCGTGGACA GGCGAGGCTC 1740 

45 TGGTGCTTAC TCCACAGCCT GAACAGACAG TTCTGGGGCC GGCAGTGCTG GGCCCTTTAG 1800 

CTCCTTGGCA CTTCCAAGCT GGCATCTTGC CCCTTGACAA CAGAATAAAA ATTTTAGCTG 1860 

COCCAAAAAA AAAAAAAAAA AAAAAAACTC GAGQGGGGGC' CCGTACCCAA TTCQCCCTAT 1920 

AA 1922 
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(2) INFORMATION FOR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1951 base pairs 
60 (B) TXPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 27: 

5 

TCGTCCCCAG AGCGGGCTGA GCCCCAGGCG SAQGGTGGCG GGGGAGCCTG GGGGAGCCGC 60 

CGCCACCTCC ACGGGCCTCT CTGAGCTCGG ACACCAC3CGC CCTCTCCTAT GACTCTGTCA 120 

10 AGTACACGCT GGTGGTAGAT GAGCATGCAC AGCTGGAGCT GGTGAC3CCTG CGCCGTGCTT 180 

CGGAGACTAC AGTGACGAGA GTGACTCTGC CACCGTCTAT GACAACTGTG CCTCCGTCTC 240 

CTCGCCCTAT GAGTCGGCCA TCGGAGAGGA ATATGAGGAG GCCCCGCQGC CCCAGCCCCC 300 

TGCCTCCCTC TCCGAGGAAC TCCACGCCTG ATGAACCCGA CGTCCATTTC TCCAAGAAAT 360 

TCCTGAACOT YTTCATGAGT GGCCGCTCCC GCTCCTCCAG TGCTGAGTCC TTCGGGCTGT 420 

20 TCTCCTGCAT CATCAACGGG GAGGAQCAGG AGCAGACCCA CCGGGCCATA TTCAGGTTTG 480 

TCCCTCGACA CGAAGACGAA CTTGAGCTGG AAGTQGATGA CCCTCTGCTA GTGGAGCTCC 540 

AGGCTGAAGA CTACTGGTAC GAGGCCTACA ACATGCGCAC TGGTGCCCGG GGTGTCTTTC 600 

CTCCCTATTA CGCCATCGAG GTCACCAAGG AGCCCGAGCA CATGGCAGCC CTGGCCAAAA 660 

ACAGTGACTG GGTGGACCAG TTCCQGGTGA AGTTCCTGGG CTCAGTCCAG GTTCCCTATG 720 

30 ACAAGGQCAA TGACGTCCTC TGTGCTGCTA TGCAAAAGAT TGCCACCACC CGCCGGCTCA 780 

CCGTCCACTT TAACCCGCCC TCCAGCTGTG TCCTGGAGAT CAGCGTGCQG QGTGTGAAGA 840 

TAGQCGTCAA GGCCGATCAC TCCCAGGAG6 CCAAGGQGAA TAAATGTAGC CACTTTTTCC 900 

AGTTAAAAAA CATCTCTTTC TGCGGATATC ATCCAAAGAA CAACAAGTAC TTTTCGCTTCA 960 

TCACCAAGCA CCCCGCCGAC CACCGGTTTG CCTGCCACGT CTTTGTGTCT GAAGACTCCA 1020 

40 CCAAAGCCCT GGCAGAGTCC GTGGGGAGAG CATTCCAGCA GTTCTACAAG CAGTTTGTQG 1080 

AGTACACCro CCCCACAGAA GATATCTACC TGGAGTAGCT GTGCAGCCCC GCCCTCTGCG 1140 

TCCCCCAGCC CTCAGGCCAG TGCCAGGACA GCTGGCTGCT GACAGGATGT GGCACTGCTT 1200 

GAGGAGGGGC ACCTCCCACC GCCAGAGGAC AAGGAAGTGG GGCQCTQGCC CAGGQTAGGG 1260 

GAGGGTGGGG CAA1GGGGAG AGGCAAATGC AGTTTATTGT AATATATGGG ATTAGATTCA 1320 

50 TCTATGGAGG GCAGAGTCGG CTGCCTGGGG ATTGGGAGGG ACAGGGCTTG GGGAGCAGGT 1380 

CTCTGGCAGA GAAGGATGTC CGTTCCAGGA GCACACGGCC CTGCCCCATC CTGGGCCTTA 1440 

CCTCCCCTGC CAGGGCrCGG GCGCTGTGGC TCCTGCCTTG ATGAAGCCCG TGTCCTGCCT 1500 

TCATGAAGCC TOIX3CCACCT GCAAGTGCCC GCCCTQCCCC TGCCCCAACC CCCACCGAAG 1560 

AGCCCK5AGC TCAGGCTGAG CCCAGCCACC TCCCAAGGAC TTTCCAGTGA GGAAATGC3CA 1620 

60 ACACGTOGAG GTCAAGTCCC TGrTCTCAGC TCCGTCATCT GCGQGGCTTC TGGGTGGCTC 1680 
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CTCCCACTGA CCTCACCGGC ATGCTGGCCT GTGGCAGGCC Ti^GGACCTCA GGCGGGGAGG 1740 

AGGAGCTCCC GCAAGGCCCT GTCCXTAGCAG AAGAGGGAGG CTTCCTGACT GACACAGGCC 1800 

5 

AGCCCCATCT TQGTCCTGTC ACCCTGGCCC CAACTATTAA AGTGCCATTT CCTGTCAAAA 1860 

AAAAAAAAAA AAAATCGGQG GGGGCCCGGA ANCCAATTTC CCCCAAAAAG GGGGGTTATA 1920 

10 AAAATTCCCN GGCNGTGTTT TTAAAAATTC G 1951 



15 (2) INFORMATION FOR SEQ ID NO: 28: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3989 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : double 

(D) TOPOLXXfY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

25 GGCACAGGCC GCAGG<3IACC TATGGGCGCA TATAGGTTGT AATGAAACTG TAGTCTCAGT 60 

TOGAAGCCTA GACATGAAAT GGGTCAGTGA GCAAGGCTCT ATTCCTAGTC TCCAGCCATG 120 

CCTGTCGAAC CTGARCCCRC TCTCAGCACA TTGGACCCAG GCAGATGYAA AAAATTCACA 180 

30 

GAACTATCAT TTCGACTCAA GGGTTTGTAG ATTTCCTCCT TCATTCTAAT TTCAGTGTCT 240 

AAAATTCTTC CATCCRTGAA CGAGCTGGGC ATTTGATGAG ACAQGGCYGA ATACTGCAGT 300 

35 TTTCCTCCTA GAAATCATCT GGGGCATTTT CTTTGAACTG ATQQGAACAA TAAGGCATAA 360 

CTGTTTOCAC AAACTTCGGA TAARTGATTT TGGGATAACG ATCTACCAGA ATGGGGATAT 420 

TTCACCCTTG GTTCTGAGAT GCAAACCAAA GAATATCATG ACCAGCTTTC AGGCCTCCTG 480 

40 

AAGTATATCT CTCACATTGT CCTGTTCTCA TGCTGAQGAG CCTGAGATCC CTGTGTGGGG 540 

ATTAGACAGT QGACTGTTAT GGGTGTAGGT GAATTGGCTT ATTTTGTCTG TCCCTGTCTG 600 

45 AATGTATTGC AGGAAYTAAA AAGGACCAAG AAGAGGAAGA AGACCAAGGC CCACCATGCC 660 

CCAGGCTCAG CAGGGAGCTG CTGGAGGTAG TAGAGCCTGA AGTCTTGCAG GACTCACTGG 720 

ATAGATCSTTA TTCAACTCCT TCCAGTTGTC TTGAACAGCC TGACTCCTGC CAGCCCTATG 780 

50 

GAAGTTCCTT TTATGCATTG GAGGAAAAAC ATGTTGGCTT TTCTCTTGAC GTGGGAGAAA 840 

TTCAAAAGAA GGGGAAGGGG AAGAAAAGAA QGQGAAGAAG ATCAAAGAAG GAAAGAAGAA 900 

55 GGGGAAGAAA AGAAQGGGAA GAAGATCAAA ACCCACCATG CCCCAGGCTC AGCAQGGAGC 960 

TGCTGGATGA GAAAGRGCCT GAAGTCTTGC AGGACTCACT GGATAGATGT TATTCAACTC 1020 

CTTCAGTTCT GTTCAACTGT GTGACTCATG CCAQCCCTAC AGAAGTGCCT TTTATGTATT 1080 

60 
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QGAGCAACAG CATGTTGGCT TGGCTGTTGA CATGGATGAA ATTGAAAAGT ACCAAGAAGT 1140 

GGAAGAAGAC CAAGACCCAT CATGCCCXTAG GCTCAGCAGG GAGCTGCTGG ATGAGAAAGA 1200' 

GCCTGAAGTC TTGCAGGACT CACTGGATAG ATGTTATTCG ACTCCTTCAG GTTATCTTGA 1260 

ACTGCCTGAC TTAGGCCAGC CCTACAGCAG TGCKGTTTAC TCATTGGAGG AMCAKTACCT 1320 
TGGCTTKKCT CTTGACGTGG ASAAATTGAA AAGAAGGGGA AGGGGAARAA AAGAAGGGGA - 1380 

AGAAGATCAA AGAAGGAAAG AAGAAGGGGA AGAAAAGAAG GGGAAGAAGA TCAAAACCCA 1440 

CCATGCCCCA GGCTCAGCAG GGAGCTGCTG GATGAGAAAG QGCCTGAAGT CTTGCAGGAC 1500 

TCACTGGATA GATGrTTATTC AACTCCTTCA GGTTGTCTTG AACTGACTGA CTCATGCCAG 1560 

CCTTACAGAA GTGCCTTTTA YRTATTGGAG CAACAGYGTG TTGGCTTGGC TGTTGACATG 1620 

GATGAAATTG AAAAGTACCA AGAAGTQGAA GAAGACCAAG ACCCATCATG CCCCAGGCTC 1680 

AGCAGGGAGC TGCTGGATCA GAAAQAGCCT GAAGTCTTGC AGGACTCACT GGATAGATGT 1740 

TATTCGACTC CTTCAGGTTA TCTTGAACTG CCTGACTTAG GCCAGCCCTA CAGCAGTGCT 1800 

GITTACTCAT TGGAGGAACA GTACCTTGGC TTGGCTCTTG ACGTGGACAG AATTAAAAAG 1860 

GACCAAGAAG AGGAAGAAGA CCAAGGCCCA CCATGCCCCA GGCTCAGCAG GGAGCTGCTG 1920 

GAGGTAGTAG AGCCTGAAGT CTTGCAGGAC TCACTGGATA GATGTTATTC AACTCCTTCC 1980 

AGTTGTCTTG AACAGCCTGA CTCCTGCCAG CCCTATGGAA . GTTCCTTTTA TGCATTGGAG 2040 

GAAAAACATG TTGGCTTTTC TCTTGACGTG GGAGAAATTG AAAAGAAGGG GAAGGGGAAG 2100 

AAAAGAAGGG GAAGAAGATC AAMGAAGRAA AGAAGAAGGG GAAGAAAAGA AGGGGAAGAA 2160 

GATCAAAACC CACCATGCCC CAGGCTCAAC GGCGTGCTGA TGGAAGTGGA AGAGCSTGAA 2220 

GTCTTACAGG ACTCACTGGA TAGATGTTAT TCGACTCCGT CAATGTACTT TGAACTACCT 2280 

GACTCATTCC AGCACTACAG AAGTGTGTTT TACTCATTTG AGGAACAGCA CATCAGCTTC 2340 

GCCCTTTACG TGGACAATAG GTTTnTACT TTGACGGTGA CAAGTCTCCA CCTGGTGTTC 2400 

CAGATCGGAG TCATATTCCC ACAATAAGCA GCCCTTASTA AKCCGAGAGA TGTCATTCCT 2460 

GCAGGCAGGA CCIATAGGCA MGTGAAGATT TGAATGAAAG TACAGTTCCA TTTGGAAGCC 2520 

CAGACATAGG ATGGCTCAGT GGGCATGGCT CTATTCCTAT TCTCAAACCA TGCCAGTGGC 2580 

AACCTCTCCT CAGTCTGAAG ACAATGGACC CACGTTAGGT GTGACACGTT CACATAACTG 2640 

1GCAGCACAT GCCGQGAGTG ATCAGTCRGA CATTTTAATT TGAACCACGT ATCTCTQGGT 2700 

AGCTACAAAA TTCCTCAGGG ATTTCATTTT GCAGGCATGT CTCTGAGCTT CTATACCTQC 2760 

TCAAGGTCAK TGTCATCTTT GTGTTTAGCT CATCCAAAGG TGTTACCCTG GTTTCAATGA 2820 

ACCTAACCTC ATTCTTTGTG TCTTCAGTGT TGGCTTGTTT TAGCTGATCC ATCTGTAACA 2880 



wo 98/54963 



PCT/US98/11422 



287 



10 



15 



20 



25 



30 



35 



CAGGAGGGAT CCTTGGCTGA GGATTGTATT TCAGAACCAC CAACTGCTCT TGACAATTGT 2940 

TAACCCGCTA GRCTCCTTTG GTTAGAGAAG CCACAGTCCT TCAGCCTCCA ATTGGTGTCA 3000 

GTACTTAGGA AGACCACAGC TAGATQGACA AACAGCATTG GGAGGCCTTA GCCCTCCTCC 3060 

TCTCRATTCC ATCCTCTAGA GAACAGGAGT CAGGAGCCGC TGGCAGGAGA CAGCATGTCA 3120 
CCCAGGACTC TGCCGGTGCA GAATATGAAC AAYGCCATGT TCTTGCAGAA AACGCTTAGC . 3180 

CTCAGTTTCA TAGGAGGTAA TCACCA6ACA ACTGCAGAAT GTRGARCACT GAGCAGGACA 3240 

GCTGACCTGT CTCCTTCACA TAGTCCATRT CACCACAAAT CACACAACAA AAAGGAGARG 3300 

AGATATTTTC GGTTCAAAAA AAGTAAAAAG ATAATGTAGC TGCATTTCTT TAGTTATTTT 3360 

GARCCXrCAAA TATTTCCTCA TCTTTTTGTT GTTGTCATKG ATGGTGGTGA CATGGACTTG .3420 

TTTATAGAGG ACAGGTCAGC TGTCTGGCTC AGTGATCTAC ATTCTGAAGT TGTCTGAAAA 3480 

TGTCTTCATC ATTAAATTCA GCCTAAACGT TTTGCCGGGA ACACTGCAGA GACAATGCTG 3540 

TGiAGTTTCCA ACCTYAGCCC ATCTGCGQGC AGAGAAGGTC TAGTTTGTCC ATCASCATTA 3600 

TCATCATATC AGGACTGGTT ACTTGGTTAA GGAGQGGTCT AGGAGATCTG TCCCTTTTAG 3660 

AG&CACCTTA CTTATAATGA AGTATTTGGG AGGGTGGTTT TCAAAATTAG AAATGTCCTG 3720 

TATTCCRATC ATCATCCTCT AAACATTTTA TCATTTATTA ATCATCCCTG CCTGTGTCTA 3780 

TTATTATATT CATATCTCTA CGCTGGAAAC TTTCTGCCTC AATGTTTACT GTGCCrTTGT 3840 

TTTTGCTAGT GTGTGTTGTT GAAAAAAAAA ACATTCTCTG CCTGAGTTTT AATTTTTGTC 3900 

CAAAGTTATT TTAATCTATA CAATTAAAAG CTTTTGCCTA TCAAAAAAAA AAAAAAAAAA . 3960 

AAAAAAAAAA AAAAAGCGGA CXXX3TGGGC 3989 
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(•2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3735 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: do\Jible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
CTCCTGTTCG CTGGCTGGGC TCCGCAGCAG GCTTGGCCAG CSGCTGACGG GTOGGCGGGC 
GGGTTTGK5T GAACAGGCAC GCAGCTGCAG ATTTTATTCT GGTAGTGCAN CCCTCTCAAA 
GGTTCAAGGA ACTGATGTAA CAGGGATTGA AGAAGTAGTA ATTCCAAAAA AGAAAACTTG 
GGATAAAGTA GCCGTTCTTC AGGCACTTGC ATCCACAGTA AACAGGGATA CCACAGCTGT 
GCCTTATGTG TTTCAAGATG ATCCTTACCT TATQCCAGCA TCATCTTTGG AATCTCGTTC 
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ATTTTTACTG GCAAAGAAAT CCGGGGAGAA TGTGGCCAAG TTTATTATTA ATTCATACCC 360 

CAAATATTTT CAGAAGGACA TAGCTGAACC TCATATACCG TGTTTAATGC CTGAGTACTT 420 

5 

TGAACCTCAG ATCAAAGACA TAAGTGAAGC CGCCCTGAAG GAACGAATTG AGCTCAGAAA 480 

AGTCAAAGCC TCTGTGGACA TGTTTGATCA GCTTTTGCAA GCAGGAACCA CTGTGTCTCT 540 

10 TGAAACAACA AATAGTCTCT TCGATTTWIT GTGTTACTAT GGTGACCAGG AGCCCTCAAC 600 

TGATTACCAT TTTCAACAAA CKSGACAGTC AGAAGCATTG GAAGAGGAAA ATGATGAGAC 660 

AOCTAGGAGG AAAGCTGGTC ATCAGTTTQG AGTTACATGG CGAGCAAAAA ACAACGCTGA 720 

15 

GAGAATCTTT TCTCTAATGC CAGftGAAAAA TGAACATTCC TATTGCACAA TGATCCGAGG 780 

AATGGTGAAG CACCGAGCTT ATGAGCAGGC ATTAAACTT6 TACACTGAGT TACTAAACAA 840 

20 CAGACTCCAT GCTGATCTAT ACACATTTAA TGCATTGATT GAAGCAACAG TATGTGCGAT 900 

AAATGAGAAA TTTGAGGAAA AATQGAGTAA AATACTG6AG CTGCTAAGAC ACATGGTTGC 960 

ACAGAAGGTG AAACCAAATC TTCAGACTTT TAATACCATT CTGAAATGTC TCCGAAGATT 1020 



25 

TCATGTGTTT GCAAGATCGC CAGCCTTACA GGTTTTACGT GAAATGAAAG CCATTGGAAT 1080 

AGAACCCTCG CTTGCAACAT ATCACCATAT TATTCGCCTG TTTGATCAAC CTGGAGACCC 1140 

30 TTTAAAGAGA TCAJTCCTTCA TCATTTATGA TATAATGAAT GAATTAATQ6 GAAAGAGATT 1200 

TTCrcCAAAG GACCCGGATG ATGATAAGTT TTTTCAGTCA GCCATGAGCA TATGCTCATC 1260 

TCTCAGAGAT CTAGftACTTG CCTACCAAGT ACAOGGCX:'!^ TTAAAAACCX; GAGACAACTG 1320 

35 

GAAATTCATT QGACCTCATC AACATCGTAA TTTCTATTAT TCCAAGTTCT TCGATTTGAT 1380 

TTCTCTAATG GAACAAATTG ATGTTACCTT GAAGTGGTAT GAGGACCTGA TACCTTCAGC 1440 

40 CTACTTTCCC CACTCCCAAA CAATGATACA TCTTCTCCAA GCATTGGATG TGGCCAATCG 1500 

GCTAGAAGTG ATTCCTAAAA TTTQGAAAGA TAGTAAAGAA TATOGTCATA CTTTCCGCAG 1560 

TCACCTCAGA GAAGAGATCC TGATGCTCAT GGCAftGGGAC AAGCACXXAC CAGAGCTTCA 1620 

45 

GGTCQCATTT GCTCACTGTO CTGCTGATAT CAAATCTGCX3 TATGAAAGCC AACCCATCAG 1680 

ACAGACTCCT CAGGATTGGC CAGCCACCTC TCTCAACTGT ATAGCTATCC TCTTTTTAAG 1740 

50 . GGCTGGGAGA ACTCAGGAAG CCTGGAAAAT GTTGGGGCTT TTCAGGAAGC ATAATAAGAT 1800 

TCCTAGAACT GAGTTGCTCA ATGAGCTTAT QGACAGTGCA AAAGTGTCTA ACAGCCCTTC 1860 

CCAGGCCATT GAAGTAGTAG AGCTGGCAAG TGCCTTCAGC TTACCTATTT GTGAGGGCCT 1920 

55 

CACCCAGAGA GTAATGAGTG ATTTTGCAAT CAACCAGGAA CAAAAiSGAAG CCCTAAGTAA 1980 

TCTAACTOCA TTCACCAGTG ACAGTGATAC TGACAGCAGC AGTGACAGCG ACAOTGACAC 2040 

60 CACTGAAGGC AAA1GAAAGT GGAGATTCAG GAGCAGCAAT QGTCTCACCA TAGCTGCTGG 2100 



wo 98/54963 



PCTAJS98/11422 



289 

AATCACACCT GAGAACTGAG ATATACCAAT ATTTAACATT GTTACAAAGA AGAAAAGATA 2160 

CAGATTTCGT GAATTTGTTA CTGTGAGGTA CAGTCAGTAC ACAGCTGACT TATGTAGATT 2220 

5 

TAAGCTGCTA ATATGCTACT TAACCATCTA TTAATGCACC ATTAAAGGCT TAGCATTTAA 2280 

GTAGCAACAT 'K3CGGTTTTC AGACACATGG TGAQGTCCAT GGCTCTTGTC ATCAGGATAA 2340 

10 GCCTGCACAC CTAGAGTCTC GGTGAGCTGA CCTCACGATG CTGTCCTCGT GCGATTGCCC 2400 

TCTCCTGCTG CTGGACTTCT GCCTTTGTTG GCCTGATGTG CTGCTGTGAT GCTGGTCCTT 2460 

CATCTTAGGT GTTCATGCAG TTCTAACACA GTTGGGGTTG GGTCAATAGT TTCCCAATTT 2520 

15 . 

CAGGATATTT CGATOTCAGA AATAACGCAT CTTAGGAATG ACTAAACAAG ATAATGGCAG 2580 

TTTAGGCTGC ACAACTGGTA AAATGACTGT AGATAAATGT TGTAATTAGT GTACACGTTT 2640 

20 GTATTTTTGT TAATATAGCC GCTGCCATAG TTTTCTAACT TGAACAGCCA TGAATGTTTC 2700 

ATGTCTCCCT TTTTTTTTTG TCTATAGCTG TTACCTATTT TAGTGGTTGA AATGAGAGCT 2760 

ACTTGATGACA GAAGGATGTG GAATGTCTTC TTGACATCAT TGTGTATTGC TGGTAATCAA 2820 

25 

GTTGGTAACG ACTACTTCTA GCAGCTCTTA CCACTATGAC TTAAGTGGTC CTGGAAGGCA 2880 

GTAACTGGAG GTTTXSCAGCA TTCCTGCXTT CATGAGGGCT TCTACCACTG ACCACTTTGC 2940 

30 AOTACCTGG CTCCCAGATT TACTTAGGTA CCCCACGAGT CGTCCACATA AGCAGCTTCA 3000 

TCmACCTT GCXyWSAGTTG ACAATTATGG GATACTCTAG TCTACTTATA CTTGTGTTCC 3060 

CATC1GTCTO CCA'TCCTCTG AAGQCCAGGA CCCAGTCATA CATCCTTAGA AACCAAAGTA 3120 

35 

TGGTTTTTGT TTTCTCTTGG AATGTCAGGT CTTAAGGCAT TTAATTGAGG GACAAAAAAA 3180 

AAAAAAAGCC GATATAGTAG CTAGCTACTT AAGCATCCAT GGGTATTGCT CCATATCAAA 3240 

40 GCAGATTTGC AGGACAGAAA GAGTAAATTA GCCTTCAGTC ITGGTTTACA GCTTCCAAGG 3300 

AGAQCCTTQG CCACXTTGAAA TGTTAACTCG GTCXXTTCCT GTCTCTAGTT CATCAGCACC 3360 

TGCAGATGCC TGACTCTTGT TAGCCTTACT ATTCAATACA GTCCTTAGAT TCACGGTATG 3420 

45 

CCTCnCCTA TCCAGGCACC TATTCTGAAT CACCATGTTG CTCIGCAGCT AGAGTTGATA 3480 

GGAGAAAATC CATTTQGGTA GATGGCCTAT GAATTTGTAG TAGACTTTCA AAATGAGTGA 3540 

50 TT I G T T AGCT TGGTACTTTT AAGTTTGTGG TACAGATCCT CCAAACCXOT ACTCTGAGCA 3600 

ATTAACTCCC TTGAACATAG AGAAAATTAA GGCCTCACAG GATGAGTCTC CATTCTCTGT 3660 

AAATCCTTAT TTTATCATAG TCTTTAGCCN CTACTATGAG TAAAATGTTC TCTTCNGCCG 3720 

55 

GGT G TGGTGA CTCAC ^''35 
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(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1667 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEE^JESS : do\able 

(D) TOPOUXJY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

TAlCTAATTCA TTTAACTCCT CTTACATGAG TAGCGACAAT GAGTCAGATA TCGAAGATGA 60 

AGACTTAAAG TTAGAGCTGC GACGACTACG AGATAAACAT CTCAAAGAGA TTCAGGACCT 120 

15 GCAGAGTCGC CAGAAGCATG AAATTGAATC TTTGTATACC AAACTGGGCA AGGTGCCCCC 180 

TGCTGTTATT ATTCCCCCAG CTGCTCCCCT TTCAGGGAGA AGAOGACGAC CCACTAAAAG 240 

CAAAGGCAGC AAATCTAGTC GAAGCAGTTC CTTGGGGAAT AAAAGCCCCC AGCTTTCAGG 300 

TAACCTGTCT GGTCAGAGTO CAGCTTCAGT CTTGCACCCC CAGCAGACCC TCCACCCTCC 360 

TGGCAACATC CCAGAGTCCG GGCAGAATCA GCTC3TTACAG CCCCTTAAGC CATCTCCCTC 420 

25 CAGTGACAAC CTCTATTCAG CCTTCACCAG TGATGGTGCC ATTTCAGTAC CAAGCCTTTC 480 

TGCTCCAGGT CAAQGAACCA GCAGCACAAA CACTGTTGGG GCAACAGTGA. ACAGCCAAGC 540 

CGCCCAAGCT CAGCCTCCTC CCATGACGTC CAQCAGGAAG GGCACATTCA CAGATGACTT 600 

GCACAAGTTG GTAGACAATT GGGCCCGAGA TGCCATGAAT CTCTCAGGCA GGAGAGGAAG 660 

CAAAGGGCAC ATCAATTATG AGGGCCCTGG AATGGCAAGG AAGTTCTCTG CACCTGGGCA 720 

35 ACTGTGCATC TCCATCACCT CGAACCTGGG TGGCTCTGCC CCCATCTCTG CAGCATCAGC 780 

TACCICTCTA GGTCACTTCA CCAACTCTAT GTGCCCCCCA CAGCAGTATG GCTTTCCAQC 840 
TACCCCATTT GGCGCTCAAT GGAGTGQGAC GGGTGGCCCA GCACCACAGC CACTTGGCCA 



900 



GTTCCAACCT GTGGGAACTG CCTCCTTGCA GAATTTCAAC ATCAGCAATT TGCAGAAATC 960 



1020 



CATCAQCAAC CCCCCAGGCT CCAACCTGCG GACCACTTAG ACCTAGAGAC ATTAACTGAA 

45 TAGATCTCGG GGCAGGAGAT GGAATGCTGA GGGGGTGGGT GGGGGTGGGA AGTAGCCTAT 1080 

ATACTAACTA CTAGTGCTGC ATTTAACTGG TTATTTCTTG CCAGAGGGGA ATGTTTTTAA 1140 

TACTOCATO5 AGCCCTCAGA ATGGAGAGTC TCCCCCGCTC CAGTTATTGG AATGGGAGAG 1200 

GAAGGAAAGA ACAGCTTTTT TGTCAAQGGG CAGCTTCAGA CCATGCTTTC CTGTTTATCT 1260 

ATACTCAGTA ATGAGGATGA GGGCTAGGAA AGTCTTGTTC ATAAGGAAGC TGGAGAACTC 1320 

55 AATGTAAAAT CAAACCCATC TGTAATTTCG AGTGQGTGGA GCTCTTGCTT TTGGTACATG 1380 

CCCTGAATCC CTCACTCCCT CAAGAATCCG AACCACAGGA CAAAAACCAC CTACTGGGCT 1440 

cierccTACc cigccctcct cccnrnTT tacccctctc ttttttattt tttctttgct isoo 

60 
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CTTTAGAACC CAlOTGAAAAA TAOCAGQGTA CTC3GGGTGCA ACTCTTTCTT ATGATAGGTC 1560 
AITAGTGCTT TAAGCAAAAG ATATTAGCAG CTTTGACTGC AGCATTAGCA ATTAGGRAAA 1620 
5 AAAAAAANWA AAAACTCGAG GGGGGC3CCCG GTTACCCAAT TCGCCCT 1667 



10 (2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) UaKSTH: 1408 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

20 ATTACACACC TGAGCACTGT GCCTGGCAAG ACCTGTCTTA ATAGATTAGA GAACCACTGA 60 

TAGATGGTCA GCTTTCTCTA GCAGTGAGAA CCCTACATTT CAAATGTGGA TAGCACCTTT 120 

GCGGGGAAAC ATCACTTGGC ACATCTGCAT TCTTTTTTGA CACAGGGTCT CACTCTGTTG 180 

25 

CCCAQGCTAG AGTGCATGGC ACGATCTTAG CTCACTGCAA CCTCCACCTC CCAAGTTCAA 240 

GCGATTCTTC TGCCTCAGCC TCCTGAGCAG CTGGGATCAC AGACATGCGC TACCATGCCC 300 

30 AGCTAATTTT TTCTATTtTT TGTRTGTTTG TTTTTGTTTK TAAGTAGAGA CGGGCTTTCA 360 

CCACGTIGGS CAGGCAGC?rC TCGAACTCCT fGAMCTCAGGT GATCCACCCA CATCTGCGTT 420 

CCAATATCTT TCTCAACATA A1GATAGCCG TAATTAATAT TTTCCAGTAC ATTTTTATGC 480 

35 

CTTTACACAC GAGAGTCGTA GACAGACACA AACCCAGATC TGTCTGACTC CAAAGCCCGT 540 

TTCTCAICAT TCCITTTACG GTATCCTATA GTGGTATCCT TTACAGAAAG ACAGCTTTTA 600 

40 CCCAACAAAG ACTTAACTTC CCAQGATGCC AGAAQGACAA AGCGGGATTG CTTTTAAGRA 660 

GRAAGTTATC AAGAMCTTAT TTTATAAATG AGATTAGATA GGGAAAGGCA ATTTATCTTT 720 

ATTAAAAACT GAAAAGGCCA GCATAGGGAA GGAGGTCCTT CGGTGGTCTT TTTCAQQGAA 780 

45 

ATACrrCAGT TCCTTTTATT AGAAACAGAT AGTACCTAAG GTTTTGAGGT AGGWACAGCT 840 

TAAGGCA-rcC TAATCKTCAT GGGTCCTTCC ATAGTCATTT TKGTATTTTG GTTWACATTT 900 

50 GAGCAATAGG CAGCCCTTCA CTGCTGCTGG AYTCATTCCT GCCAYTATTA CAGGTGACAG 960 

AGGAGACAGG AGGTATGTCT TTTCTATTTT TAWACATGCT TTATATTTAA CACAAGCTCT 1020 

TCGGTATCTT AGATAAACAG AAGTTGCCTA GCACTCCTTT TAGTGCATTG AACCCTTTAA 1080 

CATTTAAGCA AAATAATAAA CAGTCTTTTG AGGTTCCTTA ACAATGAAAC GIGTTCGAGT 1140 

GGCAGCAGCG GAATCCATCC YTCTTdCCT GGAGTGTGCA AKAGTCCGTG GTCCPGAGTA 1200 

60 TCTCACACAG ATCTCGCATT TTATGTGTGA TGCTCTAATT AAGGCCATTG GTACAGAACC 1260 
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'.i) SZ^^UE^rCi raA?ACTERISTICS: 

(A) LZ2«rrr:: 2031 base pairs 
15 (3) TY?2: nucleic acid 

(C) STKANI^EDNESS: double- 

(D) TOPOLCGT/: linear 



20 



30 



40 



50 



1408 



>JGATTCAGAC GTCCTCXAG A.:^ATAA1GCA TTCTTrTGCA AAGGTGAATA TTTTTCICTr 1320 
AAAAAATATG TAC^iAGGTGG TATGTrCATT TATTAGTCTT GCTAAAAAAA AAAAAAAAAA 1380 

iJCrrTI>3GAGGG GG3C^:CC3GT ACCCAATT 

(2) INFCjC-GTICN FC?. SEQ ID NO: 32: 



60 
120 
180 
240 
300 



(xi) DESCRIPTION: SEQ ID NO: 32: 

AGGATATOCA TGATTCITAA CCAGGCTATA TGTTAAAAAA AAATTGGAAA ATGCAATACA 

irnrrATTA tacaaactac agaatgagta tgcaagtttt atttatcaaa atgtaatgga 

25 TmTAAAGG CTGAGAAATT TTCCTTATAC CTACCTTTTC AGTTATTTTA ATTATACCAA 
ATTATCAACT AGAATAGCTT CATCCATATG AAATATAAAA TGAAGAGACA CCTAGGCTCT 
ATCAGGCTTA GGATTCTTTG AACTTATTTC CACTTTAATT TCTCAGTGGA AG1TAAGAQG 
GGTGAGAAAJ^ CAAAGAAGGG GAAAAACTGA CAACTAACAA AACCAGCACC ACATCGCTAG 360 
STCGIGCITA CTAATTACCT TCTCAGGATT TTCCTCAGAT TGAAAAGCTT ATGAGGATTT 420 
35 CTrcGGA3TC TTAATAACCT GCCTGTTAGT ACAGAGCTtT CCTGATGATA TTTACTCTTG 480 
AGCACATCTG glTGTAAJ^ CTTAACTrTC TTTCTCCAGG AGQGTGGTGA TAGAAACAGA 540 
TGGTACTATT TATGAACTGA TCTTCTCCTG AAATGTTGAG GGTGGC5GAGA AAAGACTTTA 600 
AGGGAGa^JCSA GCCATCTATT TTGTTCCTAA AGCCACCTCT CAGCAGAATC GrCATGTTTT 660 
TCIX3ATGCAC CGCTCTCCTT CATGCCCAAG ATGACTTGCG AGGCAATCTC AGGAGCTGTG 720 
45 GACTTAACCR TTGCAAAGCA CACIGTCTTT CTCAGCGTrC TCTGCAAGTC AGTAGGTGTT 780 
AGTATGGTIG CAAAGWCAC TGICTCAGCA AAGTTOAACT GGGCTACCTC TCTACAQCTG 
TITCCTCAGA GGGAAAAATC TTCAGACCAG ATGGTGGAGC TCTGGAGTCA GAGGAAATGG 
GTOTCTTCAG CACAAftGCTC CTGCTTTTAC TTCAGCCACT TCTGACATTT TTACATACXXS 
AGCCTCAGAT TR1GTGATTA TCTCAAATCA AATCACTTTG ATGGAGATAA ATAATCAAAA 1020 
55 CTCTTITATA GTCATIGATT TCGTGAGAAC AGTAATGGAA AATGGTGTTG AAGGACTTCT 1080 
CATITITCGA.GCTITCCTIC CAGAGTC^ ^^^^ 
CCCAAAACCA TTATTACTGA TACTTGCACA CAGTCAAAAG CGCAGACTGG ATGGATGGTC 

60 



840 
900 
960 
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TTTTATAAGG CATTTAAGGG TACACTACTG TGTTTCACTG ACCATACATT TTTCTTAGCC 1260 

CCTCAAGTAA TATAGCACAG AGTTATGAAT GACAATTCCC CTAACCATTC CTCTTCATAT 1320 

CTGCCTCrrC CCCTTACCAT CGTAATTCTC CAAACTGGTC ATAAAGGCAC TCTGTGAAGA 1380 

TATTGGQGAC TGACATCTTA AGCTCTCACC TGGCTGCAGT AGGAAAGGCC AAACTGACGA 1440 
CAAAAAAAAA ATTCTTTATA AAGATGATAT GGTAACATGT ATCTTTGCCC TGGGTCIGGG . 1500 

TGGGTCCAGT CAGTCTCAGA TTTACAAGCA TTTAGGAGCC TAGGTAAAAG CTGCTAGTAT 1560 

TCTTTTAAAA GTTACATTTA TGACTTGCAA TGATAGAAAA CTCCTTCCAA TTAAATGGCA 1620 

15 TTTTATAATA TTATGTOTGT ACTTCACAGT GTTAAAAATA CCCTCATACX; TTATTGCATT 1680 

TGATCTTCAC AGAAAGIXSCA TTTTAACCAG TACTCTGGGT GCAATAAATA ATATGTAGAA 1740 

ATTTAAGTCC TCCAATTCCA GCATATCCAG TGAGTTTTGA CAGTGTGTTT ATGTGGAATG 1800 

20 

TTTAAGGATA TACAATTOTA CTTTATATAA ATTGGTTCTT GTTCTTCTTA AATGTGACAT 1860 

. GAAATAATTG TCCIGCTACA TTATACTGGA AATTAACAGG GGAAAAGGGA AGAGCTCTTG 1920 

25 GCTCCCTTGA GGTTCTGCTA GTGGTGTTAG GA GT G G TTAC AACTGAGCTT TTAGTAACCA 1980 

TTTAACCGTA TOTAAACTTG GTTTCTAATT AAAAAAAAAT TTCITITrCC A 2031 



30 



45 



(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 971 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEI)NESS: double 

(D) TOPOLOGY: linear 

40 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

CGCGTCGGAA CTCQGCCGCG GGACATCCAC GGGGCGCGAG TGACACGCGG GAGGGAGAGC 60 

AGICTICTGC TGGAGCCGAT GCCAAAAACC ATGCATTTCT TATTCAGATT CATTGTTTTC 120 

TTTTATCTCT GGGGCdTTT TACTGCTCAG AGACAAAAGA AAGAGGAGAG CACCGAAGAA 180 

GTCAAAATAG AAGTTTTGCA TCGTCCAGAA AACTC3CTCTA AGACAAGCAA GAAQGGAGAC 240 

50 CTACTAAATC CCCATTATGA CGGCTACCTG GCTAAAGACG GCTCGAAATT CTACTGCAGC 300 

CGGACACAAA ATCAAGGCCA CCCCAAATGG TTTGTTCTTG GTGITGGGCA AGTCATAAAA 360 

QGCCTAGACA TTGCTATGAC AGATATGTGC OCTGGAGAAA AGCGAAAAGT AGTTATACCC 420 

CCTTCATTTG CATACGGAAA GGAAGGCTAT GCAGAAGGCA AGATTCCACC GGATGCTACA 480 

TTGATTTTTG AGATTCAACT TTATGCTGTG ACCAAAQGAC CACGGAGCAT TGAGACATTT 540 

60 AAACAAATAG ACATCGACAA TGACAGGCAG CTCTCTAAAG CCGAGATAAA CCTCTACTTG 600 



55 
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CAAAGGGAAT TTGAAAAAGA TGAGAAGCCA CGTGACAAGT CATATCAGGA TGCAGTTTTA 660 

GAAGATATTT TTAAGAAGAA TCACCATGAT GGTGATGGCT TCATTTCTCC CAAGGAATAC 720 

AATCTATACC AACACGATCA ACTATAGCAT ATTTGTATTT CTACTTTTTT TTTTTAGCTA 780 

TTTACTOTAC TTTATGTATA AAACAAAGTC ACTTTTCTCC AAGTTGTATT TGCTATTTTT 840 

10 CCCCTATCAG AAGATATTTT GATCTCCCCA ATACATTCSAT TTTGGTATAA TAAATGTGAG 900 

GCTGTTTTGC AAACTTAT^ AAAAAWWAAA AAAACTSGAS GGGGGCCCGT ACCCAANTCG 960 

CCX3NATATGA T ^''^ 

15 



20 



30 



40 



50 



(2) INFOPMATION FOR SEQ ID NO: 34: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1792 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
25 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: " 

GAACCCCCTT TCTCCTGGTA AAGGGTAAGG GGGGGGATAA TGTTTACCAC AGGTACGAAA 60 

TAGTCACTTT AACATTGAGA CCTCTGCCTC ATTGAATTCA GGTTTTTTAA GTACTTGAAA 120 

CICTTCAGAT TCTCCTTATT TTAGTTTCTT TTTACATTTA TGAAGTAGAA AGCATTGTTT 180 

35 TGTAAACTCyr TTIGAAAATA AATAGCCTAG TCTCTTATCC TCTTTAGCGT GGATTAAAGG 240 

TGAAGTTCTC CAAATGGGAG AGTGTTCACA GTAGATAGCT CAGATTGATT GAACACATTT 300 

GAGGAAGAGA CTCCTGCATG AGATACCAGC ATTTTTACAA ATACTTTTTA TGTACATTCT 360 

TTATTTTCTC ATTTTGTCAA CCCTCTCCCC AAGCACATCT TCTTTCCTTT TACTATQTCT 420 

ATCTAGGGAA AAACAAAACA AAAAATTQCA CTTACGTrAC ACTCCCAAAA TGTGGGTAAT 480 

45 CCGTOTCTTT CAAAAAACAT TTCTGTTTTT TGTTTTGrTTT TGGTCAGTCC ATTGCATAAG 540 

TGACAACTTT GQGTGCTrGT GGCACGTATG TATGAAGCGG GAGGGQGATG ASAATTGCCT 600 
GTCCTTCAGT ARGCTGTAAA AGTAATTTAC ATGTAAGTAA AAAGGGAAAA TAGAATAGAT . 660 

GCCAAAGTCA TTTATTCAGT CCTTAGTTTT CTTATGTGGC ATTACTGCAT CTGCTAGTTA 720 

CyiGAGAAAGC ACCCTCAGCT TTTACTGCTC CCCTCCCTGC CTGCCAACAC ACTTGATGTG 780 

55 TGCAAACAGC CCTCAAGTAT CTGTCAGATG ACCTATATAA GGTATTGAAT AAGGTATTCT 840 
TOTCAGTITA GAAATGGACT GGATAAAACT TACTTGGTTG TCATTATTIT ATCTCATTTG 



900 



60 



TCCTOTTACA TGCCCTATGT TAAGATAATT ATATTGCCAC TAATAATCAA GATGCTAAAT 960 
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GAGTATTACA ACTGGCTAAT ATCATTTTTT ATATACAAGG GTATGTGTAT ATTTGGAATT 1020 

GRTATCAGAA ACTCATTTGT ACXXMTTGA GTGATATTGC ACAACAAACA CAGATAYCTA 1080 

5 CAGACrcCGT TTTGATTTTC 'TCGTGTTCTT TATGATAATG ATCTTTGTAG ATTGGTTATT 1140 

TCTGTACTTT ATCTGTAATA AACTTTGTAG ATCCTGTGAA CCATTACTTT GCCTAAATCA 1200 

CTTGAGACTT GAGTCTTTAA TAACAAAGCA TCAATATTCA CTAAAGTCAA TCTCTTTTGA - 1260 

10 

GTTTCTGTGA CTTGGCTAGA AGCTCTTGAC ACTAAGGGAT TAGTGTTAAT TTTCCCTGGG 1320 

GGTGTTCCAC TAGGGCATTA OTGTATAATG ACTTGATGTT GCCACATAGA CTTCAAGATA 1380 

15 TATAATATTT TGAGGATTTT GTTGATTGGC CTATGTTTTA TTGCATAGTG TGAAACGTGT 1440 

AAAGCTTCGT TAACCTGTAT ATAGATAGCT TATTGTTGAC TAGTTATAGT GTATTTAGGG 1500 

TTGCCTGTAA TATTTAAGCT TCTTTACTGA TGTGTGTGCT GGTAGGAACA TATAATTTTT 1560 

20 

GTACATTATA TTTACTGAGA TGTTGCXrrTT TTTATTTTAC AAATACITTG GAATTCCAAT 1620 ^ 
GTCTTTTTTG CTTCCGTGAG GATTAATTTG GAAAGGTTTT TAAT6ACATT CCACTGATTT . 1680 

25 CAGATTTTGC TTGAGATTGA CTTCAATAAA TTGTCCTGTA TGTTCCAAAA AAAAATTAAA 1740 

AAACTCGAC5G GGGGCCCX5GT ACXXAANNCG 1792 

30 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 
35 <A) LENGTH: 896 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

40 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO; 35: 

AGTTGNftNAC AACAG6ACCT 6AGTCCTTGG GCAGCACCAG TAGGTTGCCC CYTGCYTCVT 
GCCAGCYTCA CYTGCCACYT TYTGCCCCTY TCGGGATGCC TTCGCAGACA GAGYTYTTCG 

45 

CTGCCTCTGG TQGCCAVTCT TTGCTTTTGG TTYTCTTGCC CCTTQGCCTC CCTTTTTGTC 
CCCGGGCAGC CTTGTGTGAC CTGOCCTTTT CCCTCCCTTC CTTTCCAGGA CAAGCACGCC 
50 GAGGAGGTGC GGAAAAACAA GGAGCTGAAG GAAGAGGCCT CCAGGTAAAG CCTAGAGGCC 
AAAGAACTTT CCAGGTCAGC CGGACAGCTC CAGCAGCTCC ACGTTCCAGG CAGCCTCGMC 
CGCCGGCTCC GCTCCCAGCA CTGGGGTTTC GGGGGAGQGG GGTGGCCAAG GGGOGTTTCC 

55 

TCTGCTTTTG GTGTrTGTAC ATGTTAAGAA rTGACCAGTG AAGCCATCCT ATTrGTTTCC' 
GGGGAACAAT GAOGGGGTGG GARAGGGGAG AGGAGAGAGT TTGGGAAAGG GAGATGGAOA 
60 AGAACTCAAG GACATTGCAA COCTGCCCGG CGCAGATCTG ATTTTCACAT CTCTACCTGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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ACATTGAGCC TCCCAGGCAC CATGTTGAGG AGAGATGAAA ACCAGGGCGG TAGAACTTCA 660 

GGOTGAAGGA CAGAGTCCTG GGTGGGGCAG CGGCTGCAGG GCGCACCAGA GAACXTCAGCX: 720 

5 

AGAGGGGGTC TCAGTACCAG TQGTGTTGCT TCCACCCTGC AGCAGGTGGG ATGftGGTCTG 780 

TOrGTOIGTC TGAACCATCA TTTTTTGATC ATCATGACXA ATGAAACATT GAAAAAAAAA 840 

10 AAAAAAACTC GAGGGGGGCC CX3TACCCAAN TCGCCOIATA GTGATCGTAA ACAATC 896 

15 (2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 912 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

25 TCGACCCACG CGTCCGGTCA GCCAGTCGCA TCCAGCCATG ACAGCCTTCT GCTCCCTGCT 60 

CCTGCAAGCG CAGAGCCTCC TACCCAGGAC CATGGCAGCC CCCCAGGACA GCCTCAGACC 120 

AQGGGAGGAA GACGAAGGGA TGCAGCTGCT ACAGACAAAG GACTCCATGG CCAAGGGAGC 180 

30 

TAGGCCCGGG GCCAKCCGCG GCAGGGCTCG CTGGGGTCTG GCCTACACGC TGCTGCACAA 240 

CCCAACCCTG CAGGTCTTCC GCAAGACGGC CCTGTTGGGT GCCAATGGTG CCCAGCCCTG 300 

35 ARGGCAGGGA AKGTCAACCC ACCTGCCCAT CTGTGCTGAG GCATGTTCCT GCCTACCATC 360 

CTCCTCCCTC CCCGGCTCTC CTCOCAGCAT CACACCAGCC ATGCAGCCAG CAQGTCCTCC 420 
GGATCACYGT GGTTKGGTGG AGGTCTGTCT GCACTGGGAG CCTCARGARG GCTCTGCTCC . 480 

40 

^ACCCACTTGG CTATGGGAGA GCCAGCAGQG GTTCTGGAGA AAAAAACTGG TGQGTTAGGG 540 

CCTTGGTCCA GGAGCCAGTT GAGCCAGGGC AGCCACATCC AGGCGTCTCC CTACCCTQGC 600 

45 TCTGCCATCA GCCTTGAAGG GCCTCGATGA AGCCTTCTCT GGAACCACTC CAGCCCAGCT 660 

CCACCTCAGC CTTGGCCTTC ACGCTGTGGA AGCAGCCAAG GCACTTCCTC ACCCCYTCAG 720 

CGCCACGGAC CTYTYTGQQG AGTGGCCGGA AAGCTCCCSG GCCTYTGGCC TGCAGGGCAG 780 

50 

CCCAAGTCAT GACTCAGACC AGQTCCCACA CTGAGCTGCC CACACTCGAG AGCCAGATAT 840 

TTTTGTAGTT TTTATKCCTT TGGCTATTAT GAAAGAQGTT AGTGTGTTCC CTGCAATAAA 900 

55 CTityrrccTG AG 

60 (2) INFORMATION FOR SEQ ID NO: 37.- 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LEtTCTH: 1382 base pairs 
{B) TYPE: nucleic acid 
5 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUEECE DESCRIPTION: SEQ ID NO: 37: 

10 AATTCGGCAC GAGCGGAGGC GAGGGAAACT RAGGGCGAAA GTTGTGTGTC GTGTTGGCAG 60 

GAGGGCCTAG AAGGGAAAGA CTCTCTAGTG GGACAATGTC ATATTATAAA TTTGGAATGC 120 

TGAATAGAAA ATTATAGATT TTGATATTGA AGGAAATGAA GCGAAGCYTA AATGAAAATT 180 

CAGCTCGAAG TACAGCAGGC TGTTTGCCTG TTCCGTTGTT CAATCAGAAA AAGAGGAACA 240 

GACAGCCATT AACTTCTAAT CCACTTAAAG ATGATTCAGG TATCAGTACC CCTICTGACA 300 

20 ATTATOATTT TCCTCCTCTA CCTACAGATT GGGCCTGGGA AGCTGTGAAT CCAGAGTTia; 360 

CrcCTGTAAT GAAAACAGTG GACACCQGGC AAATACCACA TrCAGTTTCT CGTCCTCTGA 420 

GAAGTCAAGA TTCTGrCTTT AACTCTATTC AATCAAATAC TGGAAGAAGC CAGGGTGGTT 480 

GGAGCTACAG AGATGGTAAC AAAAATACCA GCTTGAAAAC TTGGRATAAA AATGATTTTA 540 

AQCCTCAATG TAAACGAACA AACTTAGTGG CAAATGATGG AAAAAATTCT TGTCCAATGA 600 

30 GTTOQGGAGC TCAACAACAA AAACAATTAA GAACACCTGA ACCTCCTAAC TTATCTCGCA 660 

ACAAAGAAAC CGAGCTACTC AGACAAACAC ATTCATCAAA AATATCTGGC TGCACAATGA 720 

GAGGGCTAGA CAAAAACAGT GCACTACAGA CACTTAAGCC CAATTTTCAA CAAAATCAAT 780 

ATAAGANACA AATGTTGGAT GATATTCCAG AAGACAACAC CCTGAAQGAA ACCTCATTGT 840 

ATCAGTTACA GTTTAAGGAA AAAGCTAGTT CTTTAAGAAT TATTTCTGCA GTTATTGAAA 900 

40 GCATGAACTA TTGGCGTGAA CATGCACAGA AAACTGTACT TCTTTTTGAA GTATTAGCTG 960 

TTCTTCATTC AGCTCTTACA CCTQGCCCAT ATTATTCGAA GACTTTTCTT ATGAGQGATG 1020 

GGAAAAATAC TCTGCCITOT GTCTTTTATG AAATCGATCG TGAACTTCCG AGACTGATTA 1080 

GAGGCCGAGT TCATAGATCT GTTGGCAACT ATGACCAGAA AAAGAACATT TTCCAATGTG 1140 

TTTCTOTCAG ACCGGOGTCT GTTTCTGAGC AAAAAACTTT CCAGGCATTT GTCAAAATTG 1200 

50 CAGATOITCA GATGCAGTAT TATATTAATG TGATGAATGA AACTTAAGTA GTGATAAAAG 1260 

GAAGTTTAGC ATAAATTATA GCAGTTTTCT GTTATTGCTT AATTTACCAT CTCCATAGTT 1320 

TTATAGCTAC TATTOTATTT CACTrGTTGA ATTAAAGTAT TTGAATTCTT TTAAAAAAAA 1380 
AA 



35 



45 



55 



60 
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(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENC3TH: 872 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GGGCTACTTC AAAGCCCTOG GOCTTATTTC TTCAGGTAAA AAAATATAAA GTCAGATCTC 
ATCCCGC3CTG GCCATGCTGT TAGACCCTTT CATCCTTCTC TTCTGCCTCT TCTCAACAGC 
TGCCCAGTCC TOTTTGGAAT TCATATACAT ACAGTTCTAA TACTGATGTA TTTACCCTCA 
TAAGCCACTC AACCCAGAAT CTTATTTGAA TTATAATCCA GAAACATCAG GTGACGTGTG 
AGACTACTGT ATGAGAAAGA GACAGTTTAA GGGTCAGTCC AATGGAAAAA AGAGTTCTCA 
GAGCnTdT TAGCTTATTC TCATCAAAGA GCTTTCTCTG CAGAAGGAAC CTACTGGTTC 
CTCCTTTCCA GTCCTAGAAA TCCTGACCTA GAGPTGGCTTA ATCCTGCTAG CACCTCTCTC 
TCGCACTCTG GTGCCAAATG ACTCCAGGAA CTGGGCCATG ATGTGGTGGG AATGACCTTA 
CCCTGAGCAT GTCACTCATG CATTGAACAA CAGCTAAGAG CAGAGCTTAG AGCTTAGAGC 
TGGGCCCTCT AAGGTCAGAG GAATCACATC CTGCAGAAGT CTGTCCTGAG AAGCAGGTAC 
TCCTGTCACA GCAGAGACAC AGTGGATACC TGAGTAACAA TAATACAAGA , GAGGACGTGG 
GMACAGCAAA AGATTTGGGT GTCAGAAGAR GCCGAGAACA CTTYCAGGCA GGAACATTCA 
RARTTCTTCT TGGAGGAART AGGCMCSAAG GCTQGGCAGG ATTTCMCGGG GCAGAGATQG 
AGCAAGCAAT TCAAATGAAA GCCATGGCAT GGGAAAAGGA GCACTGGCCA CAGGGAGTQC 
AACGTTGT6A TGCAAGGCCA CTGTGGAGCC AT 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: dotable 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
GGCAGAGGCT CACCCCAGCA GAGATTGAGG GGGAACOGTG ATGAAATTTT TAAGTATTCT 
GCPTQATCAT AATAATTTTY CTCTTATGTT AATGTTGGCT CCGTTTGGGrT GTTTAGCTTT 
TGAAAGGAGT ATCAAAATGC GGAATGQGGC TTTGGGQCTT GAGGAGGTGT GATCTCTAGT 
CTTTAAAAAA TTTAATTGCA CAAATAGAAA TAATTCACCC ACATTATTGA ACCCCACTAA 
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AGCATATCCT TTTTOTCCAT ATTCCTTTCC TGCTGCCCTC GTGTGTACCA TTATTACTCA 300 

GrrcTGATTT GAGCTCC?rTC CACTTAAAGT CATTCATAGA TACTTTTGCG TCGrTGTTKGA 360 

5 

ATATTTATTG AATTTCTATT CTGTGTTTTA CTTAATTACT ITATTATCGA ACCTTTACAC 420 

AGGTCTGGTG TACTTGTTCT TTGAAAAGTC TTATGTTGAC CACCATCACT GAGCATATAG 480 * 

10 crrrrrccTT atttccttcg gataattacc cgaagtggaa ataccgaatc aaacttctgt 540 

'm X.T ri 'CTT TGGCACTATT ATATAAATTG TTTTCCAAAC AAGGCATGTT TACAATAGAC 600 

ATTTTTCAAA ATCTOGGTAT TTGTCCTATT TTGCTCTCTC TATGCAGAAT TCAGCGGQGT 660 

GCCAAGTCGT TTTCTCTGTG GGTTGAGAGA CAQGCTGTGC AGCCCACTGT TGCATAQGAC 720 

TAACTACTAC AAATCATGCT GAGACCGAGC TATTTTTGCT GCTTAGARGC TTTGCAGCCT 780 

20 TGAGTAAGTT T06NCATCTG GAAACNTTGN AA 812 

25 (2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1515 base pairs 

(B) TYPE: nucleic acid 
.30 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

35 AATTCGGCAC GAGGGAAATT CAAGCACTTT TCCTAAAAGA AGGGGGAATG GATGCTGAAA 60 

CAACACGTMT CCCACAAAGG GAGCAGACAC TGGGCTTGTG AAGCTGCCCC ATACCTTCCC 120 

CACAGAACTC GGGTCCGGCC TCCCTGACAT GCAGATTTCC ACCCAGAAGA CAGAGAAGGA 180 

GCCAGTGGTC ATGGAATGGG CTGGGGTCAA AGACTGGGTG CCTGGGAGCT GAGGCAGCCA 240 

CCCTTTCAGC CTGGCCAGCC CTCTGGACCC CGAGGTTGGA CCCTACTGTG ACACACCTAC 300 

45 CATGCG6ACA CTCTTCAACC TCCTCTGGCT TGCCCTGGCC TGCAGCCCTG TTCACACTAC 360 

CCIX?rcAAAG TCAGATGCCA AAAAAGCCGC CTCAAAGACG CTGCTGGAGA AGAGTCW3TT 420 

TTCAGATAAG CCGGTCCAAG ACCGGGGTTT GGTGGTGACG GACCTCAAAG CTGAGAGTGT 480 

GGTTCTTCAG CATCGCAGCT ACTGCTOGGC AAAGGCCCGG GACAGACACT TTGCTGGQGA 540 

TC3TACTOGGC TATGTCACTC CATGGAACAG CCATGGCTAC GATGTCACCA AGGTCTTTGG 600 

55 GAGCAAGTTC ACACAGATCT CACCCGTCTG GCTGCAGCTG AAGAGACGTG GCCGTGftGAT 660 

GTTTCAGGTC AOGGGCCTCC AOGACGTGGA CCAAGGGTGG ATGCGAGCTG TCAQGAAGCA 720 

TCCCAAGGGC CTOCACATAG TGCCTCQGCT CCTGTTTGAG GACTGGACTT ACGATGATTP 780 

60 



40 



50 
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CCX3GAACGTC TTAGACAGTG AGGATGAGAT AGAGGAGCTG AGCAAGACCG TGGTCCAGGT 840 

GGCAAAGAAC CAGCATTTCG ATGGCTTCGT GGTGGAGGTC TGGAACCAGC TGCTAAGCCA 900 

GAAGCGCGTG ACCGACCAGC TGGGCATGTT CACGCACAAG GAGTTTGAGC AGCTGGCCCC 960 

CX3TGCTOGAT GGTTTCAGCC TCATGACCTA CGACTACTCT ACAGCGCATC AGCCTGGCCC 1020 
TAATGCACCC CTGTCCTCGG TTCGAGCCTG CXSTCCAGGTC CTGGACCCGA AGTCCAAGTG ' 1080 

GCGAAGCAAA ATCCTCCTOG GGCTCAACTT CTATGGTATG GACTACGCGA CCTCCAAGGA 1140 

TGCCCGTCAG CCTOTTGTCG GGGCCAGGTA CATCCAGACA CTGAAGGACC ACAGGCCCCG 1200 

15 GATGGTGTGG GACAGCCAGG YCTCAGAGCA CTTCTTCGAG TACAAGAAGA GCCGCAGTGG 1260 

. GAGGCACGTC GTCTTCTACC GAACCCTGAA GTCCCTGCAG GTGCGGCTGG AGCTGGCCCG 1320 

GGAGCTGGGC GTTGGGGTCT CTATCTGGGA GCTGGGCCAG GGCCTQGACT ACTTCTACGA 1380 

20 

CCTCCTCTAG GTGGGCATTG CGGCXTTCCGC QGTGGACGTG TTCTTTTCTA AGCCATGGAG 1440 

TGAGTCAGCA GGTGTGAAAT ACAGGCCTTC ACTCCGTTAA AAAAAAAAAA AAAAAAAAAA 1500 

25 AAAAAAAAAA AAAAA 1515 



30 (2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 704 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEMESS: double 

(D) TOPOUX3Y: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

40 AAGATGGTGG CGCCCAGAGC TTCGCTCTAT GCTGCTCCCC TGAGAGAGGC GTTTCCATCA 60 

ACCAGTTTTG CAAGGAGTTC AATGAGAGGA CAAAGGACAT CAAGGAAGGC ATTCCTCTGC 120 

CTACCAAGAT TTTAGTGAAG CCTGACAGGA CATTTGAAAT TAAGATTGGA CAGCCCACTG 180 

45 

TTTCCTACTT CCTGAAQGCA GCAGCTGGGA TTGAAAAGGG QGCCCQGCAA ACAGGGAAAG 240 

AGGTGGCAGG CCTGGTGACC TTGAAGCATG TGTATGAGAT TGCCCGCATC AAAGCTCAGG 300 

50 ATGAGGCATT TGCCCTGCAG GATGTACCCC TGTCGTCTGT TGTCCGCTCC ATCATCGGGT 360 

CTGCCCGTTC TCTQGGCATT CGCGTGGTGA AGGACCTCAG TTCASftAGAG CTTGCAGCTT 420 

TCCAGAAGGA ACGAGCCATC TTCCTGGCTG CTCAGAftGGA GGCAGATTTG GCTGCCCAAG 480 

55 

AAGAAGCTGC CAAGAAGTGA CCCTTGCCCC ACCAACTCCC AGATTTCAAA GGAGGTAGTT 540 

GCAAAAGCTG TGCCCAAGGG GAGGAAGGAG GTCACAOCAA TATGATGATG GTTTrCATGA 600 

60 CTTTGAATGA TATATTTTTG TACATCTAGC TGTATCGAGG CATCAGGCCT GAATAAACAT 660 
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CCTTTCTTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAA 704 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SBQUE3SK:E CHARACTERISTICS: 
10 (A) LENGTH: 1094 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

GGCAGCTTTC TTACAAACCC ATCCTTCTGA AATGTTGCTT CAAATTCATC CTCTGCTCCC 60 

CAGTOCCACT ATTCCACACA TACTGTTACT GTTTCTTTAT CCTACTTTCT CAATTTTGGA 120 

20 

ACATAGTTGC AGTTACTGCA TTGAATACCT GTGGGTTTGC CTGTTGTTCT GTCTGTCTCT 180 

GTGGTTCTTG TAATANTGGA TCCCAGAGAT AAAATGGACA GTTGTNATGC ACAGTTAATT 240 

25 CAGAAACTAG ACCTTAeTTG CTGTGTGAAA TACCAACTAA ATTCTCAGTG AACTCAGCTG 300 

ANCTTTATCT CCTTTTGTTT CCCCAATTTA TAATTTCAGT TCAGGCCCAG AAAGATGGAA 360 

TCCCAGCTAA GAAATACAAG TTACACCCTG TACTAGCAGC CCATGTGTGC ATGTTCTTTA 420 

30 

AGTGCTCTTG CAGCTATGTC ATTTATATTG ATTTCCCTGT ATTATTATAA GCAAAGCAAA 480 

TTTGAQGAAA AAAACCCATA ATACCACACC TCATTTTTTT CAAGTAATAG GGTCATAAGT 540 

35 CTCATYCTYC ATATAATATG TTGAGTATGC AGTATATTAT GTGTTAGGCT CTGGANAGGC 600 

AGAGGTTAGA TCATGTWACA GATQ^TATCK GATTAGGCAG ATAAACAGTA TTTTAACCTT 660 

TTCCTTATTA TATGTAACTT GCTTTCAGGT TTTTTAATGT TACTATTATG TCTTTAATAT 720 

40 

ATTATCTTTA TTTGTACTTT TGTATACAGA GTGATTTTCC TTTTTTAAAA AAAATTGTGT 780 

CTTTAGGATG GATTCCAAAG ATGTGGAATC AGTAGGTTTA AGGAATATGG ATATTTTGGC 840 

45 TGGCAAQGTG GCTCACACCT GTAATCCCAG CACTTTGGGA GGCTGAGGTG GGTGGATCAC 900 

CTGAAGTCAG GAGTTCGAGA CCAGCCTGAC CAACATQGCG AAACCCTGTT TMTACTAAAG 960 

ACACACWWAA AATTRGCCAG TGGTQGTGGC ATGTGCTTGT AGTCCCACTT AGCTACTCGA 1020 

GAGGCTGAGG CAGGAGAATC GCTTGAACCC GGGAGOCAGA GGTTGCAGT6 AGGCAAGATG 1080 

GCACCTCTAC ACTC 1094 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 43: 
60 (i) SEQUENCE CHARACTERISTICS: 
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•A) LZrJSTH: 1321 base pairs 
'3) TYri: nucleic acid 
;C) ST?J^^1DEI»5HSS : double 
:d) TCPCLOGY: linear 

5 

'xi) SSQUSXIS r2SCRI?rX0N: SEQ ID NO: 43: 

TCGCTTAGGC CArc;i.CCCTT CCCTTGGCTG GAACTACTGG ACAGACCCTT TTGAGATGTG 60 

10 CCTGTGGCGC TGTSGPjGATG TGTGTAGTGG TCTTAGCTCT TTGTTGAGCT TGTGTGTGTG 120 

ITOTCTAGTC r?A3CTGTAT GCTGAAATTG GGCGTGTGTT GGAGGGCTTC TTAGCTCTTT 180 

GGTGAGATTG rATTTCTATG rGTTrGTATC ASCTGAATGT TGCTGGAAAT AAAACCTTQG 240 
15 • ' 

TTTGIMPiAGG CTCiTTTTTG TGGGAAGTAA GTAlGGGGAAA AGGTCTTTGA GGGTTCCTAG 300 

GCTCCTTTGT ACA.-rAGGAA AATGCCTCAA 'AOrCTTGCTT CCCAGCAACC TGGGGCTGC3T 360 

20 TCCCAGTGCC TGG7CCTGCC CCTTCCTGGT TCTTATCTCA AGGCAGAGCT TCTGAATTTC 420 

AGGCCTTCAT TCC.-j3AGCCC TCTTGTGGCC AGGCCTTCCT TTGCTGGAGG AAGGTACACA 480 

GGGCGAAGCT GATGCTGTAC TTGGGGGATC TCCTTGGCCT GTTCCACCAA GTGAGAGAAG 540 

25 

GTACITACTC TTSTACCTCC TGTTCAGCCA GGTGCATTAA CAGACCTCCC TACAGCTGTA 600 

GGAACTACrC TCCCAGAGCT GPdGGCAAGGG GATTTCTCAG GTCATTTGGA GAACAAGTGC 660 

30 TTTA3TAErA OTITAAAGTA GTAACTC3CTA CTGTATTTAG TGGGGTGGAA TTCAGAAGAA 720 

ATTTGAAGAC CAGXTCATGG CJTGGTCTGCA TGTGAATGAA CAGGAATGAG CCGGACAGCC 780 

TOGCTGTCAT TGCTTTCTTC CTCCCCATTT GGACCCTTCT CTGCCCTTAC ATTTTTGTTT 840 

35 

CTCa^TCTAC CACCATCCAC CAGTCTATTT ATTAACTTAG CAAGAGGACA AGTAAAGGGC 900 

CCTCTTGGCT TGATTTTGCT TCTTTCTTTC TGTGGAGGAT . AT ACT AAGTG CGACTTTGCC 960 

40 CTA:rCCTATT TGG?JWVrCCC TAAjCAGAATT GAGTTTTCTA TTAAGGATCC AAAAAGAAAA 1020 

ACAAAATGCr AATGAAGCCA TCAGTCAAGG GTCACATGCC AATAAACAAT AAATTTTCCA 1080 

GAAGAAA7GA AATCCAACTA GACAAATAAA GTAGAGCTTA TGAAATGGTT CAGTAAGGAT 1140 

45 

GASTTTCTTG •rnvriVITT TGTITTGTTT TGKTTTTTTA AAGACGGAGT CTCGCTCTGT 1200 

CACTCAGGCT GGAGTGCAGT GGTATGATCT TGGCTCACTG TAACCTCCGC CTCCCGGGTT 1260 

50 CAAGCCATTC TCCIXSCCTCA GTCTCCTGAG TAGCTGGGAT TACAGGTGCG TGCCACCATG 1320 

CCIGGCTAAT TTrrGTGTTT TTAGTAGAGA CAGGGTTTCA CCATGTTGGT CGGGCTGGTC 1380 

TCAAACTCCT GACCTCTTGA TCCGCCTGCC TTGGCCTCCC AAAGTGATGG GATTACAGAT 1440 

55 

GTGAGCC?.CC CGTGCCCTAG CCAAGGATGA GATTTTTAAA GTATGTITCA GTTCrGTGTC 1500 

ATGGTTOGAA GAC^GAGTAG GAAGGATATG GAAAAGGTCA TGGGGAAGCA GAGGTGATTC 1560 

60 ATGGCTCTGT GAATTTGAGG TGAATGGTTC CTTATTGTCT AQGCCACTTG TGAAGAATAT 1620 
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GAGTCAGTTA TTGCCAGCCT TGGAATTTAC TTCTCTAGCT TACAATGGAC CTTTTGAACT 1680 

GGAAAACACC TTGTCTGCAT TCACTTTAAA ATGTCAAAAC TAATTTTTAT AATAAATGTT 1740 

5 

TATTTTCACA TTCAAAAAAA AAAAAAATTT AAAAACYCGG GQGGGGCCCS GriACCCCATT 1800 

NGCCCCTAAG GGGGGGGGTT T 1.821 

10 



(2) INFORMATION FOR SEQ ID NO: 44: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1024 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS: double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
GGGGCACAGT TGAAGAAGCG ACCGAGGGAC TGGGAGTCGT TAGTGAGGAT GACGCQGCAT 60 
25 GGCAAGAACT GCACCGCAGG GCCGTCTACA CCTACCACGA GAAGAAGAAG GACACAGCGG 120 
CCTCGGGCTA TGGGACCCAG AACATTCGAC TGAGCCGGGA TGCCGTGAAG GACTTCGACT 180 
GCTGTTGTCT CTCCCTGCAG CCTTGCCACG ATCCTGTTGT CACCCCAGAT GGCTACCTGT 240 

30 

ATGAGCGTGA GGCCATCCTG GAGTACATTC TGCACCAGAA GAAGGAGATT GCCCGGCAGA 300 
'TCAAQGCCTA CGAGAAGCAG CGQGGCACCC GGCGCGAGGA GCAGAAGGAG CTTCAGCGGG 360 
35 CGGCCTCGCA GGACCATGTC CGGQGCTTCC TGGAGAAGGA GTCGGCTATC GTGAGCOGGC 420 
CCCTCAACCC TTTCACAGCC AAGGCCCTCT CGGGCACCAG OCCAGATGAT GTCCAACCTG 480 
GGCCCAGTGT GGGTCCTCCA AGTAAGGACA AGGACAAAGT GCTGCCCAGC TTCTQGATCC 540 

40 

CGTCGCTGAC GCCCGAAGCC AAGGCCACCA AGCTGGAGAA GCCGTCCCGC ACGGTGACCT 600 
GCCCCATOTC AGGGAAGCCC CTGCGCATGT CGGACCTGAC GCCCGTGCAC TTCACACCGC 660 
45 TAGACAGCTC CGTGGACCGC GTGGGGCTCA TCACCCGCAG CGAGCGCTAC GTGTGTGCCG 720 
TGACCCGCGA CAGCCTGAGC AACQCCACCC CCTQCGCTGT QCTGCGGCCC TCTGGGGCTG 780 
TQGTCACCCT CGAATCCGTG GAGAAGCTGA TTCGGAAGGA CATGGTQGAC CCTGTGACTG 840 

50 

GAGACAAACT CACAGACCGC GACATCATCG TGCTGCAGCG QGGCGGTACC GSTTCGCGGG 900 
CTCCGGAGTG AAGCTGCAAG CGGAGAAATC ACGGCCGGTG ATGCAGGCCT GAGTGTGTGC 960 
55 GQGAG/^CAA ATAAACCGGC TTGGGTGCGC AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1020 
AAAA 1024 



60 
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(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 983 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

10 (xi) SEQUENCE DESCRIPTICMJ: SEQ ID NO: 45: 

CGACACGGCT GCGAGAAGAC GACAGAAGGG CCCGACCGCG AGCCGTCCAG GTCTCAGTGC 60 

TGTGCCCCCC CCAGAGCCTA GAGGATGTTT CATGGGATCC CAGCCACGCC GGGCATAGGA 120 

GCCCCTQGGA ACAAGCCGGA GCTGTATGAG GAAGTGAAGT TGTACAAGAA CGCCCGGGAG 180 

AGGGAGAAGT ACGACAACAT GGCAGAGCTG TTTGCGGTGG TGAAGACAAT GCAAGCCCTG 240. 

20 GAGAAGGCCT ACATCAAGGA CTGTGTCTCC CCCAGCX^GT ACACTGCAGC CTGCTCCCGG -300 

CTCCTGGTCC AATACAAAGC TGCCTTCAGG CAGGTCCAGG GCTCAGAAAT CAGCTCTATT 360 

GACGAATTCT GCCGCAAGTT CCGCCTGGAC TGCCCGCTGG CCATGGAGCG GATCAAGGAG 420 

GACCGGCCCA TCACCATCAA GGACGACAAG GGCAACCTCA ACCGCTGCAT CGCAGACGTG 480 

GTCTCGCTCT TCATCACGGT CATGGACAAG CTGCGCCTGG AGATCCGCGC CATGGATGAG 540 

30 ATCCAGCCCG ACCTGCX3AGA GCTGATQGAG ACCATGCACC GCATGAGCCA CCTCCCACCC 600 

GACTTTGAGG GCCGCCAGAC GGTCAGCCAG TQGCTGCAGA CCCTGAGCGG CATGTCGGCG 660 

TCAGATGAGC TGGACGACTC ACAGGTGCGT CAGATGCTGT TCGACCTGGA GTCAGCCTAC 720 

AACGCCTTCA ACCGCTTCCT GCATGCCTGA GCCCGGGGCA CTAGCCCTTG GACAGAAGGG 780 

CAGAGTCTGA GGCGATGGCT CCTGGTCCCC TGTCCGCCAC ACAGGCCGTG GTCATCCACA 840 

40 CAACTCACTG TCTGCAGCTG CCTGfTCTGGT GTCTGTCTTT GGTGTCAGAA CTTTTGGGCC 900 

GQGCCCCTCC CCACAATAAA GATGCTCTCC GACCTTCAAA AAAAAAAAAA AAAAAAAAGR 960 

KGSGGCCGGT CCCCANTCCC CCC 983 

45 



25 



35 



50 



(2) INFORMATION FOR SEQ ID NO: 46: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECSIESS: double 
55 (D) TOPOLOGY: linear 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
CCGGCTCATC GCTGCCGCTC CGCCAATACA ATAGAGCCAK CCACTACCAG CAGCCTGGCC 60 



wo 98/54963 



305 



PCTAJS98/n422 



CTCTTCCTCC TTCTCCAGAG AGACCAATCC 
GAGGAAGTGA CCATGGACAC AAGTGAAAAC 
5 ATGCCTATTG CAGACCAAGT CAGCAATGAT 
GAGAAGAAAG AGAGCTCGCT GCCCAAATCA 
ACCAAGGGGG TGCXMCTGG AAACAGTCAC 

10 

CGCTGGQGAG CCAGCACAGC CACCACACAG 
TCACTAAAGA GCCTCATCCC CGACATCAAA 
15 CTTCATGCTG ATGACTCTCG CATCTCTGAG 
ACCCATGACA AGGGGCTGAA AATATGCCGG 
CAGGAGAATG GGCAGAGGGA AGAAGAGGAA 

20 

GTACCTCCCC AGGTGTCAGT AGAGG?rGGCC 
AAAGTGACTT TAG6AGATAC CTTAACTCGA 
25 TCCATTACCA TTGATGACCC AGTCCGAACT 
ATTAGCAACA TTGTCCftTAT CTCCAATTTG 
GAGTTGTTGG GGCGCACAGG AACCTTGGTG 

30 

TCTCATTGCT TTGTAACGTA CTCAACAGTA 
CACGGGGTCA AATGGCCCCA GTCCAATCCC 
35 GATGAGCTGG ATTATCACCX5 AGGCCTCTTG 
GAGCAGGGAA TACCACQGCC CCTGCACrCC 
CACCCCCGGG CAGAGCAGCG GGAGCAGGAA 

40 

GAACGG6AAA TGGAGCGGCG GGAGCGGACT 
GTTCGAGAAG GGCCCCGTTC CCGATCAAGG 
45 AAGTCTAAAG AAAAGAAGAG TGAGAAGAAA 
CTGCTGGATG ACTTTTTCCXS AAAGACCAAG 
ACTTGACAGCC AGATCGTTCA GAAAGAGGCA 

50 

AAGCX3GCGAA AGGAGCAAGA AGAAGAAGAG 
GAACGGAACC GACAGCTGGA GOGAGAGAAA 
55 GAGAGAGAGA GAGAAAGGGA GCGGGACAGG 
CGAGAACGAG GCAGGGAAAG GGATCGCAGG 
CGGAGCACAC CTGTGCGGGA CCGGGGTGGG 

60 



AGCCX3AACTC GGGGTTTQCC TGAGGAGAAG 120 

AGACCTGAAA ATGATGTTCC AGAACCTCCC 180 

GACCX3CCX:GG AGGGCAGTGT TGAAGATGAG 240 

TTCAAGAGGA AGATCTCCGT TGTCTCAGCT 300 

ACAGAGGGGG GCCAGCCTGG TCGGAAACGA .360 

AAGAAACCTT CCATCAGTAT CACCACTGAA 420 

CXrCCTGGCGG GGCAGGAGGC TGTTGTGGAT 480 

GATGAGACAG AGCGTAATGG CGATGATGQG 540 

ACAGTCACTC AGGTAGTACC TGCAGAGGGC 6Cio 

GAAGAGAAGG AACCTGAAGC AGAACCTCCT 660 

TTGCCCCCAC CTGCAGAGCA TGAAGTAAAG 720 

CGTTCCATTA GCCAGCAGAA GTCCGGAGTT 780 

GCCCAGGTGC CCTCCCCACC CXTGGGGCAAG 840 

GTCCX3TCCTT TCACTTTAGG OCAGCTAAAG 900 

GAAGAGGCCT TCTGGATTGA CAAGATCAAA 960 

GAGGAAGCTG TTGCCACCCG CACAGCTCTG 1020 

AAATTCCTTT GTGCTGACTA TGCCGAGCAA 1080 

GTGGACCGTC CCTCTGAAAC TAAGACAGAG 1140 

CXaXrCCCXIAC CCCCGGTCXA GCCACCACAG 1200 

CGGGCAGTGC GGGAACAGTG GGCAGAACGG 1260 

CGATCAGAGC GTGAATGGGA TCGGGACAAA 1320 

TCCXOTRACC GCCX3CCGCAA GGAACGTGCG 1380 

GAGAAAGCCC AGGAGGAACC ACCTGCCAAG 1440 

GCAGCTCCCT GCATCTATTG GCTCCCACTG 1500 

GAGCQGGCCG AACGGGCXJA GGAGCGGGAG 1560 

CAAAAGGAGC GGGAGAAGGA AGCCGAGCGG 1620 

CGTOGGGAGC ACAGTCGGGA GAGGGACAGG 1680 

GGGGACOGAG ATGGGGATAG GGAAAGGGAC 1740 

GACACCAAGC GCCACAGCAG AAGCCGGAGT 1800 

CGCCGCTAGC TGGGAAAACA CTAGAGCTGC 1860 



wo 98/54963 



PCT/US98/11422 



306 



AGGTACCAGC CACTCGGCCC CAC5GGGGTTA TGGCCACAGA GQGATAGGC^ CAGTCTCCAC 1520 

CACCCTGGAG CCAAGGGTCT TTCACATCAC CTATCCCTAC ATACATACC^ AATGGAAAAG 1530 

5 TGGCCATCCT TTTCCCCCCA AACACACCCC CTTAACCTAT CTCTTGGGAC TTAGCCCGAC 2340 

CCTCCCICTC ATTTCCCATT A7«3TCTGAGA GGCAAGAGCT AGGTTAGGC^ AGGAGGTGG7 2100 

TGGCCAGAGA TOGGGAACAG CCAGGTGCCC CAGTCCTCTG ATTTTTCCTC CATCCTGCTT 2160 

10 

ACCACCrcCC TOGGTACTTA CAQCXTTTCTC TTGGGAACAG CCGGGGCCAG GACTGGGTCA 2220 

CCTATGAGCT GAATCAGCAT CTCCTCCTGA GTCCCAGGGC CCCTGCAGTT CTCAGTCTCT 22B0 

15 TCTGTCCTGC AGCCCTTGCC TCTTTCCCAC AGGTTCCACT TTATATCCAC CTTTTCCTT? 2240 

TGTTCAATTT TTATTTTTAT TTTTTTTATT ATTAAATGAT GTGGTCTATG GAAAAAAAAA 2400 

TAAAAATCTG AjCTTAGTTTT A 2421 

20 



(2) INFORMATIW FOR SEQ ID NO: 47: 

25 

(i) SEQUENCE CHARACTERIStlCS : 

(A) LENGTH: 840 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
30 <D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

CTCAAACTCC TGAGCTGAAG CGATCTACCT GCCTCAGCTA GGATTACAGG TGTGAGCC=..C 50 

35 

CGCACCCAAC CTCAATAAGC KTATTTGATA AAAKATATGC AAGCTCCCTT TATKCACTT? 120 

TCATTCAGAA TGTTTAGTAA TTTGTATTGT TTTTCAGATT TTCAGCCCAA TATATCTCC/ 130 

40 TGCCCACTGT GTCACTGTAT TCTACCTAWA CATCATCACG TGTTTCTGCT ATTQGCTGTA 240 

TGATQGAACA CTGCGGCTCA T T TTCCTGAA AACTGCCGAT AGTGCATAGA RTGCTGGGAT 300 

GGAAACCAGA ARCTTTGAAT TCAAGCCTTG GTTCTGCCTT GTTTTTGCTT GGGTGGCCTT 360 

45 

GAGTCAGCCA CATACCTTTT AAAATCTCAA TTTATTAGAA ATTATTCCAA ATCAAAATCA 420 

AATGAGAAGG TATATACAAA AGTGCnTAT CCCACAATAA ACTATTCAM AGAGAGCAAA 480 

50 GGAGAGGACA TTTACTCAAC ACCTCCTAAA AGGCAGCCAG TGAAATTAGG CATTTTATTT 540 

AATCCTCCTG GCAACTCTGA GAGTAAAGCA TTATTAATCC CATTTTGGCT GTTTAAAGftA 500 

ATTATTTQCA CTAGATTCCA GCTGTAGTTT AGYTTCAGAA AAAAAAATCC TGAGATGTGA 560 

55 

ATTCACAQCT TTCTQGGTTT AAAGCCCAAG CTCTATCACA TCATGCTATT ATTGTTACAT 720 

TACTGCTAGT TCTATGAAAA GAAATACTAA TTTATGAAAT ACATCTTATC GAAAAAAAAA 780 

60 AAAAAAAAAC TGGGAGGGGG GGCCCGTACC CAAATCGCCG GATAGTGATC GTAAACAATC 340 
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30 



40 



50 



5 (2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2432 base pairs 

(B) TYPE: nucleic acid 
10 {€) STRANDEDNESS: double 

(D) TOPOrXXSY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

15 GGCACGAGGC CCGGAACGCT GAGGAAGGGC CCGTCCCGCC TTCCCCGGCG CGCCATGGAG 60 

CCCCGGGCQG TTGCAGAAQC CGTGGAGACG GGTGAGGAGG ATGTGATTAT GGAAGCTCTG 120 

CGGTCATACA ACCAGGAGCA CTCCCAGAGC TTCACGTTTG ATGATGCCCA ACAGGAGGAC 180 

CGGAAGAGAC TGGCGGASTG CTGGTCTCCG TCCTGGAACA GGGCTTGCCA CCCTCCCACC 240 

GTCTCATCTG GCTGCAGAGT GTCCGAATCC TCTCCCGGGA CCGCAACTGC CTGGACCCGT 300 

25 TCACCAGCCG CCAGAGCCTG CAGGCAYTAG CCTGVTATGY TGACATCTCT GTCTCTGAQG 360 

GCTCCGTCCC AGAGTCCGCA GACATGGATG TTGTACTGGA GTCCCTCAAG TGCCTGTGCA 420 

ACCTCGTGCT CAGCAGCCCT GTGGCACAGA TGCTGGCAGC AGAGGCCCGC CTAGTGGTGA 480 

AGCTCACAGA GCGTGTGGGG CTGTACCGT6 AGAGGAGCTT CTCCCACGAT GTCCAGTTCT 540 

TTGACTTCCG GCTCCTCTTC CTGCTAACX3G CACTCCGCAC CGATGTGCGC CANAGCTGIT 600 

35 TCAGGAGCOXS AAAQGAGTCC GCCTGCTAAC TGACACACTG GAGCTGACGC TGGGGGTGAC 660 

TCCTGAAGGG AACCCCCCAC CCACGCTCCT TCCTTCCCAA GAGACTGAGC GGGCCATGGA 720 

GATCCTCAAA GIGCTCTTCA ACATCACCCT GGACTCCATC AAGGGGGAGG TGGACGAGGA 780 

AGACGCTGCC CTTTACCGAC ACCTGGGGAC CCTTCTCCGG CACTGTGTGA TGATCGCTAC 840 

TGCTGGAGAC CGCACAGAGG AGTTCCACGG CCACGCAGTA ASCCTCCTGG GGAACTTGCC 900 

45 CCTCAAGTOT CTGGATGTTC TCCTCACCCT GGAGCCACAT GGAGACTCCA CGGAGTTCAT 960 

GGGAGTGAAT ATOGATOTGA TTCGTGCCCT CCTCATCTTC CTAGAGAAGC GTTTGCACAA 1020 

GACACACAGG CTCAAGGAGA GTGTAGCTCC CGTGCTGAGC GTQCTGACTG AATGTGCCCG 1080 

GATGCACCGC CCAGCCAGGA AGTTCCTGAA GGCCCAGGTG CTGCCCCCTC TGCGGGATGT 1140 

GAGGACACGG CCTGAGGTTG GGGAGATGCT GCGGAACAAG CTTGTCCGCC TCATGACACA 1200 

55 CCTOGACACA GATGTGAAGA GGGTGGCTGC CGAGTTCTTG TTTGTCCTGT GCTCTGAGAG 1260 

TCTCCCCCGA TTCATCAAGT ACACAGGCTA TGQGAATGCT GCKXXXTTC TGGCTGCCAG 1320 

GGGCCTCATG GCAGGAQGCG GCCCGAGGGC AGTACTCAGA GGATGAGGAC ACAGACACAG 1380 

60 
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ATCAGTACAA GGAAGCCAAA GCCAGCATAA ACCCTGTGAC CGGGAGGGTG GAGGAGAAGC 1440 

CGCCTAACCC TATGGAGGGC ATGACAGAGG AGCAGAAGGA GCACGAGGCC ATGAAGCTGG 1500 

TGACCATOTT TGACAAGCTC TCCAGGAACA GAGTCATCCA GCCAATGGGG ATGAGTCCCC 1560 

GGGGTCATCT TACGTCCCTG CAGGATGCCA TGTGCGAGAC TATGGAGCAG CAGCTCTCCT 1620 

CGGACXTCTCA CTCGGACCCT GACTGAGGAT GGCflGCTCTT CTGCTCCCCC ATCAGGACTG 1680 

GTGCTCCTTC CAGAGACTTC CTTGGGGTTG CAACCTGGGG AAGCCACATC CCACTGGATC 1740 

CACACCCGCC CCCACTTCTC CATCTTAGAA ACCCCTTCTC TTGACTCCOG TTCTGTTCAT 1800 

15 GATTTCCCTC TGGTCCAGTT TCTCATCTCT GGACTGCAAC GGTCTTCTTG TGCTAGAACT 1860 

CAGGCTCAGC CTCGAATTCC ACAGACGAAG TACTTTCTTT TGTCTGCGOC AAGAGGAATG 1920 

TGTTCAGAAG CTGCTGCCTG AGQGCAGGGC CTACCTQGGC ACACAGAAGA GCATATGGGA 1980 

20 

GGGCAGGGGT TOGGGTGTGG GTGCACACAA AGCAAGCACC ATCTGGGATT GGCACACTGG 2040 

CAGAGCMANT GTKTTGGGGT ATGTGCTGCA CTTCCX:AGGG AGAAAACCTG TCAGAACTTT 2100 

25 CCATACGAGT ATATCAGAAC ACACCXTTTCC AAGGTATGTA TGCTCTGTTG TTCCTGTCCT 2160 

GTCTTCACTG AGCGCAGGGC TGGAGGCCTC TTAGACATTC TCCTTGGTCC TCGTTCAGCT 2220 

GCCCACTGTA GTATCCACAG TGCCCX5AGTT CTCGCTQGTT TTGGCAATTA AACCTCCTTC 2280 

30 

CTACTGGTTT AGACTACACT TACAACAAGG AAAATGCCCC TCGTGTGACC ATAGATTGAG 2340 

ATTTATACCA CATACCACAC ATAGCCACAG AAACATCATC TTGAAATAAA GAAGAGTTTT 2400 

35 GGACAAAAAA AAAAAAAAAA AAAAAAAAAA AA 2432 



40 (2) I^3F0RMATI0N FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1742 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: double 

(D) T0P0LCX3Y: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

50 GTCCTGCAQG AQCTGCACGC GGCCGAGGTG CGCANGAACA AGGAGCAGCG AGAAGAGATG 60 

TOGGGCTAAG GGCCCGGSAC GRGSGGCGCC CATCCTGCGA CGGAACACGT TCGQGTTTTG 120 

GTTTTGTTTC GTTCACCTCT GTCTAGATGC AACTTTTGTT CCTCCTCCCC CACCCCflGCC 180 

55 

CCCAGCTTCA TGCTTCTCTT COGCACTCAG CCGCCCTGCC CTGTCCTCGT GGTGAGTCGC 240 

TGACCACGGC TTCCCCIGCA GGAGCCGCCG GGCGT6RAGA CGCGGTCCCT CGGTGCASAC 300 

60 ACCAGGCCGG GCGCGGCTGG GTCCCCCGGG GGCCCTGTGA GAGAGGTGGY GGTGACCGTG 360 
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GTAAACCCAG QGCXXnXSGCG TGGGATCRCG GGTCCTTACG CTGGGCTGTC TGGTCAGCAC -420 

GTCCAGGTCA GQGCAGGTCC TCTGAGCCGG CGCCCCTGGC CAGCAGGCGA GGCTACAGTA 480 

5 

CCTQCTCTCT TTCCAGGQGG AAGGGGCTCC CCATGAGGRA GGGGCGACGG GGGAGQGGGG 540 

TCATGGTCCC TGGGAAGCCT GCKTGTGCAN CCGGTGCTTG TTGAACTGGC AGGCX3GGTGG 600 

10 GTGGGGGCTG CAGCTTTCCT TAATGTGGTT GCACAGGGGT CCTCTRAGAC CACCTGGCGT 660 

GAGGTCGACA CCCTGGGCCT TCCTGGAAGC CTGCAGTTGG GGGCCTGCCC TGAGTCTGCT 720 

GGGGAGTCGG CATTCTCTCC CAGGGACCCA TGAGCAGGCT GCATGGTCTA GAGGTTGTGG 780 

GCAGCATCGA CAGTCCCCCA CTCAGAAGTG CAAGAGTTCC AAAGAGCCTC TGGCCCAGGC 840 

CCCTCCGTGG GACAGCCCCG CCGCCCCTCC CCACCAQGGC TTTGCAGATG TCCTTGAAAG 900 

20 ACCCACCCTA GAGCCCTTTG GAGTGCTGGC CXrCTCCTGTG CCCTCTGCCC TGGTGGAAGC 960 

GGCASCACAA GTCCTCCTCA GGGAGCCCCA AGGGGGATTT TKTGGGACCG CTGCCCACAG 1020 

ATCCAGGTGT TGGAAGGGCA GCGGGTAAGG TTCCCAAGCC AGCCCCAACA CCCTTCCCAC 1080 

TOGGCACCCA GAGGGGGCT6 TGGGTGGAGG CCTGACTCCA GGCCTCTCCT QCCCACACCC 1140 

TCTX3GGC1GA GTTCCTICTT TCCCTTQGAC GCCCAGTGCT GGCCTTGGAG GACGGTCAGC 1200 

30 TGGAGGATCG CQGTCGGGGA GGCTGTCTTT GTACCACTGC AGCATCCCCC ACTTCTCCAC 1260 

GGAAGCCCCA TCCCAAAGCT GCTGCCTGGC CTCTTGCTGT AAAGTGTGAA GGGGGCQGCT 1320 

GAGTTCTCTT AGGACCCAGA GCCAGGGCCC TCAACTTCCA TCCTGCGGGA GGCCTTGGCC 1380 

GGGCACIGCC AGTGTCTTCC AGAGCCACAC CCAGGGACCA CGGGAGGATC CTGACCCCTG 1440 

CAGGGCICAG GGGTCAGCAG GGACCCACTG CCCXIATCTCC CTCTCCCCAC CAAGACAGCC 1500 

40 CXMAAQGAG CAGCCAGCTC OGATGGGAAC CCAAGGCTGT CCACATCTGG CTTTTGTQQG 1560 

ACTCAGAAAG GGAAGCAGAA CTGAGGGCTG GGATATTCCT CATGGTGGCA GCGCTCATAG 1620 

CGAAAGCCTA CTOTAATATG CACCTATCTC ATCCACGTAG TAAAGTGAAC TTAAAAATTC 1680 

AATCAAATGA ACAATTAAAT AAACACCTGT GTGITTAAGA AAAAAAAAAA AAAAAAACTG 1740 
CG 



25 



35 



45 



50 



(2) INFORMATION FOR SEQ ID NO: 50: 



55 (i) SEQUENCE CMARACTERISTICS : 

(A) LENGTH: 1487 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

60 



1742 
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(Xi) SEQUENCE DESCRIPTICW: SEQ ID ISFO: 50: 



GGCACGAGCC TCCGCGAACT GTGGAGTCGG CGGAGGGCTG GAATCAGCGT GGGCTCCAGG 60 



5 TCGCTX3GCAG CCGGGTGGCA GAACTCTTCC GAGGCTCCTT GGGAAGAAGC TACACCCGAG 120 



GGAGCCGGAT GQGCCTCGAA AACCTGGCCC GCTCTGGTTC TGTACCATTG CAAGGGGAAC 180 

CGTAAACTGA GCTTTTCTAA CGTGGGTTTC TGCCAAGTAC TTTTCCAGCT GCCCCCTTCC * 240 

10 

CXXXMCACA CAGGAGAGCC TCTGTGTAGC CAGCXKrTTGA CAGTCGTTAG GTAGGTTGTA 300 



CTOTGTAGGG AGGAGCTCAA GATCATGAAT GGTTGTCACA GGAGAAAGCG GTTGCATCTr 360 



15 TGCAAAACTA TATACCTGCT GTGGTTTGTG TTTTCTTTTC TGCTGAGTAA TGAAGTTGTA 420 



AGTTCACACT GGCACATTCT CAGGGCTGTG CAGATTATTT GCACTTTATT TCATAGGTGR 480 



ATAAOTGCTT TTTAGCTTTC TTTGTATATT GAGTTGCTTT TGAATTGCTT CCCATATTTT 540 

20 

TATTTCATAC AAACTGAACA ATTGTGGCCC CTCTATTTTA TTTATAAAGG TTCAGTGTAT 600 

CTTTGCCTGC CTACATCAAT CTGCAAGGGA GTTGCAGAAA GCCTCATGTT CATCGAGCCG 660 

25 TGAGTCACAA CXAATTTCTA AGCTGTTATA ACAAAAAAGT GTTTGCTTTT TTTCACAAGT 720 

AACTTTAAAA GTGTAGTTTA GAAAGAAAAC ATTTTCAATA AAAAGACACT ACATTAATCC 780 



TGGATGCTTG CAAATCCTAA AAIMTATTCC TCCTCTAGCX3 TTGCACAGCT CTGTGTTGTA 840 

30 

TACACAGACT AGCTTTAAAA TTTGTCACAT ACCACTTTAC CTTTACTTTT ATGTATCATT 900 



CCX:CX3GACTT CXTTTACTGCA GGTGTGGGCA AGAAAACTTT TCCTTTAACA CTTTTCAACA 960 
35 GCQGGCATAA AATTCTGCAG CTGAGGTCTT GAAGAATGCA GATGGGTACA GTATCTGTTG 1020 



GAGCTCACAG TGTGTATTGA CTAACCTAGT TCCTTTTTTG CrrTTTTTGG TATTGTCTTG 1080 

TEAAAAGTGA CTCXCAGGTA GCAACTCTCT TTTTTAAGGG TQGGAACGAA AGGGACGTAG 1140 

40 

GAAGAATAGA TCTAGATTAT TTAACAGTCT TCGATAGAGT TTGAAAGCTT TCTTCTTCAT 1200 



TCAATTTTGG GCAAAATACT GCCTCTGCAT TTGTTCATAA CAAAAAGA1T AGATTAATAA 1260 
45 GTAGCTTTTG TTGGTGGAAA TTACCAGCTC TATAAGTCAC CCTTGGTGGT TCATGGACCT 1320 
CTGATTAGCT TGGCSTTTTGC AGTCTCATTG CCACATGTAT ATGTGGAGCX: AATGGCCTTT 1380 



TQGTOCTCAG CTGTrTACGT CTGACTCCTT GACTTCTTTG GTACAGTGAT GGAGTCAGAT 1440 

50 

CTCATTAAGT GTGATTCTCC ATOGATATAA CCAGCCCCAA AAAAANG 1487 



55 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1328 base pairs 
60 (B) TYPE: nucleic acid 
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(C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 51: 

5 

GGCACGAGCT CGTGOCGAAT TCGGCACGAG AGAAGATTTG AAGAAGCCAG ATCCAGCTTC 60 

CCTGCGGGCT GCTTCITCTG GGGAAGQGAA AAAGAGGAAG GCCTGTAAGA ACTGCACCTG 120 

10 TGGCCTTGCC GAAGAACTGG AAAAAGAGAA GTCAAGGGAA CAGATGAGCT CCCAACCCAA 180 

GTCAGCTTCT GGAAACTCCT ACCTGGGCGA TGCCTTCCGC TGTGCCAGCT GCCCCTACCT 240 

TGGGATGCCA GCCTTCAAAC CTGGGGAAAA GGTCCTTCTG AGTGATAGCA ATCTTCATGA 300 

15 

TCCCTAGGAG GTTCCTGACA TGGGACCCAT CTGCTCCTCC AGCCAACTCC TGTCCCTCAC 360 

ATCCCACCAT GGTGGCTCCT CCCACCTCCT CTGGATTTGT TCACTCTGAG ATCTGTTTGC 420 

20 AGAGTCGGTC CTTAGCAGAC AGAGTGAAGC TGGCTGGGGG GCACAGTGGT GTGTAGTGCT 480. 

GCTGTGTATC AAAAGACCAA GGTATTATGG GACCTGGTTT CAGAATGGGA TGGGTTTCTT 540 

CACCTCATGT TAAGAGAAGG GAGTGTGTCC TGAAGAAGCC CTTCTTCTGA TGTTAAAATG 600 

25 

CTGACCAGAA CGCTCTTGAG CCCACGCATC GTTGAGCATT AACACTCTGT GACAGAGCTG 660 

CAGACOCCTG CCTTGAGTCT CATCTCAGCA ATGCTGCCAC CCTCTTGTCT TTCAGAGTTG 720 

30 TTAGTTTACT CCATTCTTTG TGACACGAGT CAAGTGGCTC ACAACCTCCT CAGGGCACCA 780 

GAGGACTCAC TCACTGGTTG CTGTGATGAT ATCCAGTGTC CCTCTGCCCC CTTCCATCCC 840 

CAACCACATT TGACTGTAGC ATTGCATCTG TGTCCTGTTG TCATTTATGT TAACCTTCAG 900 

35 

GTATTAAACT TGCTCCATAT CTTGACATAT CTTGAGATTC TGCATGTCTT GTAAAGAGAG 960 • 

GGGATCTCCA TTTOTGTGTG ATGTTGGATA GTCATCCACG CTCAGTTTGG ACCATTQGAG 1020 

40 GAACTTAGTG TCACGCACAA ATGGGGCTAT TCCTACGCTT AGAATAGGGC TTGTCTGCCC 1080 

ACTTTAGAAG AGTCCCAGGT TQGTGAGCAT TTAGAGGGAA GCAGGGCAGA ACTCTGAACG 1140 

ACAATACGTC TCTCTGAGCA GAGACCCCTT Wri m OTT ATCCACCCAT ATGGACTTGG 1200 

45 

AATCAATCTT GCCAAATATT TQGAGAGATT GrrGTOGATTT AAGAGACCTG GATTTTTATA 1260 

TTTTACCAGT AAATAAAAGT TTTCATTGAT ATCTGTCCTT GAAAAAAAAA AAAAAAAAAA 1320 

50 AAACTCGA ^^28 



55 (2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1856 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANDEDNESS: double 
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(D) TOPOLCGY: lir-ear 
(xi) SEQUH^KTE ZESC^lTIZZli : SEQ ZD llCi 52: 

GAATTCGGCA CGAGCTCTGC A^ZXTTSC-A ATa^-JwrTCC ?:^'ZZ::i?'J2C(Tr TCCGCTGCCC 60 

CCTAGATTAA ATTCCXXX3GG CTGAAArT-SA GTTC-CASATT TATPJ-.TATCA TATTTTAAAT 120 

TGCTCTCTTC AATTAAACCA TTTATa-JTCA TAACTAATTT rCASSATCTC GATGCATGCT ' 180 

TTTCCAGGCC TTCCTTCTTT GV^JZ^JJ^X72 A---.T<?rCCAr AAA3CGTTTC ACTTATATTC 240 

TTCAAACATG ATGCTAATTT AAATTA=-:?rA CTICCTATE^. rATSTTATTA TTCCTATGAT 300 

15 TTTGCCACTG TTArTAGTTC TCTCAA-AAr AC-.TCTAGGS .=iA:^=aGGATTA TTTTAACT?A 360 

TTTGATTATC TTTCTATCTC TTTTATTTAr TTCrCATTTA CTTiAGAAAT TCGTTCCATT 420 

GGriTGGCATT GATACAGTAA ATTTGT.-AAr GAGGASACA.2. rATAAAAAAT CTAAATTACT 480 

20 

TGrrOCTTAAT GACTGTAGCA GAATSCCTTT TCTCTAAA'TC A3iZr?3TCTT TCTTGCAGTT 540 

TAGTTTGATA GATTTGCAAG CTATGCTGCT TCC:A"rGAASr TASTIGCGCT GGTAGGAA2G 600 

25 CAGGCTTCTT TGTCrCTGGT TCTAGCTTGC ATG?CC3CCC CATTJ^JSGCAS ACAACGTAGC 660 

CGGAGATCAC AAATCAGGCC CT?GGT:?rA3 TTGCIAS?:?? XTGaP-jSCTGC AGAGAGGTTG 720 

GCAGAAACTG ACXTXACTGG GCAAGGCrTGQ CC-.TGGACCT GAnCTTTAA TGCACTCTAT 780 

30 

GTCTTCAQGA AGCCACAGGC CACATTTGAC TCTGA3AAi3 AAA^lAAGP-iS GAAAAACCCC 840 

ACAAAGTATA ACAACCCCTT AAGATACATC TATTTTAA^^ IGAAATTAA? TrTTCAGTTT '900 

35 ATACCATTGG CCAATTACAA GATAAAAATG TTCAArTTCr rCAASAATCC TrTGTTGACT 960 

TGTCTTTTCA TCTCTTGCTA TTTATAmG TC-.CrCmA3 "CAACAAAJ^T CTTATTTGCT 1020 

GAGGAAGGAC TTTGCTGCAC TTACTGrACC AjCATCAAACA CTGGcGAGGG TGGTGTTTAA 1080 

40 

CTTTTTAAAA AATGTTATTC TGATTATAAT AATAAZATTC- GCTTTTTTCA TGAAAAGACC 1140 

GCCACCTTGC AAGGTTTAGT GAGAirrATG GA-^inTSAAr ACCTAAGCAG GAATTGCTGC 1200 

45 TAGCTCCAAA AATTTGCGAA GCAAAA5CIA GCCCCAATIG CnrrGGAAGT TTGAAACTGA 1260 

TTAACAGATT TGCATTTGAA GTGACTrCAG ACATTAGGCC CA3A2ATTAG TTAAAAATAG 1320 

AAAGAGGAAT AAAGACATCP YTTCTCrCTA GAAAA3ATAA C^TCrJCAAirr AATAATCCTT 1380 

50 

CXXACTTTCA TTGAGATCAG CTrGTCT G Ar AACCTGATAT GAGTGTGAJi^V ATGATAAACA 1440 

TGATAATAGT GGTACTTTTG TAATTTTGCT GCTGC^JITIA ASAASATAGT AAAKGATQAG 1500 

55 TTCAYCTTTT CTYCGAACAT YCCTA.riCCT AGATGCAG77 tACCTCAAAT TGGGAATTAT 1560 

AACTGTCCTA ATTTTTGTTG TGTACCCTGA TGCCCCTTrr GCTTTAATAC CCACAGTGTA 1620 

ACAATTAAAT ATCACACTAT GACATATGAr TTAA£?rAGG;;. CArTTTAAAG ATAAATTTTA. 1680 



60 
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GGGGTAAATC TTTACTTCAA AATGACTCCA TATTTCAAAT ATCTGTTTAG ACTGTGAAGG 1740 

CCAAATAATT TTTAAGAAAA CATTTGAAGA GTAGTGTGTT TGCATTTGTG AATAATCTTA 1800 

5 CTCACAGCAA GTAAACGTAA TAAAACSCXrAA CATTTAAGCC AAAAAAAAAA AAAAAA 1856 

10 (2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1558 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: double 

<D) TOPOIiOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

20 TGGGTATCCA TTCCTGNAAT TACTTTACTT AGGATAATGG CCTCCAGCTC CGTCCAAGTT 60 

GCTGCAAAAG GTATTATTTC GTTCCTTTTT GTGGCTGAGT AGTATTCCAT GGTGTATATA 120 

TACCACATTT TCTTTATCCA CTCATTGCTT GATGGGCAGT TAGGTTGGTT CCACATCTTT ISO 

25 

GCAATTGTGA GTTGTGCTGC TCCAGATATC ATCTTTAACT CCTTTGCCTT CTCCACATAC 240 

ATTTCCAAGT CCTGTTCATT CTACCTCCAA AATGTATCTT GTATCCATTC ATCTCTCTCC 300 

30 ATCTTCAATC TATTTCAATG CCCCATCATC TCTTGCATGG AGGAGTGTAA TAATTGGCTA 360 

ACTGGCCTGT TCTTACATTT TAAAATCAAA AGATGTGACA GGTGAAATGC CTATTTCAGT 420 

GTCCATTGAT GGTTCTGCTT ACACACCACC TGGCTGCCTG GTGTCGCAGT GGCAGAGTTG 480 

35 

AGCAGTGTGA AAAAGACTGC TTGGCCCTTT ACAQGGAAAG CAGGTCCACT GTGGCCTGTG 540 

AGGACGAGAG CTCTGGGCAG GCTCG6ACAC TQGCAGACCC TGGTCCTGGC TGGCCAAGGC 600 

40 AGCAGGGTAT GTGTTTCGGG TCACTCACAG GGCTCAGCAC CACTCCTCAT GGCTTCCTTA 660 

CTGTTTCGGC AGAGGCTGAC CCGCGGCTGA TTGAGTCCCT CTCCCAGATG CTGTCCATGG 720 

GCTTCTCTGA TGAAGGCQGC TGGCTCACCA GGCTCCTGCA GACCAAGAAC TATGACATOG 780 

45 

GAGCGGCTCT GGACACCATC CAGTATTCAA AGCATCCCCC GCCGTTGTGA CCACTTTTGC 840 

CCACCTCTTC TGCGTCCCCC TCTTCTGTCT CATAGTTGTG TTAAGCTTGC GTAGAATTGC 900 

50 AGGTCTCTGT ACQGGCCAGT TTCTCTGCCT TCTTCCAGGA TCAGGGGTTA GGGTGCAAGA 960 

AGCCATTTAG GGCAGCAAAA CAAGTGACAT GAAQGGAGGG TCCCTGTGTG TGTGTGTGCT 1020 

GATGTTTCCT GGGTGCCCTG GCTCCTTGCA GCAGGGCTGG GCCTGOGAGA CCCAAGGCTC 1080 

55 

ACTGCAGCGC GCTCCTGACC CCTCCCTGCA GQGGCTACGT TAGCAGCCCA GCACATAGCT 1140 

TGCCTAATGG CTTTCACTTT CTCTTTTGTT TTAAATGACT CATAGGTCCC TGACATTTAG 1200 

60 TTGATTATTT TCTGCTACAG ACCTGGTACA CTCTGATTTT AGATAAAGTA AGCCTAGGTG 1260 
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TTCTCAGCAG GCAQGCTGGG GAGGCCAGTG TTGTQGGCTT CCTGCTGGGA CTGAGAAGGC . 1320 

TCACGAAGGG CATCCGCAAT GTTGGTTTCA CTGAfiAGCTG CCTCCTGGTC TCTTCACCAC 1380 

5 

TGTAGTTCTC TCATTTCCAA ACCATCAGCT GCTTTTAAAA TAAGATCTCT TTGTAGCCAT 1440 

CCTGTTAAAT TTGTAAACAA TCTAATTAAA TQGCATCAGC ACTTTAACCA AAAAAAAAAA 1500 

10 AAAAAAAAAA AAANAAAAAA AAAAGGGGGC CGCTCTAGAG GTCCAAGTTA NGACX3NGG 1558 

15 (2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 948 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEIX^S: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

25 TAAAAATCAT GCTCTGTACC ATCCTCACCG TAGTCATCAT CATCGCCGCG CAGACCAOGA 60 

GAACTACTGG GATCCCTAAA AACGCCCCTG GTCCGGCCCC ACTCTGCGCC CCTCGATCTC 120 

CCAGGCTCTT TCTGCAGWCA TACCGCGGAC CCAATGGGCG CCCTGCACAC CCGTTTCTGG 180 

30 

GGCCGTCAGA CTTGGATACA TCGTAAACTC CGCCTCCACG GAACGTCTCG CCTKGCGAGC 240 

AAGMTCGGAA TCCAGTTCCT CAGGAACCCC TCCAAAACCC ACACCCCCAG GGACGCCGCT 300 

35 TTCCQGGATC CCGGSCAAAC GCCGGACCCT CAGTCGCTCC AGGCCCCCTC ACCCTCAAAG 360 

TGTAGCGCCC CCAACCGAGC AACCTCGGTT TGGTCCCTAA AACCCCGCCT CCTCTATAAG 420 

CACCGCCCCA GCTCTGACAA AACCCCGCCT CCAGGTCGGC AGGCTCCGCT TCTTTTCTTC 480 

40 

TCCGCGGGGT GATTCAGTCC AGTGATTGGG TTTGTGGCTC CAGGCCTCGC CCACAGACGG 540 
ACAGACCCCT CCCTTTCTTC CGGCAAAAGG ACCGAGCCCT GGGGTAGTAA GGSCCCCACA . 600 

45 CTCCTGTTTT TTGCAAGTAC ATTTTTGTCC YTCCTCCACC CAGGTATCTG CCTATTTTCT 660 

TGCTAATCCC AGAACCTTTC CTTTTGCTTT TTTTAAGGAC ATTTQGGAAG TTCCTGGTGT 720 

AGGACCCTTC TCCCTGQGAT AAGAAACCTG CCTGTAAACG CTCTGTAAAT ACTCCCTTCC 780 

50 

ACOCATCCCA GCCCCTGGGC AOCOGGGCAG AAGGGAATCC AGGCTATGGA CCTCCCAAGT 840 

CCCCGCTtCC CGCTCCCCTC GGCGGCCCCG CCTTGTTCTG ATCTGTGTGT GAGTGTGTGT 900 

55 GAACTTCTGA AAGACAATAT TAAAGAGACT TAGTTGAAAA AAAAAAAA 948 



60 (2) INFORMATION FOR SEQ ID NO: 55: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 990 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

10 GGQGAACTGC AGTGACAGCA GGAGTAAGAG TGGGAGGCAG GACAGAGCTG GGACACAGGT 60 

A1GGAGAGGG GGTTCAGCGA GCCTAGAGAG GGCAGACTAT CAGGGTGCCG GCGGTGAGAA 120 

TCCAGGGAGA GGAGCGGAAA CAGAAGAGGG GCAGAAGACC GGGGCACTTG TGGGTTGCAG 180 

15 ■ 

AGCCCCTCAG CCATGTTGGG AGCCAAGCCA CACTGGCTAC CAGGTCCCCT ACACAGTCCC 240 

GGGCTGCCCT TGGTTCTGGT GCT T CTGGCC CTGGGGQCOG GGTGGGCCCA GGAGGGGTCA 300 

20 GAGCCCGTCC I^SCTCGAGGG GGAGTGCCTG GTGGTCTGTG AGCCTGGCCG AGCTGCTGCA 360 

GGGGGGCCCG GGGGAGCAGC CCTGGGAGAG GCACCCCCTG GGCGAGTGGC ATTTGYTGCG 420 

GTCCGAAGCC ACCACCATGA GCCAGCAGQG GAAACCGGCA ATGGCACCAG TGGGGCCATC 480 

TACTTCGACC AGGTCCTGGT GAAOGfljQGGC GGTGGCTTTG ACCGGGCCTC TGGCTCCTTC 540 

GTAGCCCCTG TCCGGGGTGT CTACAGCTTC OQGTTCCATG TGGTGAAGGT GTACAACCGC 600 

30 CAAACTGTCC AGGTGAGCCT GATGCTGAAC ACGTGGCCTG TCATCTCAGC CTTTGCCAAT 660 

GATCCTGAGG TGACCCQGGA GGCAGCCACC AGCTCTGTGC TACTGCCCTT GGACCCTQGG 720 

GACCGAGTGT CTCTCCGCCT GCGTCGGGGG NAATCTACTG GGTQGTTGGA AATACTCAAG 780 

TTTCTCTGGC TTCCTCATCT TCCCTCTCTG AAGGACCCAA GTCTTTCAAG CACAAGAATC 840 

CAGCCCCTGA CAACTTTCTT CTGCCCTCTC TTGCCCCANA AACAGCANAA GCAGGANANA 900 

40 NACTCCCTCT GGCTCCTATC CCACCTCTTT QCATGQGAAC CTGTGCCAAA CACCCAAGTT 960 

TAAGAAAAAA ATAAAACTGT GGCATCTCCA 990 



25 



35 



45 



(2) INFORMATION FOR SEQ ID NO: 56: 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 1603 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

GGTCGACCCA CGCGTCCGGC CCXSCCGGCTC CGGAGCGGCT CTGCCTTCCC GAGCGCGGGA 60 
CCGCGCCCTC GGGGAGGAGG GCGAACGACG CQGCGATQGC TCCGCGGGCA CTCCCGGQGT 120 
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CCGCCGTCCT AGCCGCTGCT GTCTTCGTX3G 
CGGACAATGG GAGCAGCCGC ACATTGCACT 
5 ACX5ATACTGG GAATGGACAC CCAGAATATA 
TCATCGGTCT CTTTGGCX3TC CTCATTTNGC 
GTACAACAGA AGCAGAGCAA GATATCGAAG 

10 

GACAGTOTGA ATGAAAACAG TGACACTGTT 
GAAGC33AATG CTGATGTYTT AAAGGCGATG 
15 AGCCCCGTGA CCCCCAGCAC ACCAGGGAGC 
GGGGGACGCC AGGGAAGCAC GTCTGTGGCC 
AGAGGGATGT GTGTCATCGG TGTAGGCACA 

20 

AGTCCAGAGA GAGCAGACCA CGGCGCCAAG 
TTAGAGTNAC AAAAGTGGAG CACAAGTCAA 
25 TTAGTGGGGC TGAAACCGTC AATGGGGAGG 
GTX3GCACAGA GTAGCAGGTG AGCCGTGGTT 
GTGAGGAGAA GGTACTTGGA GCCTCCCAGG 

30 

AGGGAAGTGG GAGAGCTTTC CTTGACCCAG 
TGTCTC3CCTA GAGCTTCTTG TAAAGAAGTC 
35 TGTGATAATG AGGATAGAGG ATTACTTGTG 
AACTAGAATT CACATCACCC ACCAIATAGG 
AGTGTTGCTG CATCTTCTTA CGCAAAAAAG 

40 

ACTGATTTTC GATATTGGCA GCTTACTTTT 
CTTGTAATCT TGTCACCATT TTTAGGTAAA 
45 ACCAATGCTT TTTCCTTTTA TTCTGTTGGR 
ACCTCAATAC GAATATTTCT CTTCCCACCA 
GATTTATTTC TTGATTTGGC AATTTTAATT 

50 
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GAC5GCGCCGT GAGTTCGCCX; CTGGTGGCTC 180 

CCAGAACAGA GACGACCCCG TCGCCCAGCA 240 

TTGCATACGC GCTTGTCCCT GTGTTCTTTA 300 

CAMCTNGCTT NAAGAAGAAA GGCTATCGTT 360 

AAGAAAAAGG TTGAAAAGWT AGRATTGAAT ' 420 

GGGCAAATCG TCCACTACAT CATGAAAAAT 480 

GTAGCAGATA ACAGCCTGTA TGATCCTGAA 540 

CCGCCAGTGA GTCCTGGGCT TTGTCACCAG 600 

ATCATCTGCA TACGGTGGGC GGTGTWGTCG 660 

AGCGGTGGCA CTTTATAAAG CCCACTAACA 720 

GCGAGGTCAC GGTCCTTTCT GTTGGCAGAT 780 

ACCAGAAGGA ACGGAGAAGC CTGATGTCTG 840 

TGCCGGCAAC ACCTGTGAAG AGAGAACGCA 900 

TTGGTGACAT TQGGGGCAGA GTGGTGCAQG 960 

TGCTGTGGCA GCATAGGAAT GGTATTTGAC 1020 

GAAGACTGAG GGGGACTGAA CATGATTACT 1080 

ACAAACTTAG TGCCTCCAGG GGCTTGGCTG 1140 

AGGCAATGTG GCATGGTGGG GATTGTGGCA 1200 

GCTTGCATTA CCACGAGGCA GAAAGCACCT 1260 

ACAAAATCCA GACTTCTAAA ATGTAAAATC 1320 

TTTTTTTAAA CAACCATGCA GQCCAAATGA 1380 

CTGTGACTTG AAAAAGTCTG GAGCAAACAA 1440 

AACCAGTTTT CTTTGTGTCA CAGTTVTGAA 1500 

AATATTITCA GGCAATTGAA AAGCCACAGT 1560 

TTGCAAGACA ATT 1603 



55 



(2) INFOBMATIQN FOR SEQ ID NO: 57: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1052 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
60 (D) TOPOLOGY: linear 



wo 98/54963 



PCT/yS98/11422 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
TACAGCTCAG GATGCCTGTA ACATTGTCAT CTCTQGGCTT CTGGGTCCTG CTTAGCCTGC 60 

5 

TTTTTCCCTG GAGGACTGAC CAGGGATGCG (XTCCAGCAAC ATGTTACTAA ATCATACTCT 120 
CCTCCCTACC TTTCCCAGAC CTCTCACTCC TGCCTGCnGT TCCAACCCGT TCTGTGGCCA 180 



10 GAGTATACAT TTTGGAACCT CTTCGAGGCC ATCCTGCAGT TCCAGATGAA CCATAGCGTG 240 



CTTCAGCAGN AAGGCCCGAG ACATGTATGC AGAGGAGCGG AAGAGGCAGC AGCTGGAGAG 300 



GGACCAGGCT ACAGTGACAG AGCAGCTGCT GCGAGAGGQG CTCCAAGCCA GTGGGGACGC 360 

' 15 

CCAGCTCCGA AGGACACGCT TGCACAAACT CTCGGCCAGA CGGGAAGAGC GAGTCCAAGG 420 



CTTCCTGCAG GCCTTGGAAC TCAAGCGAGC TGACTGGCTG GCCCGTCTGG GCACTGCATC 480 
20 AGCCTGAATG AGGCTGGCCA CCTGCCACTT TGCCCTGCCC TCTGCCTCCA QGGCTCCMCT 540 



MYCCTTCCTT TTCTTGGTGA AAGGCACCTC CTTTCCTGAT AATGAATGGT GTTCCCTTTG 600 



CTTGGCTGGG GAGCCCCCCA GGCCAGGTTT GCTGGCCATA GATACCTTTG GGCTGCCTGR 660 

25 

. GACAGGCTCC TGAGGAGGAT TGAQGGTGAA AGTCTCCCAC GAGTACACTA AACCTAGGTC 720 



TGGTCACCAA TAGGGTTTGG AGAGCAAAGG GCCACAACTC ATCAGCTGCC TGTCTCTTAG 780 



30 ATGCACTTTC TTTTTCCACC AGCACATCCT TCAACACACA GAATTTCAGG GAAGAGTTCT 840 



CCCCAAAACC CTAGCTCTTT ACCCTTCCAT TTTAGCCTTC CACCCAGCTT CCACAAAAGA 900 



TTTGGCTCTA CCTTGGATCT GCTAGTAAAT AACTAATAGG CAQGCAGTTA TTTGQGTAAG 960 

35 

' GAAAAAAGGG GTGGGAGAGA CAGAAAATTT GCCCACTGCT GCTCCTCCCC TTGGSTYTCC 1020 



ACCTGGGATT TGCTATTGAA TCTCTACCCT NN 1052 



40 



(2) INFORMATION FOR SEQ ID NO: 58: 



45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDMESS: double 

(D) TOPOLOGY: linear 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
ACNCGNTGGC GGCCGCTCTA 6AACTAGGGG ANCCCCCGGG CTGCAGGAAT TCGGCACGAG 60 



55 CATAGACTTT TAAACTGGTA CGGTTCTTAG AGATGGTCCT TGGCCTTCTG TTGTTGTTGT 120 



KGTTTTTTTC TTTTTCTTCT TCTCCTTCTC CTTCTTCTTC TCTTCTCCTT CTTTCTTCTT 180 



TTTTTTTTCA GAGTCTTGCT CTGTCACCAA GACTGGAGTG AAGTGATGTG ATCTCGGCTT 240 

60 
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ACTGCAACCT GGGAGGCAGA GGTTGCAGTG AGTCGAGATG GTGCCATTGC TCTCGTTTCG 300 

GCAACAAGAG TGAAACTCTT GTCTCAAAAA AAAAAAAAAA ATGAGGTTTA AGACAGITTT 360 

5 GTCATTACTG GTGQGATCTG GTCACACAAG ATAGCATTAA ACGTGACATG GCACATAAAA 420 

TTGGTTAAAA AATTTTGTTT TTTAATTACG TAATGTAAAA GCCCAACAAA CACTTTATGC 480 

AAGATTGGAA TGTATCTTCA AATTCAGATT TAATAAACAT GTAAAGATCC TCTGTATATA 540 

AAAGTTGTAT TTAATCCCTT GTGCCCCAAG AATGCTATAA AAGATCCCAA GAATGTTATC 600 

TATGAAAAGA TAGCAATAGG GAATGGTGAA CAAATAATTT AATTTGCCAA TTCTAAAAAA 660 

15 CATGGACTTA AACCCCATGA AAACTTGGTT CCATAGTTTT AACTGTTTTA TGGTTCCAAT 720 

ACAAAACCAG AGTGGTTTAC ATTCCACAAT NACCAAATTT GCATCCAATN TTGGGGTAAT 780 

TTTNGGTATT TGCCATGGGA TACTATTCAT TTTT 814 

20 

(2) INFORMATION FOR SEQ ID NO: 59: ' 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1215 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS: double 
30 (D) T0POU)GY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

AGAGGAAGTC TTTTGCCAAG CCTGTTCTCT GGACTAACGC CATCCAGGCT GGGAGGGGAA 60 

35 

GAGTGCTCTG CTACACTCGT CCCCCTCCTG CCTCATCTTC CTTCTCAGCC TTGGTTCCTC 120 

. ATQGGAACAG AATQGAGGGC CTGAGAACAT ACTTTCTAAA TGCCTTTGAC CCAGGAACCG 180 

40 ATTATCTATA TTTGTTCCCA TTTTCCTTCA CCGTGACATT CCAGCATTGT CTCACTGTCA 240 

GGTGQGCCTT TGAGAGCCTC CAGGTTCCTC AAAACAGGCC TGAGCGATQG GCATCACACC 300 

CTCTGCCTAC CCACRTQCCT GCTTACCTGC CAGATAACCA AGTGNAGATG TCTGCGAGTC 360 

45 

GCTAGTTTTC ACATTCTTAC TAGTGTTTGG YTCACCTTTG GGCAAAGGCC CCCTCTAGGC 420 

CTTGCCCCAC CTCCATCAAA CGCAGACACT GTAGTCAGAC CTCAGYAATA TAGGAGGCAA 480 

50 TAATCTTTTA ACAGTGTTTT GCAAACAAAC AAAAAGAGAA AAATCCCAGC CAGGGGAACT 540 

CGCCACCTGC CCACGCTAGT TCCATCCACG CTCAAGACCC GCCCTTAGAC CAGGCAGGCA 600 

AAGGCCCCCA TCACACTCGG CCACTAGTCG GGTCCTGAG6 CCAAGAAAGA AACCAGACCC 660 

TGTATGACAA GTTQGGKTCT TTCCAGAACA CGACAGAAAC AGQQGGGGCC CCTTGTTAAT 720 

GCCACTCCAT ACTCCAGAAG CATTATTCCT TATTTGGGAC AGCCAAGGGC AGATTCACAG 780 

60 GTTATTGTAG GAATAAAGAC TAGTTTACAA AGGARAAAGA GSCCCTGGAC TTCCCMAGGA 840 
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AAGGTCAGGT TAGGGCTCCT GTACCXATTC TGTTCCACCA CTGTTTGATC TXrrCTGGCCT 

ccx:accagga atgccxstttc cmrrATGG atctcttggg aaccagagag aatcaacaga 

TCAATGACAT AGGATCCX5AA GTGCAATGAT AGTCACTTCT AGTTTGGCAT TTCACAAACT 
CTGNACAGCA AGGTATTGGT AGGTTACTCA ATTTCAAAAG GGCCCCATGG CCAAATATGT 
TTAGGAACCG CTGTTTGNAT TTCTTTTTTT GGAGACGCAT TGTATATAAT ATATGTCAAA 
GGCTTTCX3GA ATTCCTGCAG GAAAGAAATC AGCTTTGTTA AATCCNAAAA AAAAAAAAAA 
AAAAAAATAG ACTCG 

(2) INFORMATION FOR SEQ ID NO: 60: 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 478 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»J; SEQ ID NO: 60: 
ATTTCTTATG ACATGGGGGT TTGAATrGGT TGGCAAATGT TTAATTTTAA TATCCATAAT 
CAGTGAGGTC CTGCTGGCTG TAATCATTAA TTGTGAAATC TAAGGAGCTT AGTTCATCGC 
TCTAGAATTT CACAGAAAAR TGYGMPATGA TACGAGCATT AAGTTTATTT CTTCTGATCT 
TTGATGCAGC TTTGTTCAGT TTATCTGTTT TTGTATTTAT TGGTCATCTA CTTCCCATOC 
CAAAAGGGAC TGGTCTACAT AGCTQCGCTA AACACCTGAT CAAATCACTA AAAGAAAATG 
TGTTACCTCT AATGAATTAT CCTGATTGTA AGTTAAAAAT CAATATTTCC CCGTAGTCAG 
GTTTGCTTTT TAAAAAGAAK KCTTAAAAAA AAAAAAAAAA AAACGACHTN AAGAAAAGGA 
AGCAAGCTCA GGTAAGGTGC ACACATTGGG CTAAGGAAGC TAGAGCCTGT QGAGANGC 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 618 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIFTICN: SEQ ID NO: 61: 
TATGACCTTG ATAACCCCAA GTTNGAAATT AACCTTCAMT AAAGGGAACA AAAGCTQGAG 
TTCGCGCGCT TGCAGTTCGA CACTAGTQGA TCCCAAAGAA TTCGGCACGA GTCATAATGA 
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GCTACTAGGT AAGCCTTCTG GGACTTTCAG ATATTTTGGG GAAGATTGAT TTTTGTTCTT 
ACATGCTGTG GACCCTTGGC CATCAAATGG TATGGGGAAG CTCATCCGTC TGTCTCTGAT 
GGTCATGTCA GTCAGGCGTC TTTTTAGTAT TTACTGGGTG CTCAGTACTG TGCCAGATGC 
TGTCQGGAGC CGTOGTCdTA TGGAGGAGGA GTGCTCCAGA GGACTCTGCT GTGTQGCAGG 
CCAGCATAAA CAAGCCAAGG GGAAAAGGCA GGCATGGAAT AAAGGGGGAG AATACCAGT6 
TGTGACTTAC TGCTGACTGT GTGGATTAGC CTATCAGCAG TAATCAAGCA GGGCGGAGGG 
CATTATCTTT GAGCCAGAAG AGTGAGCACT GGSCCGAGGG TGGAGCATCA AGAGGGGGTG 
TAGGACCNCA AGGCTTCTTN CNGGGGAGAC AACGTCAATA AGCNGTCAGT AGTCACCGAC 
AGTTTTGGGA AGCAAGGG 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOCSY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
TCGACCCACG CGTCCGAGGA GCTGGACTTC TGAGACAGCC ATTCTCCTTG CATAGCACTG 
TCTGCTGCTA CAGCTCATAG AAGTCAACAA rTTTCTTCAA CACTGGTAGG CAGCCTCTAA 
ATGGCCCTGA TCACCCTCAC CTCCTGCCAT TCACACCNNT GTAAAATTCC ACCCCTGGAC 
CTAGTGACTC ACTTCTAACA ANGAlGAATAC AGCAAAAGTA ACATCGCTTC TGAGGTGAGG 
CTACAAGGAG ACTACGATGC CTGCCTTtSGT CACCCTTCTC CTGCTCTTTC CATTGCTCCC 
TCTGATGGAA GCCAGTTGCC ATGTGATGAG GTGCCCTATG GAGAGGCCCA CGTGACAAQG 
TATTGTAAAA AGCCTCTGAC CAATAGCCAT CTAGAAACGG AGGCCCAGTC CAGCAGCCTC 
TGAGATGAAT CCTGCCAACC TGAGCTTGGA GACAGATTCT CTCCCTATCC TGCCTTGGGA 
TGATCACAGC CACCACCAAC ACCTTCACTG CCTQGTGAGA GGCCAAGCCA GTGAACCCAA 
GGTAAACTGG ACAGAATCCT GACCCACAGA AACTGAGATA ATGTTTGTTA TTTTAAGCTG 
CTCAGTTTGT TACAGAGCAA TAGATAACTA ACTCAAACAC CATAAAATTC TAATATTTTA 
TTCTATCACA CAAACCAGGT AATACCAAGT AAATGCCATT ACTATACACA TATTITTGTA 
ACACAATTAC ATGTGATTTT TTAAGAAGGC T 
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(2) IIZr'C-rl'EVnON FOR SEC ID NO: 63: 

(i) SSQUSTCE aS^jyvCTERISTICS: 
5 (A) ISXSTH: 780 base pairs 

(3) TYPE: nucleic acid 

(C) STRAISECNESS : double 

(D) TOPCICGY: linear 

10 (xi) S3CUHNCE E£SCRIPTIC»7: SBQ ID NO: 63: 

CNQ;C5aGTCA OJGTCCCCGA TTCCCGGGTC GACCCACGCG TCCX3GGTTGG CAACTCCTGA 60 

GGCCTGCATS GCTGACTTCA CATTTTCCTA CCTCTCCTTC TAATCTCTTC' TaGAGCACCT 120 

GCTATCCCCA ACTTCTAGAC CIGCTCCAAA CTAGTGACTA GGATAGAATT TGATCCCCTA 180 

ACTCACTOTC TGCGGTGCTC ArTGCTGCTA ACAGCATTGC CTGTGCTCTC CTCTCAGGGG 240 

20 CAGCyTGCTA ACGGGGCXSAC GTCCTAATCC AACTGGGAGA AGCCTCAGTG GTQGAATTCC 300 

AGGCACTGTG ACTGTCAAGC TGGCAAGGGC CAGGATTCGG GGAATGGAGC TGGGGCTTAG 360 

CTCGGAGGTS GTCTGAAGCA a^JCAGGGAAT GGGAGAGGAG GATGGGAAGT AGACAGTGGC . 420 

TGGTATSGCr CT3AGGCTCC CTGGGGCCTG CTCAAGCTCC TCCTGCTCCT TGCTGTTTTC 480 

TGATGATTTG GGGGCITCG3 A3TCCCTTTG TCCTCATCTG AGACTGAAAT GTGGGGATCC 540 

30 AGGAT-SGCCr TCCTTCCTCT TACCTTTCCT CCCTCAGCCT GCAACCTCTA TCCTGGAACC 600 

TC?rccrccc? TTCTCCCCAA CTATGCAJTCT GTTGTCTGCT CCTCTGCAAA GGCCAGCCAG 660 

CTTGGGAGCA gc::ag?.gaaat AAACAGCATT TCTGATGCCA AAAAAAAAAA AAAAAAAACC 720 

GCGGCCGAAA GCTTArTNCC CTTTAAGTAA GGGGTTAATT TTTAGCTTGG GCACTNGGCC 780 



25 



35 



40 



50 



(2) IIS'OEMATrQN FOR SEQ ID NO: 64: 



(i) S2Q0SICE Clff-RACTERISTICS: 

(A) LENGTH: 588 base pairs 
45 (3) TYPE: nucleic acid 

(C) STRAIIDEDNESS: double 

(D) TOPOIjOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
TTCCGAATTA ATCGACTCAC TATAGGAAOT GCCGTCGCCA TGACCCGCGG TAACCAGCGT 60 
GftCCICGCCC GCCftGAAGAA TATGAAAAAG CAGAGCGACT CGGTTAAGGG AAAGCGCCGA 
55 GATGA3GGGC TTTCTSCTGC CGCCCGCAAG CAGAGGGACT CGGAGATCAT GCAGCAGAAG 

CAGA?.AAAC-G CAAACGAGAA GAAGGAGGAA CCCAAGTAGC TTTGTGGCTT CGTGTOCAAC 240 



60 



ccrcriGccc ttcgccigtg tgcctggagc cagtcccacc acgctcgcgt ttcctcctct 



120 
180 



300 
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AGTGCTCACA GGTCCCAGCA CCGATGGCAT TCCCTTTGCC CTGAGTCTGC AGCQGGTCCC 
TTTTGTGCTT CCTTCCCCTC AGGTAGCCTC TCTCCCCCTG GGCCACTCCC GGGGGTGAGG 
GGGTTACCCC TTCCCAGTCT TTTTTATTCC TGTGGGGCTC ACCCX:AAAGT ATTAAAAGTA 
GCTTTCTAAT TCCAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA AAAANNCGGG GGGGGGCCCC CCCCCCCC 

(2) INFORMATION FOR SBQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 774 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOIiOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
TTTAAAGATG AAGAAATGAC AAQGGAGGGA GATGAGATGG AAAGGTGTTT GGAAGAGATA 
AGGGGTCTRA GAAAGAAATT TAGGGCTCTG CATTCTAACC ATAQGCATTC TCGGGACCGT 
CCTTATCCCA TTTAATTAAT TTCTCTGACA ATTCAATTAT TTTCTGTTAT TAATGTTGCC 
ACTGCTTTCT GTTTGTCTGC ACTTTCTTGA TAAATATTTG CTATCGTTTT ACTCCAGTCA 
TTCGATCTTC CTCAGATTTA CATATGACTC TTGTCAACAT CTCATCTTTT GACCCAATCT 
TATTCATTTA ATAAGAGGTC TCATTCATTT GCATGGAAAA ATGCTCATTG TATATTGCAA 
AGTGAAAATA ACGAGTTGCA AAACAGTGTA TACATATATG TGTGTATATA TGTACACTTT 
ATTTOTACAT TTCTATGTCA CATAATQCAA AGGAAAGTGT CTGATTTTAT TATACACCAA 
AGGTTAACAG TCAATCTCTG TGTGATCTCT TTTTTTTTCT TTTTQCCTAT CTQCATCTTC 
TCACTTGCCA AAAAATGAAT ATATGTTTAT GTGTGrTATAT TACTTGTGTC ACAAAAAACC 
CTAAAGTAGA CAGTAAAAGA ACTTGTCAAT CGCCTTTGGA AGGCAATGAA ACACTTAATA 
AACTCTCAAT AACAGAAGCG TAAAAATGAA ATGTAAACCT CCAATTACCT CTGGATCTCT 
TAGCCAGAGT AATAAACTGG TAATTATTAC AGATAAAAAA AAAAAAAAAA AANA 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 1866 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTIOM: SEQ ID NO: 66: 

ACCCACGCGT CCGGTCCTCT TCTTCAGCAC ATGCCAAAGC TGTTCCTCAC GGCCTGTGAG 60 

5 ACAAGAGCAT CTTGGATGTA GGACAATGGA AGAGTTAGAT GCCTTATTGG AGGAACTGGA 120 

ACGCTCCACC CTTCAGGACA GTGATGAATA TTCCAACCCA GCTCCTCTTC CCCTGGATCA 180 

GCATTCCAGA AAGGAGACTA ACCTTGATGA GACTTCGGAG ATCCTTTCTA TTCAGGATAA . 240 

10 

CACAAGTCCC TTCCCGGCGC ANTCGTGTAT ACTACCAATA TCCAGGAGCT CAATGTCTAC 300 

AGTGAAGCCC AAGAGCCAAA GGAATCACCA CCACCTTCTA AAACGTCAGC AGCTGCTCAG 360 

15 TTGGATGAGC TCATGGCTCA CCTGACTCAG ATGCAGGCCA AGGTTGCAGT GAGAGCAGAT 420 

GCTGGCAAGA AGCACTTACC AGACAAGCAG GATCACAAGG CCTCCCTGGA CTCAATGCTT 480 

GGGGGTCTSG AGCAGGAATT GCAGGACCTT GGCATTGCCA CAGTGCCCAA GGGCCATTGT 540 

20 

GCATCCTGCC AGAAACCGAT TGCTGGGAAG GTGATCCATG CTCTAGGGCA ATCATGGCAT 600 

CCTCAGCATT TTGTCTGTAC TCATTGCAAA GAAGAGATTG GCTCCAGTCC CTTCTTTGAG 660 

25 CX3GAGTGGCT TGGNCTACTG CCCXAACGAC TACCACCAAC TTTTTTCTCC ACGCTCTGCT 720 

TACTGCGCTG CTCCCATCCT GGATAAAGTG CTGACAGCAA TGAACCAGAC CTGGCACCCA 780 

GAGCACTTCT TCTGCTCTCA CTGCGGAGAG GTGTTTGGTG CAGAAGGCTT TCATGAGAAG 840 

30 

GACAAGAAGC CATATTGCCG AAAGGATTTC TTAGCCATGT TCTCACCCAA GTGTGGTGGC 900 

TGCAATCGCC CAGTGTTCGA AAACTACCTT TCAGCCATGG ACACTGTCTG GCACCCAGAG 960 

35 TCCTTTQTTT GTGGGGACTG CTTCACCAGT TTTTCTACTG GCTCCTTCTT TGAACTGGAT 1020 

QGACGTCCAT TCTOTGAGCT CCATTACCAT CACCGCCGGG GAACGCTCTG CXrATGGGTGT 1080 

GGGCAGCCCA TCACTGGCCG TTGTATCAGT GCCATGGGGT ACAAGTTCCA TCCTGAGCAC 1140 

40 

TTTGTGTGTG CITrCTGCCT GACACAGTTG TCGAAGGGCA TTTTCAGGGA GCAGAATGAC 1200 

AAGACCTATT GTCAACCTTG CTTCAATAAG CTCTTCCCAC TGTAATGCCA ACTGATCCAT 1260 

45 AGCCTCTTCA GATTCCTTAT AAAATTTAAA CCAAGAGAGG AGAGGAAAGG GTAAATTTTC 1320 

TOTTACTGAC CTTCIGCTTA ATAGTCTTAT AGAAAAAGGA AAGGTGATGA GCAAATAAAG 1380 

GAACTTCTAG ACTTTACAT6 ACTAGGCTGA TAATCTTATT TTTTAGGCTT CTATACAGTT 1440 

50 

AATTCTATAA ATTCTCTTTC TCCCTCTCTT CTCCAATCAA GCACTTQGAG TTAGATCTAG 1500 

GnXXTTarArC CTOjTCCCTC TACAGATGTA TTTTCCACTT GCATAATTCA TQCCAACACT 1560 

55 GGTTTTCTTA GGTTTCTCCA TTTTCAOCTC TAGTGATGGC CCTACTCATA TCTTCTCTAA 1620 

TTTGGTCCTG ATACTTGTTT CTTTTCACGT TTTCCCATTT CCCTGTQGCT CACTGTCTTA 1680 

CAATCACTGC TGIGGAATCA TGATACCACT TTTAGCTCTT TGCATCTTCC TTCAGTGTAT 1740 

60 
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ri TT GT rrrT CAAGAGGAAG TAGATTTTAA CTGGACAACT TTGAGTACTG ACATCATTGA 1800 
TAAATAAACT GGCTTGTGGT TTCAATAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1860 
5 AAAAAA 1866 



10 (2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1152 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

20 CTCAAGGATC TAAAGGCTCT GCAGATTTCG GGAGGCCTGT CTCCCAGCAC CTGATGGGAC 60 

ACTTTTTGCC CCACTGTAAA TTCTC5GGTGT ATCCTCCACT GTATGCTGTC ACCCCAAGGG 120 

CAAGCACrcC ATCTGCTTAG TGAAGGATTT ATTGTTCGGA AGATACATTT TCCCCTTKAG 180 

25 

CAGAGAGTCG CGTATCCTGG CAGTCTTCGG TGAGCCAGTT GTACCAGGAT TATGAAATGC 240 

AGATGTTTAC TGTGTCATTG TTGCTGTCAT TGCTACTGAG GAGTACTGAC CAGAATCATC 300 

30 TGCAACTYTT AGTTGGCAGA GAGGACCACT ATGGOGQGTA GCTCTTTTCT TTCCTGCCAT 360 

TGTGGGGATG ATTCCAGGCC AAAGATGATG GARAAGTATG GAAATCATCT GAAAGGTTGA 420 

AGCTTGQCAC GTCAAGCCAT TCATGACTTT GTAAGGCAGT TTTGCTGAAG GCCAGTTCTG 480 

35 

CCCTCGGAGG GACGGAGGTG AATCCTCCTG AGTACCTGTG GTTTTCTTAC TTCCTGCTGA 540 

ATTTACCTAA GTGCCTGTTG TTTGCTTGCT GTGGAGGCTT TCTGGTATTT CATTTCAGGT 600 

40 GCAGATGCCT TCACTTTCCC ACCRAAAAAA CCCCMACCAA ACCTAAGACC TTACTGCAAC 660 

TAAGTYTOCC AAGTACTTTT TAACCCAATG GGATGAACAG CCTGTGGTCT GCTCAGATCA 720 

CCCTGAGTCC GTGTGAGAAG GOfTNGGCTT TGCCAGGAAA TCCAGGAAGG . CAGGGCCGGG 780 

45 

CTGTGTTGGA AGCTQGCTTA GCTQGTGGGG CAGCCTTATT TCAATTAAAA QGGCATTGAC 840 

TCGGAGCAGC AGTCCTQGAG TTTGTTGCAT TTCCTATTGC CCTCAAAATG AGAAACCAQG 900 

50 AAAATAQCAG ATTGGAGCCT TCGAGAAGGC AGTAAATGGC TGTTTTTATT GACAAAAGGA 960 

AAACATTTTA CTGCCATCTC ACTGATGGCA TCTCACTGAC TTAAAATGAA GGCANGTTGT 1020 

AGTAAAAAAA AAAGICTACA TTTTTCCACC GCCACGTTCT TATATCCTGT TTGTCAGCCA 1080 

55 

CTGCTCANAA GGGCA'TOTTG TCTTGCGGAN TANAGGCGCT CTCCTTCCCT OGTTTTOCCT 1140 

ATAGGTTGOG TG 1152 
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(2) INFORMATION FOR SEQ ID NO: 68: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 2483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

AGCAQGCGGT GCGCTGGGGG CGGGAGCAGC GCGKAGCCCG GCTCGGCCAC ACCGATOGCC 60 

15 CGCCGCCATG GGCTCCTCGC AAAGCGTCGA GATCCCGGGC GGGGGCACCG AGGGCTACCA 120 

CGTTCTGCGG GTACAAGAAA ATTCCCCAGG ACACAGAGCT GGTTTGGAGC CITTCTTTGA 180 

TTTTATTGTT TCTATTAATG GTTCAAGATT AAATAAAGAC AATGACACTC TTAAGGATCT 240 

20 

GCTGAAASCA AACGTTGAAA AGCCTGTAAA GATGCTTATC TATAGCAGCA AAACATTGGA 300 

ACTGCGAGAG ACCTCAGTCA CACCAAGTAA CCTGTGGGGC GGCCAGGGCT TATTGGGAGT 350 

25 GAGCATTCGT TTCTGCAGCT TTGATGGGGC AAATGAA7VAT GTTTGGCATG TGCTGGAGGT 420 

GGAATCAAAT TCTCCTGCAG CACTGGCAGG TCTTAGACCA CACAGTGATT ATATAATTGG 480 

AGCAGATACA GTCATGAATG AGTCTGAAGA TCTATTCAGC CTTATCGAAA CACATGAAGC 540 

30 

AAAACCATTG AAACTOTATG TGTACAACAC AGACACTGAT AACTGTCGAG AAGTGATTAT 600 

TACACCAAAT TCTQCATGGG GTQGAGAAGG CAGCCTAGGA TGTGGCATTG GATATGGTTA 660 

35 TTTGCATCGA ATACCTACAC GCCCATTTGA GGAAGGAAAG AAAATTTCTC TTCCAGGACA 720 

AATGGCTGGT ACACCTATTA CACGTCTTAA AGATGGGTTT ACAGAGGTCC AGCTGTCCTC 780 

AGTTAATCCC CCGTCTTTGT CACCACCAGG AACTACAGGA ATTGAACAGA GTCTGACTGG 840 

40 

ACTTTCTATT AGCTCAACTC CACCAGCTGT CAGTAGTGTT CTCAGTACAG GTGTACCAAC 900 

AGTACCGTTA TTGCCACCAC AAGTAAACCA GTCCCTCACT TCTGTGCCAC CAATGAATCC 960 

45 AGCTACTACA TTACCAQGTC TGATGCCTTT ACCAGCAQGA CTGCCCAACC TCCCCAACCT 1020 

CAACCTCAAC CTCOCAGCAC CACACATCAT GCCAGGGGTT GGCTTACCAG AACTTGTAAA 1080 

CCCAGGTCTG CCACCTCTTC CTTCCATGCC TCCCCGAAAC TTACCTGGCA TTQCACCTCT 1140 

50 

COCCCTGCCA TCCGAGTTCC TCCOGTCATT CCCCTTQGTT OCAGAGAGCT CTTCTGCAGC 1200 

AAGCTCAGGA GAGCTGCTGT CTTCCCTCCC GCCCACCAGC AACGCACCCT CTGACCCTGC 1260 

55 CACAACTACT GCAAAGGCAG ACGCTGCCTC CTCACTCACT GTGGATGTGA CGCCCCCCAC 1320 

TQCCAAGGCC CCCACCACCG TTGAGGACAG AGTCGGCGAC TCCACCCCAG TCAGCGAGAA 1380 

GCCTGTTTCT GCGGCTOIGG ATGCCAATGC TTCTGAGTCA CCTTAACTTT GAACCATTCT 1440 

60 
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TTCGAATTGG CGTGGTATAT TTAACCAOGG GAGCX3TGTCT GGAAACC5CAA ACTATCATTA 1500 

ATTTCATACT AGTTTCTACC GTATCTGTAG GCATCCTGTA AATAATTCCA AGGGGAAAAC 1560 

TAAACGAGGA CGTGGGTTGT ATCCTGCCAG GTTGAGTGGG GCTCACACGC TAGGGTGAGA 1620 

TGTCAGAAAG OGCTTGTATT TTAAACAACC AAAAAGAATT GTAAGGGIGG CTTGCTGCCA 1680 

GGCTTCCACT GCCX3TTCCTG GGGGTGTGCA TCTTCGGGAA AGGTGGTGGC GQGGCGTCCA * 1740 

CTAGGTTTCC TGTCCCCTGC TGCTCCTTCC GTAAGAAAAT GAAATATTCT ATGCXTTAATA 1800 

CTCACACGCA AOVTTTCTTC TACTTTGTAA GTCGTTTGCG AGAATGCAGA CCACCTCACT 1860 

15 AAACTGTAAA CGGTAAAGAG ATTTTTACTT TTGGTCTCCG TGAGTCGCAT CTCTACTAAG 1920 

GTTTACACAG GAATTCCACC TGAAGACTTG TGTTAAAGTT CTACAGCGCG CACTGTTAAC 1980 

TGAACX3TCTT TTTCTTCAGC CTATACX3CGG ATCCTTGraP TGAGCTCTCA GAATCACTCA 2040 

20 

GACAACATTT TGTAACTGCT GCTGTTGCTT TCTACATACA CTTTATAAAG TGACATTTCA 2100 

AAAGAAATAA GGTGCCACAG TTTTAAACCA GAAGGTGGCA CTCTGTGGCT CXTTTOTAGTA 2160 

25 TTATAGCTAT ACTQGGAAAG CATAGATACA GCAATAAAGT ACAGTAATTT TACTTTTTTT 2220 

CTTGTGTTAC ATCTAAATTA CAACCXmTAA TTGCCACX3TG TGCACTTACT ACTCTCCAGT 2280 

ATGTCTTATT ACTCTCCAGT ATGTCACGCA TCTTTAACTT TTCACGTCCT ATGTTTGCTT 2340 

30 

TCTCCCATTT TTAAGAGATG GTAAGTTAAC TGGAATTGAT TTACTGAATG AAATTAAATG 2400 

CAGATATCCC TGTTTTTGAA ATAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2460 

35 AAAAAAAAAA AAAAAAAAAA AAA 2483 



40 (2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNES5: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

50 GAGAAATGGA GCTTTGTTAG ATAAAAATTT TTTCAACGCA AACAGTCATT TTCCAGTGAA 60 

AGGAGAGCGT. ATCCGCCGTA GGArGGACTT AGATCGTGTA AAAGCTGAGG CCACCGAC3GA 120 

TATAACCTCC GGGGTCCTTT GCCTCCTTTT CCTTAGACTC CCTCCAAACT CGTGTATCTT 180 

55 

TCCTTCAGCA GTACTGGGCT CCAOGCGAAC CTAGTCCTTT GTCTTTACCC TATTACCTTT 240 

CATAACATCC TACTTGAAAA GTARTTATTC AACCGCGTTT GAAAATGAGA ACAGGTTCAC 300 

60 AGARGCTAGG TTACTTGCGA AGGTCGTTCA ATTAGTAACC AGTAACGCCA GGACTGCCAG 360 
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TTTCTTGCTT CCGAATTCTC ATGOTAGCTT TCACCARGCT CCCCGTCMAA TGCTAACGTC 420 

AACTACTCAA CTAGATTAGC AAAAAGGTCT TTTAACAGAA TTCCTCCSTTT TCAGAGAGAG 480 

5 

TTTCTTTCAT GAAGCGCCCC ATTTCTACAG AGGAAAATAA ACTCCAAGCA GCCAGT 536 



10 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 865 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEENESS: doiable 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

20 

CCACGCGTCC GGCCTTTCTT GGCCAGAGGC GCCGGTTGGA CTCACGGGCG GGGCATGATG 60 

GGTAACAGGA CCGGTGGGGT CCCCAGGAAG TCCTAGAGGG GGTCGGGGTT TGGGTGGACA 120 

25 AGCTTTCCTC GTCCTCTCCC GACAGAGCTG ACGTGTCCTG GGTTCCACCG GGAGCGGGCA 180 

TTTCCACCGG ACGGGAGGGT TCGGGGTGTC CGGGGCTGGG GAATACGTAG GGGTTGCCGC 240 

GCGGTCTQGG GAGTTGGGGC GTGTGGCTGC AGTCCCGGGA GTTCTTGGAG GGGGTCGGCC 300 

30 

CACCGAGCTT CCGGACCGGC TGATCTGCCC GTAGCTTGCC GGANC3GARGG CGGAGCTGAC 360 

TCTCCGTCCC TTCTCCCATC CCCTCCAGTG GTGGGTACGG GCACCTCGCT GGCGCTCTCC 420 

35 TCCCTCCTGT CCCTGCTGCT CTTTGCTQGG ATGCAGATGT ACAGCCGTCA GCTGGCCTCC 480 

ACCGAGTCGC TCACCATCCA GGGCQGCCTG CTTQGTTCGG GTCTCTTCGT GTTCTCGCTC 540 

ACTGCCTTCA ATAATCTGGA GAATCTTGTC TTTGGCAAAG GATTCCAAGC AAAGATCTTC 600 

40 

CCTGAGATTC TCCTOTGCCT CCTGTTGGCT CTCTTTGCAT CTGGCCTCAT CCACCGAGTC 660 

TGTGTCACCA CCTGCTTCAT CTTCTCCATG GTTGGTCTGT ACTACATCAA CAAGATCTCC 720 

45 TCCACCCOCT AOCAGGCAGC AGCTCCAGTC CTCACACCAG CCAAGGTCAC AGGCAAGAGC 780 

AAGAAGAGAA ACTGACCCTG AATGTTCAAT, AAAGTTGATT CTTTGTAAAA AAAAAAAAAA 840 

AAAAAAAAAA AAAAAAAAAA AAAAA 865 

50 



55 



(2) INFORMATIMI FOR SEQ ID NO: 71: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 932 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEIKESS: double 
60 ID) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
TCATCATATA CAAAGTTTTT CGTCACACTG CAGGGTTGAA ACCAGAAGTT AGTTGCTTTG 
AGAACATAAG GTCTTGTGCA AGAGGAGCCC TCGCTCTTCT GTTCCTTCTC GGCACCACCT 
GGATCTTTGG GGTTCTCCAT GTTGTGCACG CATCAGTGGT TACAGCTTAC CTCTTCACAG 
TCAGCAATGC TTTCCAGGGG ATGTTCATTT TTTTATTCCT GTGTGTTTTA TCTAGAAAGA 
TTCAAGAAGA ATATTACAGA TTGTTCAAAA ATGTCCCCTG TTGTTTTGGA TGTTTAAGGT 
AAACATAGAG AATQGTGGAT AATTACAACT GCACAAAAAT AAAAATTCCA AGCTGTGGAT 
GACCAAICTA TAAAAATGAC TCATCAAATT ATCCAATTAT TAACTACTAG ACAAAAAGTA 
TTTTAAATCA GrrTTTTCTGT TTATGCTATA GGAACTGTAG ATAATAAQGT AAAATTATGT 
ATCATATAGA TATACTATGT TTTTCTATGT GAAATAGTTC TGTCAAAAAT AGTATTGCAG 
ATATTTGGAA AGTAATTGGT TTCTCAGGAG TGATATCACT GCACCCAAGG AAAGATTTTC 
TTTCTAACAC GAGAAGTATA TGAATGTCCT GAAGGAAACC ACTGGCTTGA TATTTCTGTG 
ACTCGTGTTG CCTTTGAAAC TAGTCCCCTA CCACCTCGGT AATGAGCTCC ATTACAGAAA 
GTGGAACATA AGAGAATGAA GQGGCAGAAT ATCAAACAGT GAAAAGGGAA TGATAAGATG 
TATTTTGAAT GAACTGTTTT TTCTGTAGAC TAGCTGAGAA ATTGTTGACA TAAAATAAAG 
AATTGAAGAA ACACATTTTA CCATTTAAAA AAAAAAAAAA ACTNGAGGGG GGCCCGGTAC 
CCAAATCGCC.GCATAGTGAT CGTAAACAAT CT 



(2) INFOTMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 996 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
CGCCTGGCAC CATGAGGACG CCTGGGCCTC TGCCTGTGCT GCTGCTGCTC CTGGCGGGAG 
CCCCCGCCGC GCGGCCCACT CCCOOGACCT QCTACTCCCG CATGCGGOCC CTGAGCCAGG 
AGATCACCCG CGACTTCAAC CTCCTGCAGG TCTCGGAGCC CTCGGAGCCA TGTQTGA6AT 
ACCTGCCCAG GCTCTACCTG GACATACACA ATTACTGTGT GCTGGACAAG CTGCGGGACT 
TTGTOQCCTC GCCCCCGTCT TCGAAAGTGG CCCAGGTAGA TTCCTTGAAG GACAAAGCAC 



QGAAGCTGTA CACCATCATG AACrCGTTCT GCAGGAGAGA TTTQGTATTC CTGTTQGATG 
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ACTGCAATQC CTTGGAATAC CCAATCCCAG TGACTACGGT CCTGCCAGAT CGTCAGCGCT 420 

AAGGGAACTG AGACCAGAGA AAGAACCXrAA GAGAACTAAA GTTATGTCAG CTACCCAGAC 480 

5 TTAATCGGCC AGAGCCATGA CCCTCACAGG TCTTGTGTTA GTTGTATCTG AAACTGTTAT 540 

GTATCTCTCT ACCTTCTGGA AAACAGGGCT GGTATTCCTA CXTCNGGAACC TCCTTTGAGC 600 
ATAGAGTTAG CAACXATGCT TCTCATTCCC TTGACTCATG TCTTGCCAGG ATGGTTAGAT ' 660 

10 

ACACAGCATC TTCATTTG6T CACCTAAAAA GAAGAAAAGG ACTAACAAGC TTCACTTTTA 720 

TGAACAACTA TTTTGAGAAC ATGCACAATA GTATGTTTTT ATTACTGGTT TAATGGAGTA 780 

, 15 A'TCGTACTTT TATTCTTTCT TGATAGAAAC CTGCTTACAT TTAACCAAGC TTCTATTATG 840 

CCTTTTTCTA ACACAGACTT TCTTCACTGT CTTTCATTTA AAAAGAAATT AATGCTCTTA 900 

AGATATATAT TTTAYGTAGT GCTGACAGGA CCCACTCTTT CATTGAAAGG TGATGAAAAT 960 

20 

CAAATAAAGA ATCTCTTCAC ATGARAAAAA AAAAAA 996 



25 





(2) INFORMATION FOR SEQ ID NO: /j: 




30 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 785 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEI^SS: double 

(D) TOPOLOGY: linear 




35 


(xi) SEQUENCE DESCRIPTI<»J : SEQ ID NO: 73: 
GGCACGAGGG GCTTTGCGTA CACAATAGCT GCTAGGAGTA CCCAAAGCCT GARTACARCC 


60 




TGCTGGTOTC A1GGCCACGT GTGAGCAGGC CAGCGTCAMA CGGCTCGCTG TGACCCGTCC 


120 


40 


CGRAGACTCA AATOQGCCTG QGTCTTCTCC TRGTCCTGTG ATWAAASlfi'CC TCTCTl'GAAA 


180 




GTGGAGAGCA AAGGCACACA GAGGTGCXKX3 CTCACAAGAA TTCCTCCCGG TGACTGQGTA 


240 


45 


ATCAATGTTA CTGCTCTTTC CPTTGCAGGA AAGACCACAG CAAGATTCIT TCATTOSTCT 
CCTCCTAGCC TGGGGGACCA GGCTCGAACT GACCXnXSGAC ATCAAAGGAG QGATTATCTG 


300 
360 




GCTCCTAAAG CCATCGGCCC ACAGCCCTGT TCACRTCTTG GfTGCTTCTCT TTCCCAGAGG 


420 


50 


CTCGICCCAG CCAGGCACAC ACAAAAGGCA GATTCTCGTA AACSCAGCCT CCCTCCCTGG 


480 




AGGCTGCCTC CTGCCCTGGA TCTGGAGTGG AGCTGCTCTG AGATTTTGAG TTCTTCTGCA 


540 


55 


GAGATCATTA AATATATCCA AGAGACATTG GAAAACCTGC T6AACATTTT ACATTGGTCT 
GCTCAGCACA TGGCTGGATG CGGATAITTC TATAATTCCA GAAAGTrCACA CAGCTCCTCT 


600 
660 




GTATCAGACC AGTGGGCGCC ATTTAAAAGA ACAGGATGAG AATCTAAGAT ATATTATTAA 


720 


60 


TAAATGTAAT GGATTTTTTT TTTGTAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 


780 
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AAAAA 



785 



20 



<2) INFORMATION FOR SEQ ID IK): 74: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 1069 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

TCCTCACCAT TCCCCTAGGN CAGGTCCCTG CAGGTCCCAC ACTTCTCCCA GGTCCCTAAA 60 

CTTCQCTCGG TCCTTTCCCT GGAGTAGCTG GNTCCTCCAG TCGAGGTCCC TGTTCAGTCG 120 

GTTCTTAGGC TCCTGCACAT GAAGGTGTGT GCCTGTGGTG TGTGGGCTGC TCTAGGAGCA 180 

GATACAGGCT GGTATAGAGG ATGCAGAAAG GTAGGGCAGT ATGTTTAAGT CCAGACTTQG 240 

25 CACATGGCTA GGGATACTGC TCACTAGCTG TGGAGGTCCT CAGGAGTGGA GAGAATGAGT 300 

AGGAGGGCAG AAGCTTCCAT TTTTGTCCTT CCTAAGACCC TGTTATrTGT GTTATTTCCT 360 

GCCTTTCCGA GTCCTGCAGT GGGCTGCCCT GTACCCTGAA CCTCATGAGC CTCTAAGGGA 420 

AAQGAGGAAC AATTAGGACG TGGCAATGAG ACCTGGCAGG GCAGARTACA AGCCCAGCAC 480 

CAGTGrrCCCA GCCTTACTGG GTCCITACCC TGGQCCAAAC AGGGAGGGCT GATACCTCCT 540 

35 TGCTCTTCCT AGATCCCCAC CTCCTACAAT CTCAfiCCCAC AAGTCCTCTC CACCCTAGGG 600 

GGCTTGCTGC ATOGCAATAA CTCATAATCT GATTTGGAQG TTTGCCCTTT ACAQGGGCAG 660 

ATTTTCTCCT CAGTTCAACA ATGAAATGAA GAGGAACTCC CTCTTTCTAC AGCTCACTTC 720 

TATCAGAGGC CCAGGTGCCT CAGAGCCACA TTGAGTTGCT TTTTCTGGGA TGAGGAAGTA 780 

GQGTTAAACT CCCCAGTTTC CTGAGQGAGG CTCCTGACAG GTGCCCTTTG TCAGACCCTA 840 

45 CCACAGCCTC GATAGGCAGC CACATTGGTC CTCGCCCTTG CTCGGNACTC CGTGGTQGTC 900 

CTGCCCTTCT CCCTGCATQC CTOTOGGTCT GCTCTGGTGT GTGAAGGTCG GTQGGTTAAC 960 

TGTCTGCCTA CTGAACCTQG CAAATAAACA TCACCCIGCA AAGCCAAAAA AAAAAAAAAA 1020 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 1069 



30 



40 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 75: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 base pairs 
60 <B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

5 

GGACATTAGA TCACTGTQGA CCTAAAACAA ACAAACAACT ATAAGGAAAA TGGCATTAGA 60 

AATGGTCTGG GGATCAGTTT ATCACTGCAG TTGTTACATC ACCCCATGGT CTAAAATACA 120 

10 GAGCTTTAGT CTGTCTCTGT TTCAGTTCAT TTTACAGGAG GTGAACATCA CACTTCCAGA 180 

AAACTCTGTC TGGTATGAAA GGTATAAATT TGATATTCCT GTCTTTCACT TGAATGGCCA 240 

GTTTCTGATG ATGCATCGAG TAAACACCTC AAAACTTGAA AAACAGCTCC TGAAACTTGA 300 

15 

GCAGCAAAGT ACTQGAPGCT GACTGATGCC CTCATGATTT TCCACCCTCT CTTCCCATAA 360 

AGCATCTTCC TAAGGAAAT6 AMCATGGCCT GATACTCATT TTGTCACTTG TACAGAGCCC 420 

20 TAAGGATGTT CTGAATTCAG TGGTGCCAAA TAAATGTTGA CATTCCCCTT TTGGTTGATG 480 

GAAGTATCAG TGTQGGAACT GTTTGCTTAA TQGCATTTTA TAAAATAAKA AKAKCATATT 540 

AGCAGGGAGG GAGATGATGG AGGGAGQGAG AAGTCCATTT GTCTTATTTA TCCTTTTTGT 600 

25 

ATTAATAGAG AAGCACTTCA CAGTCACTGG CAATGCCATT TATAGGAAGA AGGTTCTGCA 660 

TTCCTGCTGC TCCCGGAGGG CTTAACTTTT TAATGAAAGA ATAAATGCTC TTCCACTCAG 720 

30 TAGATAAAGT GAAATQTGAA TTGTTAATAA CTGTGCACGG TCAATAAAGC GATGTTTTAA 780 

GGAATACAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAACTCG A 831 



35 

'{2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 590 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

TATATATAGA CNGTTAATAG TCGTGANTGN TGTGNACGAA CATTAACGGA AGTAGCATGT 60 

AGCCAGTCGA ATAACNTATA AGGACAAAGT GGAGTCCACG CGTGCGGCCG TCTAGACTAG 120 

50 

TGGATCCCCC GGCTGCAGGA TTCGGCAOGA GCTGCCAGGT GftGGAGCAGA GAGACTGTTC 180 

CCTTGGGTGG AGAOGTGTGG GCATGA6AGC CACCCATTGC CAAGCAGCAA GAATGTTCGT 240 

55 GCTTTTTTCC CTTCCAAAAT ATGCAGQGCT CAGGCTCCCA ATTCCGGGCC TGTCTGCTTT 300 

GCTTGTGTTT CTCCTGTCCC TGTTCTCCCG GAGGGCCCAG GTGGAACTCA CGACAGGGAG 360 

GGAGACQCTT CCCAAAAACC TGCAGGGCTA TTTCCCAGAA TTTGGTTTTC AAGTACAAAA 420 

60 
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CTTTTTGTCC TGTAAGATAT ATGCAGCCTC ACAGAAGCAG CXnXTTGCCTC CACTTTACCA 480 
GCTACGTTTT TATCTTAAGC ACATQGGGCT OCCTTAGAAC TTACTCCACT GATTTAAAAA" 540 
5 AAAAAAAAAA AAACTCGAGG GGGGGCCXTGG TACCCATTCG CCCTAAAAGT 590 



10 (2) INFORMATION FOR SEQ ID NO: 77: 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1274 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

(D) TOPOLOGY: linear . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

GAGCCACCAC ACCTGGCCTG GAAGG2VACCT CTTAAAATCA GTTTACGTCT TGTATTTTGT 60 

TCTGTGATGG AQGACACTGG AGAGAGTTGC TATTCCAGTC AATCATGTCG AGTCACTGGA 120 

CTCTGAAAAT CCTATTGGTT CCTTTATTTT ATTTGAGTTT AGAGTTCCCT TCTGGGTTTG 180 

TATTATGTCT GGCAAATGAC CTGGGTTATC ACTTTTCCTC CAGGGTTAGA TCATAGATCT 240 

TGGAAACTCC TTAGAGAGCA TTTTGCTCCT ACCAAGGATC AGATACTGGA GCCCCACATA 300 

ATAGATTTCA TTTCACTCTA GCCTACATAG AGCTTTCTGT TGCTGTCTCT TGCCATGCAC 360 

TTGTGCGGTG ATTACACACT TGACAGTACC AGGAGACAAA TGACTTACAG ATCCCCC3GAC 420 

ATGCCTCTTC CCCTTGGCAA GCTCAGTTGC CCTGATAGTA GCATGTTTCT GTTTCTGATG 480 

TACCTTTTTT CTCTTCTTCT TTGCATCAGC CAATTCCCAG AATTTCCCCA GGCAATTTGT 540 

AGAGGACCTT TTTGGGGTCC TATATtSAGCC ATGTCCTCAA AGCTITTAAA CCTCCTTGCT 600 

CTCCTACAAT ATTCAGTACA TGACCACTGT CATCCTAGAA GGCTTCTGAA AAGAGGGGCA 660 

AGAGCCACTC TGCQCCACAA AQGTTGQGGT CCATCTTCTC TCCGAGGTTG TGAAAGTTTT 720 

CAAATTGTAC TAATAGGSTG GQGCCCTGAC TrOGCTGTGG QCTTTGGGAG QGGTAAGCTG 780 

CTTTCTAGAT CTCTCCCAGT GAGQCA3X3GA GGTQTTTCTG AATTTTGTCT ACCTCACAQG 840 

GATGTTGTGA GGCTTGAAAA GGTCAAAAAA TQATGGCCCC TTGAGCTCTT TGTAAGAAA6 900 

GTAGATGAAA TATCGGATGT AATCT6AAAA AAAGATAAAA TGTGACTTCC CCTGCTCTGT 960 

GCAGCAGTCG GGCTGGATGC TCTGTGGCCT TTCTTGGGTC CTCATGCCAC CCCACAGCTC 1020 

CCAGGAACCT TGAAGCCAAT CTGGGGGACT TTCAGATGTT TGACAAAGAG GTACCAGGCA 1080 

AACTTCCOXSC TACACATGCC CTGAATGAAT TGCTAAATTT CAAAGGAAAT GGACCCTGCT 1140 

TTTAAGGATG TACAAAAGTA TGTCTQCATC GATGTCTGTA CTGTAAATTT CTAATTTATC 1200 

ACTGTACAAA GAAAACCCCT TQCTATTTAA TTTTGTATTA AAGGAAAATA AAGTTTTGTT 1260 
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TGTTAAAAAA MAA 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1133 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
AGGATTTTTC CTTGTTCAAC CAAAATCTGA GCATTCTTTC TATGTTGAAA ACACTGAAAA 
ACTAATTTWA GTTAATGAAC TAGAAAGAAT ATTGATTTTW AAGAAACAGA AAAATACTAC 
TTATTTTCCT TCTCAAATAA CGTTTCTTTC AAAAACTTCT QGCTGAAGTA TAACATGCTG 
GTAGTTAACA TAAATCTTGT CTTTCTCTTG TTCTTTATCT TTCTTTGTTA TTTAGATGCT 
TGTATAAATG TCTTTTGTTT TTATTAAGTG CCTAATTGAC AGAGCTTAAT TTGAAGAAGT 
GCCCTAATTT ATTGACCACT TAAGAATTGC CTTTATTGGG GTATTTTATT TGTTCCTGCG 
TCTTTTTGAT GTTGTTCAGT CTACTCATCC CTGTGftGTAT GTGTGGGGGA CAGCTGATAG 
AAGGGAGGAG AGTGTGTCTA TGCTCAQGAT TGCCCTTTAG CCACTCAGCC AGAGATCCAC 
AGGGAGCAAC AAGGACAGTT TCACATGCTT AGACTTTCTT GGAAGAAACA GTGAGGAGGA 
GTAAGTOGTG AGTAGTGTCA AQCTGGATGT AGAATTGTCC TAAGGCAGTT GACCCCACCT 
TCCAACATGT TTTCACTTTA TTTGCCCCTC CCTACATTTG GGTTAGGTTC CATTTGGATT 
TGCAGCAATA ATGACTTTAT TTCTCTCTTG GTCAGGATTT G(X:ACATAAA ATCCTTTTAT 
TATAGAACTA GCTATTTTAG TTACATAGTA ATGTAACTAA TGGAGAGATT TATAGAGAAT 
TTTGKTTTTG CTGTCATATA TGTCCATTTT GGAGACAGAT ATGATAGAAC TAGAAATTAA 
GTTGCATTTC TGCAAGTGCC ATTTGAATGA ACTTCAAGTA TCTTCTTAAT TAITAAATTT 
TCTGATGAAG GCATTGTAAC AAATATATAG TATTATTAAA TCTAATTAAT ATTTGGAAAT 
ATTAATAAAT AGGTATTTTA rrTACTOTAA AAAGTCAAAC TTCATTATGT AGATAAATCT 
TATTCTTTTC ATTCTTTCCC CTGTTTACAT CCTTTTTACA AAGCTTAGTC ACCAATTAAA 
GCTTTCCTAT CAAAAAAAAA AAAAAAAAAA ACTCGAGACT AGTTCTCTCT CCT 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 661 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
GAATTCGGCA CGAGGQGAAA AGGATGCTGA ACGAGAGCAG AAAGCCTCTT TCCTTTGCTT 60 
10 CACGCCTTTC CAGTCTTTAT TTTAAACTCG GGTTCCCTTT CTGTGGTCGC AGCAACCTTT 120 
ACTCCACCTG CACTGCTGCT CCTGGGGGCT CCCCAGGCCT CCCTCTGCCT TTCTACCCAG 180 
TGGCTGACGG GATGCCTGTC TTGCCTGGAC GCACCACTGC TCTCCTGTCC CTCACCTTGG 240 

15 

CTTTTGCTGT GCCCTGCTCT GGGGTTGAAG CTGGCCCATG TGTCCCCCGG AGTCATGGCT 300 
GCrCCTCCTG GGAQGCCTCT GTGTGCGTCA CGTCTTCCAC ACCTGGGGGC AGCTGGCGAG 360 
20 CCCGTGCTCT GTTCCCCTCG GCTGCTTGGC ACAGftGVTGC AGCCTGGGAY TCTCCGTGGA 420 
CCCAGACTGG GGATTTTGCC AGGGGGGCGA TGQGAGGAGC AGGTCCTTTG CCTGGGGGCT 480 
GTGTCTGCAT TTCTGGACGC CCCAGAGCAC AGAAGTTGCC GGCACTTTGA GGTCTTCCTC 540 

25 

GGCATGTGCC AGATTACATG AGTGACGGCT GGGAATATGT TTTCTTTTTT GTAATGGAGG 600 
CGTGTTTCAC ATATAGTAAA GCTCACCAAA AAGTAAAAAA AAAAAAAAAA AAAAAACTCG 660 
30 A 661 



35 . (2) INFORMATION FOR SEQ ID IK): 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:* i378 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

45 ATTGGGTACC GGGCCCCCCC TCGAAGTTTT TTTTTTTTTT TTTTAATGAA AGCTCTCAAA 60 

TAAGCGATTT TATTCCTATC CATGATTGCA GACATTTACA AAACCATAAC ATCTGAGTTC 120 

ACCTTAAAAA ATAACTTATA TAAAGCAGTG ATATACACAG CACAAAATAG TTCAGGGAGG 180 

50 

GGGCAGGAGC AACTTGTAAT AATTAAAATG TAAACGTGAA AAAAAGGATG GAATAAAAGT 240 

CCCTACTTAT TTCTACTTAA GATGTCATGT GATAATATTT TACAATGTCC TGTGGGTCAA 300 

55 TGTATGTATG TGTATATGTC TGTATAACAT ACACATATAC AGTACATTCT CTTTCCCACA 360 

CATATACATA CACACATAAT TATTTGCAGT TCAGTTTAGG GCAATTCTAA TATGCCACTC 420 

CX5TACAGTTG TTTGAATCAC ATTTGGACCC GCTTTCTTCA CAAAAGAGGG GAGAGAGCAG 480 



wo 98/54963 



335 



PCT/US98/11422 



GAAATAAAAA GGTTGCSTrTG GTCTGACKaA GATTCCTTTG TTTAACTCTA CACTGTGATG 540 



AATAATTTTC TTCCGTAGTA GTTCTGTGAA GGGCTGACTC ACTGTGGTTT TCATGAGGAG 600 



5 ACTTGGTAAT GGATCACACG CTCATTGTCA TGCTAGGGGA GTAACTCTCA CTCTGAAAAG 660 



GATTTAAGAA ATTTCCGXTCC ATTTCGCCAT CATCCCTTGG AGTGCCCGGT TGATTACTCA 720 

QGCTCATATT ATTGGGAGAA TTCTTGGAAA TACTGTCCAT ATCTCCTGAG CCTAAAGAGC 780 

10 

CATTCATGTO ATGTGACTCC ATTCCTCCTA ATCCACCCAT GGGACCATCT GACCCAGGRC 840 



CCATTGGAAA ATTAGGTCTG TTAGGTCCAG GAGGTACTGC ATTCATTAAA GTATACATGT 900 



15 TATCACCAGA GTTGGTTGAA TCTGCTGGAC TAGGCATGAT GGGTGTTCCT GGTGGCCCTC 960 



CACCTCCTGG AQGACCTACA TAATTCCCAG GAGATGCTGA GGAGTATGGT ATTGAATTGG 1020 



CATTTGTTGG GTTTGGCCAA GGTCTACCAC CACCTGGACC CATGTTCATT CCAGGCATTC 1080 

20 

CAGGGCCACC TAAAGCATTC AGTGGGGGTC TCATTGCACC TCCATAGTTC TGTGGTCCTA 1140 



AQGGCACCAT TCCTCTTGGA GGAGTCATTC TCTGCATTGG CCCACCCATA TTTGGATGTC 1200 
25 CTTGTTGTCX3 AGTTGGATCC ATTCCACTGG GGAGTAATGG CTGACTTCCT GGGACACCTC 1260 



CAAGTGCCTG ATTAGGTATC CTCAATGGGG GCCTTGGACC TCCAGQGTAC CGAQGTGACA 1320 



TAAAAGQGTA ATCATGGAAG GCT T TTOC T T CaCTTGAGTG TTCACATGTT TCACGTCT 1378 

30 



(2) INFORMATION FOR SEQ ID NO: 81: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1440 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
40 <D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
ACTTTGTCCA AATGTGn C TG TCACATGTAG TCAGCTGNAG NAATTTAAAA TGAATTGCCA 60 

45 

AGTGAAGAGT CTGTGGATTA ATTGGCCGTT AATTAACAG6 CTTTATCAAT GTGTCCTCAA 120 



GQGASAGGCC CAACCCTAAT TAAGGAGCTA AACTTCCTGA GTGAGGGGCT GTGAGGATGG 180 
50 AGGTQGAGGA GGCATCTQGG GCGGGTGGTG GCCQGGCCAG CAGATQGCGC CTCCCTGGCT 240 



GAGCTGCCCG CACCGCCAGT TCCCTCATTT CCACTCAGGA AGGCAGAGAA GGCAGAGTGA 300 



TCTCCTCAAG GAAGAGCTTC CCCAGCCTTC GGGAGCAGCT GGCAGGGCGT CCGQGAATAA 360 

55 

GCCCTACACG CCQCOGCCTG CCTCCAACTC ACTAACCCT6 CGCCTCTTGT CTTTCAGATT 420 



CAACGCGTTC AACAGAAGCC ATCCCCAGCC CAGCTTAAAT TATAAAGATA GACAATAACT 480 
60 CTGTTCCAAT CTGCGTGGTG CTTCTTTAGT AAATACTGTA CAGATTTTAC CATG6AGAAC 540 
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TTTTTTTTTA GTTTTTACCT TTTCTTAATT ACCCTTATTC CGAATGGACG AACACTTTCT 600 

ACCACTGCTG ACCATTGTAA AATACCXTTGT ATATAAATCC CATTGAAATA ATGCCCTQGA 660 

5 

ATAGAACATC TCAAATGCTG CTTAATTACA GACTCAGGTC GATTACTTGT ATTTCATGTA 720 

ATGTTCCTCC AAGTTAGACA TCTGGTGCAA GACCAACCGG GAGACCATQG AATTGTCAAA 780 

10 AGTACAAACT GACAGTGTGT ATATTTAATT TAAAGACTTA TTTAAAAACT CACAAGCTCT 840 

CACCTAGACT TTGGAGAGCA GTCTGTTTTC TGTAATGTCT GATACTAGAA ACTAATTTGC 900 

TTATTTTAGT TGTATTCAAG ATTTGAAGAT GTATTTTATA GACAAGTTCT GTTTTTGAAC 960 

15 

TTTGTGGAAC TGTTCCAATC AATCAATTTC CXIAGTTATGA TGAGTATTTA CATTATGAAT . 1020 

GTATAACCCA GACATGATTT GTAAAGCCGA CAGTATGTTT CTATTACACA ACACTTTTTG 1080 

20 ATACAGCXm: TCTTGTCTTC ACTGATACTG GAGTCTCCGT TGTCTGCNNG GTCCCTTCGA 1140 

GTTTCTAGTT ACAGACACAA TCATACTGTG ATTTTATTTT TAATATGGAT ATGCTATCAA 1200 

ACTGTGATAC ACTTATAATT CACTGGTCCT GCATCAGGAG ATGGAGTGGG GAAAACTGTA 1260 

25 

TTTAATACAG TTTOTATCTG AATAATCTGT ATGGTTTATA CAGTTTGTGT TGTTCAGAGA 1320 

TGTTTAAAGT TTGATCTTTG TTTTTCTAAA GATTAAAAAA GCACTTGCCC CACTGTAAAT 1380 

30 ATACAGCATG TAAAATTTCT RTAGTATATA AATGGCAGCA AATCACAAAA AAAAAAAAAN 1440 



35 (2) INFORMATION FOR SEQ ID NO: 82: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1381 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: double 

(D) TOPOUXSY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 82: 

45 CCCGGGCTGC AGGAATTCGK YACGAGGCCA GCAGTTGCTC CCAGTTCAGG AGGTGCTCCT 60 

GTACCCTGGC CACAGCCCAA TCCTGCCACT GCTGACATCT GGGGAGACTT TACCAAATCT 120 

ACAQGATCAA GTTCCAGCCA GACOCAGCCA GGCACAGGCT GGGTCCAGTT CTGACCTGAG 180 

50 

CAOGGnTTT CCTCATGTGA CTTCTGGGAA GGCGCTCCCT CATCTGGGCC AAAGGAAGGA 240 

QGACGAAGCC CTCCTCAGCT GGCCTGTGTT TGGGQCATGA ATCTCTCCTC TCCTCCTTGT 300 

55 CTGGCTCTGT TGACAAACCG GGCATGTTTG GCAGTAAATT GGCACCGTGT CACACTGTTT 360 

CCTGGGATTC AAGTATGCAA CCAGAACACA GGAGAAGAAA AGCTCCAGGA TCCCTGTCCC 420 

CATCTGTCCT CTTGATOTGA GAGAGACTCT GAGACTTCTT CCATCGCAAT GACCTGTATT 480 

60 
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AAACACAAGC CCCCCAAGCA AAAGAAGAGG TTGAGTTTGC TGCCAGGATT CAGATCAGCC 540 

CTTCCCAGGG TCTGCAQGTG TCACATGATC ACAGTTCAGC QGGAGGCTTT CCGTACCCAC 600 

5 ACTGGCTGTA GCACTTCAGT CCATCTGCCC TCCAGAGGAG GGTTTCTTCC TGATTTTTAG 660 

CAGGTTTAGA GGCTGCAGCT TGAGCTACAA TCAGGAGGGA AATTGGAAGG ATTAGCAGCT 720 
TTTAAAAATG TTTAAATATT TTGCTTTGCT AATOTGCTGA TCCGCACTAA CTCATCTTTG - 780 

10 

CAAAAGGAAC TGXnCCCTCG GCGTGCCCCA GCTGGGGCCT CTGAAGGGAT TCCTCACTGT 840 

GGGCAGCTGC CCTGAGCTTC AGGCAGCAGT GTTCATCTCT GGCCAGTTGT CTGGTTTCCA 900 

15 TGTATTCTAG GCCAGGTAC3G CAACACAGAG CXMGGCGGG TGCTGGAAGC CAGACGGAAC 960 

AGTGTTGGGG CAGGAAGGTG GATGCTGTTG TCATGGAGCT GTGGGAGTTG GCACTCTGTC 1020 

TGCTGGTGGC CCTCTCGGCT CACATGTTCA CAGTGCAGCT CCTGGCAGAC TTGGGTTTTC 1080. 

20 

TCTTTGGTGG TTTCTAAAGT GCCTTATCTG CAAACAACTT CTTTTCTCCT TCAGGAACTG 1140 

TGAATGGCTA GAAGAAGGAG CTCAGTAAAC TAGAAGTCCA GGGTTGCTTG GTTTACTGGT 1200 

25 TTATAAGAAA TCT6AAAGCA CCTCTGACAT TCCTTTTATT AACTCACCTC TCAGTTGAAA 1260 

GATTTCTTCT TTGAAAGGTC AAGACCGTGA ACTGAAAAAA GTGTTGGCCT TTTTGCQGGA 1320 

CCAGA3TTTT AAGATAAAAT AAATATTTTT ACTTCTGTCA AAAAAAAAAA AAAAAAATZ4T 1380 

30 

C 1381 



35 

(2) INFORMATION FOR SEQ ID NO: 83: 

- (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1706 base paijTS 
40 (B) TYPE: nucleic acid 

(C) STRANDECNESS: double 
(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: '83: 

45 

ACTGCACCAC TGCCCAOGTC TCCCGGCTGG ATGAAGACGT GGTCCATGAG GAAGCTGGCT 60 

AOCTCAGACT GGAGAGTAGC TTCAGGAAAA AAGACAAGTG GCCTAAGGAA ATCACGGCCC 120 

50 CCAACTATCA TGTGAQGGCT AAAGATGAGA AGTAGATCAC TTAATAAGAC AAAAGCCTGT 180 

AGGGGGAAAA GAAAGGATGT TTAAAAGGAC AGAATGTTTC CCAAGGTAGA AATGACACTG 240 

TCAATTTCTC CTTGGAATGG GGGCAGGGAT ACTCGCCTTG TTGCTCCCAC TTGAGTCAGT 300 

55 

ACTCACCTGC TCCTGGATCT CAGTATCCAC ATCTGAGAGG CAACTCTGGC AGAGTTCACA 360 

GAAQQCCACC ATTCTGTCCC TCAAACTCGA CAGCTGCTTC TGTGGGCACA GTGGCTTG^ 420 

60 GGGGAAGAAT GAAGACACAG ACTCCTCTGT TCCCATTATC CCATCTAAtSA CCCACACTCA 480 
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CCTQGGGAAG CATCTGATTT AGAAATGTGG GTTAGTGTCC AGAGAATGGA AAAATAGACA 540 

AGAGTCAAGG CTGGCAGGAT AACCTGTAAC AACAAAGQGT TTGAAAAATG AGGTTTGGGT 600 

5 

TAGGAGAGGG AGAGACAGAT AGCCAGAAAC ACACCAGTGA AGAGGAGAGA AAATGAGTAA 660 



AGGGAGAGCT AATTCCTTTT CCAGTGGAAA ATGAGTGATA TTCTGGACAT TCTTCAGAGG 720 
10 CATCTACACG AAGTAGAAAT GTCACCGCTC CCTAATTTAC TCTACX5TCTT CTAGAATCCC 780 



TCAATATTAT CCTTGGCTTC CAGGAAATCC AAGAAGACCC TGGAAGTAGA GTCCACCTTC 840 



TAAGAGAGGA ATGTAAGAGG TGACCCCCAC CCACCT6ATC TTCCTCGCTT TGTCCACTCC 900 

15 

ACGCACTGAG ACTTGACACA CCTAGTGGCC ACCTAGAACG TAGGTCCTTA AAATVTAGCC 960 



CCCCAGCCXrC CAACCCATCT CTAGCCTGTC CACTCACCTG GTGAGGAACY TYTCCTGTGT 1020 
20 CCACAGCYTT CTGCAGGAGT TGGCAACATG GCTCATAGA6 CTCCCAGCGA GTCAGGTCAT 1080 
GAGTGCTTTG GGGGAGAAAG GGGAATGTTA TACTGGAAAA GAACAGAGGG AACCAACTCC 1140 



ACAGACACCA GTAAAAACGG GATGGGGAAG AGGAGGAAAG CCACTCACTT GTAGAAGGCA 1200 

25 

GAGAGGCGTT TCAGAGTGGC TGCCAGATTA TATACCTCAT CCTCATCTAG GAAGGACGAC 1260 

TGAGAAGGAA AGAAGATCCA CAATAGCATT TCCCCCAGAA CTCATCAGTC CACATCCCCC 1320 

30 GTCTTGCAGC CCCTCCCACC CTrGTTTQGG GTCTCCCATT GTCCAGCCCC AGCTCCTACC 1380 

TGTAACAGCT CTTCAAGCTC CTGCTGGAAR CGGTCAGTCA GCAAATCTAC TAGCTGGCTG 1440 



CGGGCAAAGT CCGCCCGGCT GAAGAAAGTG AATTCGGGAT TACAGAGCAG GTAAGAGCAT 1500 

35 

GCGCCCCAGC CTCAAGCACC GCTGGCTCTG CATGCTTCAC CACCACCTCC TGGAGTTGCT 1560 

GCAGGAACAG CTCCAGGTGC TGAGAAGAAA AGGCAGAAGA TGGTGTGCTG TGGGGATGGG 1620 

40 AGGAGGACAC TCTTCTGGCG GGAAGTGGAA CGGGGTTAAA AGCATTAAAC TTCAAGGATA 1680 

AGATGCCTAA RAAAAAAAAA AAAAAA 1*706 



45 



(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 573 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

GAATTCGGCA CGAGCTTGGT AGCCTTAGAA CTGCATGAGC TGCTTTACCA CTGGGAAACA 60 



CGAGCACAGC CTAGCTTGAT TTTGTATGTG GTATCAGATC TAAGGTGGAT GGAATTCAGG 120 

60 
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ACTTCCTGTC TACTCTTTGA rETTGTTTTA 
ATTTATTCAT CTTCAGAGAC ATGGTCTGGC 
5 ATCATAGGCC ACTGCAGTGT TGAGCTCCCG 
TTAGTAGCTG GGACTATAGG CACATGCCCT 
GATGTCYCAA ACTAGAAGGT CTATTAATTT 

10 

AAATAATAAC AGTGGGAAAA GCXIACCTTCC 
AAAACGAAAA ATAAATAATA GGAAAAAAAG 
15 AAAAAAAAAA AAAAACTCGA GGGGGGCCC6 



339 

TTTTTAGAAA TGTTTTATTT TGTTTTATTC 180 

TCTGTTGCCC AGGATGGAGT GCATGGTGTG 240 

GGCTCAGGCG ATCCTCCTGC CTCAGCTYCC 300 

ACCATGCCTG GCTTTGTCTA CTTTTTGAAT 360 
AAAAAATTAA GGATAGCATG CCATAATTAA - 420 

AATGATTCAG ACATCAACTT GTGA1TTAAA 480 

GGGAAAAAGT TAAATAAAAA TAAAATTAAA 540 

GTA 573 



20 (2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 684 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

30 CTCTTTGGCT GTCTCTACCT CCTTCATCTG CTGCGCCGAC ATAAGCACCG CCCTGCCCCT 60 

AQGCTCCAGC CGTCCCGCAC CAGCCCCCAG GCACCGAGAG CACGAGCATG GGCACCAAGC 120 

CAGGCCTCCC AGGCTGCTCT YCACGTCCCT TATGCCACTA TCAACACCAG CTGCYGCCCA .180 

35 

GCTACTTTGG ACACAGCTCA CCCCCATGGG GGGCCGTCCT GGTGGGCGTC ACTCCCCACC 240 

CACGCTGCAC ACCGGCCCCA GQGCCCTGCC GCCTGGGCCT CCACACCCAT CCCTGCACGT 300 



40 GGCAGCTTTG TCTCTGTTGA GAATGGACTC TACGCTCAGG CAGGGGAGAR GCCTCCTCAC 360 



ACTGGTCCCG GCCTCACTCT TTTCCCTGAC CCTCGGGGGC CCAGGGCCAT GGAAGGACCC 420 



TTAGGAGTTC GATGftGAGAG ACCATGAGGC CACTGGGCTT TCCCCCTCCC AOSCCTCCTG 480 

45 . 

GGTGTCATCC CCTTACTTTA ATTCTTGGGC CTCCAATAAG TGTCCCATAG GTGTCTGGCC 540 



AGGCCCACCT GCTOCGGATG TGGTCTGTGT GCGTGTGTGG GCACAGGTGT GAGTGTGTGA 600 
50 GTCACAGTTA CCCCATTTCA GTCATTTCCT GCTGCAACTA AGTCAGCAAC ACAGTTTCTC 660 
TGAAAAAAAA AAAAAAAAAA AAAC 684 



55 

(2) INFORMATION FOR SEQ ID NO: 86: 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 1036 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

TGGAGGCAGA TGCACAGGAG AAAGGTTCCC GTCCGCACCC TCTCAGACCT GAGGCTGAGC 60 
TTGCAGTGAG GGCTTCTCCT CGGCCCCTCG CCCGCCCCCA GfiGCTGCCAT CCCTGCTGTT - 120 

10 

ACAAGCCAGA GGAGCCCQGA TGTGAGGCCC CAGATCACCT CCAQGGACTT GGGGTTCCXrA 180 

TCTGAAATCC TTTATTTTTG TACCATQGGG TGGGCCCCGG GCTGAGAAGG AAGAAGCACC 240 

15 CTCTCCCCGG CCTCCTCTGT CTGCACCCGT GGGGCTGTGA CTTACTCCTG CCTCCAQGQG 300 

CGGGGCGGGG CCCCCTGGGA CCTCTTAAGG CCCAACGTGG GCCCCAGGAC CTYTGGGCAG 360 

AGTGGAYTGC TCATGGCAGA TGTGTGGCAA TGTCTGGCTG W3TCTTTCCG GCAMCTGCGT 420 

20 

YCCCTYTCCC GGGVTCCCCT GCTGCATGGT GGATC3TGCTC CTTCCTGGCC CGGTCACATT 480 

GCCTCCTTGA GCCTTAGTCC AGGGGGTCAC TYCTCCCACC CCACCTACCT CACAGGGTTG 540 

25 TTGTGAGGGT GCACfiGAGGA GCAAAGTCCC TGAAGGCCCT CAGGCAGTAT ATAGGGGCCG 600 

CCCACCTTCA GCTGCCCTGG GATGGGAAGG ACCCAGCCCG ACCCCTGQGC ATAACACTGT ' 660 

GTTTGCAAAT GGAGATTCAG GTATTGGGGA TGCAGGTTGT QGGGAGCTGG CCTGGCAGAG 720 

30 

TAGGGGTAGT TGGCTTGGCC TTCTCTTTGG TGATCCCACC CCCAGCCATT TGCATTGCTG 780 

GCCCAGCGCC TGGCCTGGGG GGCGGGGAGA GGCAGCAGAA GGGGCTGGGC AC3GQGCGGTG 840 

35 GAQGACTCAG GAACTOCCCG GGGAGAGTGG GTATCGCGGC TGAGCCAGGG GCCCTCCTGT 900 

GTTTGACTTC CCGQGATQGG TCCTTGCTTC TCAGCTGTGT CCXSACCCCAC CATGTAATAA 960 

AACCCAAAGG AACAGCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAN 1020 

40 

CCCmSGGGG GNCCCG 1036 



45 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE GKARACTERISTICS: 

(A) LENGTH: 908 base pcd.rs 
50 (B) TYPE: nucleic acid 

(C) STRANDEmESS: double 

(D) .TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

TTAAACAAAT GGAATCATGC AATATGTGAC CTTTTGCGTC TQGCTTATTT TATTTAGCAT 60 

AATGTTTTTG AGGTTCATCC AAGCTGTAGC ATGTATCAGC ACCTCATTTC TTTTTCTqpC 120 

60 TCAATATTAT TCCATTATAT GGATTTACCA CAATTCATTT ACCTATTCAT CTTTTGTTTC 180 
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TGCTGTCTGG CTATTGTGAA TAATGCTTCG ATAAACATTC ATATACAAGT TTCTATGTGG 240 

CTTTATGTTT TC^TTTCTCT TGGCTATCTA CATQQGAGTA GAATTCTAGG TCATAATATA 300 

5 

ATTTTATGTT TAACTTCTCA AAGAATTGCC AAAAGGTTTT TCATAGTGGC TGCATCATTT 360 

ACATTCCCAC CGGCAATGTA CAAGGATTTC TATTTTTCCA TATCCTTGCA CTTACCAACA 420 

10 CTTCTTTTTK GTOATWATTT TGTTTTTTCA TTATTGCCAC CCTAGTGGAT GTGAAATGGC 480 

ATCTTATTGT TTTGATTTGC ATTTCTCTAA TGACAAATGA TATCATACTT TTTTTATGTG 540 

CTTACGGATC AAAGGTATTT CCTTGGAGAA ATGTCCCTTC AAGrrCCTTTG (XATTTCAAA 600 

15 

ATTTQGTTAT TTGTCTTTTA TTATTCAGTT TTAAGAAATT GTGGCCAGGC GCAGTGGCTC 660 

ACXrTGTAATC OTAGCACTTT GGGAGGCCAA GGCGGGCAGA TCACTTGAGK TCAGGACTTC 720 

20 GAGACCAGCX: TGGCCAACAT GGTGAAACCC CATCTTACTA AAAATACAAA AATTAGCTGG 780 

GCGTGGTGGC AGGTGCATGT AATCNTATCT ACTCAGGAGG CTGAGGCAGG AGAATCGCTT 840 

GAACCCAGGA GGCGGAGGCT GCAGTGAGCC AAGATCACGC CATTGCACTC TAGCCTQGGT 900 

GACACAGA 908 



25 



30 



40 



(2) INFORMATION FOR SEQ ID NO: 88: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 655 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGXi linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

TGCACTGGTT CCTTCTCCCC AGCAAATACT GCCTTCTTGT TTTTCTCTGA TGTGGCAGGT 60 

GACTACAAAA TCCGCCTTGG TATTCTTCAA ATGCATATAT ATTCCTTTCT TGTCAGCTCC 120 

45 CTCTCTTCCT AGATTAGAAA ACTGCCTCAT TTTCTGCTCA CTGGATGTGC AGTCCCAGCT 180 

TGTCTTCCTC TCCTCCCCCC CTC3TTQCAGG ' iWl 'L Ti ' m ' TmT TC TrC TCTCCCCACT 240 

GGGCAGCAAA AGTTGTTCCA CAGTQGAAAW TTAQGCATCC TCAAGTTTCY TCCCAGCTTC 300 

50 

TGCTGTGTTT TCTTAGAGTA AATTGCCAAT TTCTGTTTTT ACAGGAAATC CTTTTTTAAA 360 

AATQGAATCA GTGTGGTCCC CATCTACTCT GCAAAAATTG CATITTTCTC TATTTTCAAA 420 

55 TGAGATTTGT TCAAGTTTCA AAACCACGTG AAATAATAAA TGTATAGTAG TTTTCTTTTC 480 

CTTGQGCATT GCTWSATATG TGAAATGGGT TTATGAAAAA TAATAAAATC ATAACGCTAT 540 
TTGTTTGACT TTCAATTTCA TGGGAATTTT TCTCAGCTAA ACTCTAAATG GTGATTARGC . 600 

60 
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AAAAAAAAAA AAAAAAAACY GRAGGGGC3GC CCX3GTACCAA TTCGCCCTAT AATGA 655 



5 

{2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1102 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

15 

TTTT T T T TTT ACCATTTAAA ATAAAATGAA AGTGACCTTC TGTTTATAAA AATCTTTGTC 60 

TGCATCTCTG CTTATTTCCT TAGAAGAGAT TCCAAGAAGC GGTGAGTGAT TTCACGGCA6 l20 

20 CAGAGGCrTTG GGACATATTA CGGGCGOGGA TCCCTCTTGG AGTGAGATGA CTCTCCGGAG 180 

AGATTTAGTC GTC^CCCTCG CGTGTGAGGC TGCGTCACAC CCCAGGGATG TGTCTATCAA 240 

GATGGAAGAT CTTTTACACG CTCTTGATTT TGTTTGSCTY TTTTTCTATT ACTAGTGAGA 300 

25 

AKGAAACTTT TTATATCATT ATTATCCATC ATAATCCAAC ACAAATTACT GCTTCATGTT 360 

CTTTTACTTT CCTGTGAAGG TTTTAGTGCC TTTTAAAAAT TGCTATATAT TAAGCTTGTT 420 

30 AATACTTCCA TQCTOTATTT GTQGSCATCA RTTTCCCCGG OIACAGGCOT GCACATTTTC 480 

CCTTCACACG CTGQGTGGTT TTTCATTTiC AMTTCTATTT CTCGTTCTTC TATCGTTTTA 540 

TGTTCAGACG GGTTTCTCCG TGTAGAAAGC AGTTTATGAA GATTTACTTT CGACAGTCTT 600 

35 

CTCTCTACTT TCTACAGTGA ATTCTCTGAT GTGTCTQGGA GTTTGGGGGT CTGGGTAAGA 660 

RTCCTCCTCT CACCCTATTC TCTATTACGA TCCACAGCCT CATGCTTTAT GARATTQGTG 720 

40 GCCGGGARCG GQGGAGATTT GCGGATCCCC. CAAQCCAGAC TTTATCCCCC TATCCCTGCC 780 

TCTGGATCCC ACGTACAGGC CTGGGAACTC CCTGTGGGTA QGGGCCAATG GTCTCGCACT 840 

CTCACCTGTA CCCCAGGGCT GGCACAGGAT QGTCAAGGAG AGAGGCTGCC CAAGCGCATC 900 

45 

CYTCTGGTGT CCCCCTGACA CGCCTCCAAA GTGAGCAGGT AGGTTTCAAC AGCCCCACGT 960 

TGCAGGTGGG AGATGAAGCT CAGQGTGGAG ACCAGTATCT CACAGTTCTC TTTGCATGGC 1020 

50 CGGGTACTTG TTAGTCAACT GATCAAGTGA AAATTCTAGC CCCAGAGGCA QGAGAATCCG 1080 

GAACAAAATT AAiVCCAGCCA GG 1102 

55 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 1533 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANOBDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTICM^: SEQ ID NO: 90: 

GGCACGAGCC GNCACGGGCA GCGCCCCATA GCGCCAGGGA CCCCCTGGCA GCGGGAC5CCG 60 
CGGGTCGAGG TTATGGATCC AGCGGGCGGC CCCXX3GGGCG TGCTCCCGCG GCCCTGCCX3G - 120 

TGNCTGGTGC TGCTGAACCC GCGCXSGCX^GC AAGGQCAAGG CCTTGCAGCT CTTCCGGAGT 180 

CACGTGCAGC COCTTTTGGC TGAGGCTGAA ATCTCCTTCA CGCTGATGCT CACTGAGCGG 240 

COGAACCACG CGCGGGARCT GGTGCGGTCG GAGGAGCTGG GGCGCTGGPA CGCTCTGGTG 300 

GTCATGTYTG GAGACGGGCT GATGCACGAG GTGGTGAACG GGCTTCATGG AGCGGCCTGA 360 

CTGGGAGACC GCCATCCAGA AGCCCCTGTG TAGCCTCCCA GCAGGCTCTG GCAACGCSCT 420 

GGCAGCTTCC TTRAACCATT ATGCTGGCTA TRAGCAGGTC ACCAATGAAG ACCTCCTGAC 480 

CAACTGCACG CTATTGCTGT GCCGCCGGCT GCTGTCACCC ATGAACCTGC TGTCTCTGCA 540 

CACGGCTTCG GGGCTGCGCC TCTTCTCTGT GCTCAGCCTG GCCTGGGGCT TCATTGCTGA 600 

TGTGGACCTA GAGAGTGAGA AGTATCGGCG TCTGGGGGAG ATGCGCTTCA CTCTGGGCAC 660 

CTTCCTGCGT CTGGCAGCCC TGCGCACCTA CCGCGGCCGA CTGGCCTACC TCCCTGTAGG 720 

AAGAGTGGGT TCCAAGACAC CTGCCTCCCC CGTTGTGGTC CAGCAGGGCC CGGTAGATGC 780 

ACACCTTGTG CCACTGGAGG AGCCAGTGCC CTCTCACTGG ACAGTGGTGC CCGACGAGGA 840 

CTTTGTGCTA GTCCTGGCAC TGCTGCACTC GCACCTGGGC AGTGAGATGT TTGCTGCACC 900 

CATGGGCCGC TGTGO^CTG GCGTCATGCA TCTGrTCTAC GTGCGGGCGG GAGTGTCTCG 960 

TGCCATGCTG CTGCGCCTCT TCCTGGCCAT GGAGAAGGGC AGGCATATGG AGTATGAATG 1020 

CCCCTACTTG GTATATGTGC CCGTGGTCGC CrTCCXXTTTG GAGCXX:AAGG ATQGGAAAGG 1080 

TGTGTTTGCA GTGQATQGGG AATTGATGGT TAGCGAGGCX: GTGCAGGGCC AGGTGCACCC 1140 

AAACTACTTC TGGATGGTCA GCXOTTGCGT GGAGCCCCCG CCCAGCTGGA AGCCCCAGCA 1200 

GATGCCACCG CCAGAAGAGC CCTTATGACC CCTGGGCCX;C GCTGTGCCTT AGTGTCTACT 1260 

TGCAGGACCC TtCCTCCITC CXTAGQGCTG CAGGGCCTGT CCACAGCTCC TGTQGGGGTG 1320 

GAGGAGACrC CTCTGGAGAA GGGTGAGAAG GTGGAGGCTA TGCTTTGGOG OGACAGGCCA 1380 

GAA3X3AAGTC CTGQGTCAGG AGCCCAGCTG GCTGGGOCCA GCTGCCTATG TAAQGCTTTC " 1440 

TAGTTTGTTC TGAGACCCXrC ACCCXMGAA CCAAATCCAA ATAAAGTOAC ATTCCCAAAA 1500 

AAAAAAAAAA AAAAAAAAAA ANCCXX3NGGG GGG 1533 
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(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 575 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEIMESS: double 

(D) TOPOIiOGY: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

ATCCTCTGGA ATCTAGGTGG AAGCCACCAA GCCTTCTTCA CACTTGCGTT CTGAGCATCT 60 

GCAGACTTAA CCCCATGTGG CAATCACCAA GGCTTATGGC TTGTGTCCTC CAGAACTGTG 120 

15 

GCCAGAGCTG TACCTGGGCC CCTTTGAGCT GAGGCTGAAG CCAGAGTCTG AAGCTCAGCA 180 

GGGCAGTARG GCCCTGQGCC TGGCCCCTGA AACCATTCTT TTCTCCTAAG CCTCTGGQCC 240 

20 TTTGATGGGA RGGGCTGTCC TCAAGATTTT TGAAATGCCT TTGGAGGGTT TTTGCCTTGT 300 

CTTGGATATT GGCTTCCTTT TAGTTATGCT CATCTCTCTA GCAAGTGAAT GITTCACAAC 360 

CTGCTTGGAT TCTTTCTCTA CCACAGARCC AGGCTGCAAA TTTTACAAAC TTTTACACTC 420 

25 

TGTTTCCCTT TTAAATATAA ATTTCAATGT TAAGTCACTT CTTTGCTCCC ATATCTGATT 480 

TAGGTTQCTG GAAGTAGCCA AGTCACCTCT TGAATGCTTT GCTGCTTAGA AATTTCCTCT 540 

30 ACTAGGTAGC CTGGGTCATC ACACTTAAGT TCAAA 575 



35 (2) INFORBIATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:' 639 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEl^SS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

45 TCCTTTCATC TTAAGCACCA CCCGACAGGG CAGGTACTAT TACCATCTCC GTTTGACAGA 60 

TNAGGAACCT GGCACAGGAA GCATTTAAGT GGATTCCCCA GGATCGCCCC ACTGTCAGGA 120 

GCAGANTCAG AATGGGCCTC AOCATCAGGC TCCCAATCCT GGCTTCTAAC TGCTGCGCTC 180 

50 

TGCCCTTCYC TCWCCCCACC TCCCCACTCC AGTGCCTTTG GTCATGOCAC TGCAGCTTTC 240 

AGGCCAATAC TGGATTAGCC TCTTAGTGTT CTTGTCCCTG CAGCCATTTC CCCAGGCAGC 300 

55 AATTCCATGT GCCCTCACTG ATGTAGGrTGG CTCTTGTGTC ATTTGTCACA TCCTATTQAA 360 

TTGTTTATGC ATCTTGTTCA CACTCACAGC ACCCTCCCTC TCACACGTCC TCCTTATAAA 420 

AATGTCCCTC AGTGTCTGCT ATGAGCCAGG TGCAGACTTA AGTGACAGGG CTGCTACGGG 480 

60 . 
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AAATAAAAAA TTAACAAGGA GCACCTGCCT CTTAATCCAC AGTMCAAAC TATGTTAAGT 540 

GTCAGGAAGG AAAGGTTAAG GATGCCAOGA AGGCTTTTAA TAAATAACCT GACTTAGATG 600 

5 GGCAGGTGGT GCTGARGATT AAGAACGTGT TCTTCTCGA 639 



10 (2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 744 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: double 

(D) TOPOUDGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

20 GAATTCGGCA CGAGAGTGGC TOGAGTCTGG CTGCAGAGGG AAGACATCAG CAGGGAGGGA 60 

GCCAGGGCCT GTCACATCTT TCCTCTGGCC ATTGTCCTGG TCTTTGTAAG CCCAGAATCT 120 

CCCCTTCCCT GAAGGGAGGC CAGCACCCCA GGAGQGCAGC AGGTGTGCTG TGAGGGTTGG 180 

25 

AGTAGTGTGA GAGGTCAGGG TACACTAGAA TGGCCATGGA CACCATGTOG GGGTGCTCTG 240 

GGCTGGGCXy^ CAGAACAGTG TCCTTCCTGC TGCTCCTCCC CTGCAGCTTC CCCCGACCTT 300 

30 GTNGTTTATT TGGTTTGATA CCAATCAGCA GACCCTGCAA GGTGGAAGCT CCCAGGCTCT 360 

CAGTCCCACS ACTCTCATGT GCCAGTCACC CNTACTGTAA CTGCCCAATG AGTACTTCTT 420 

GCCCACTGCC AAGATAGAGC CAGTTTACCA AGACAGGGGA ATTGCAGTAG AGAAAGAGTT 480 

35 

GAATATACAT AGAGCCAGCT AAATGGGAGA GTQGAGTTTT CTTATTACTT AAATCAGCCT 540 

CCCVTAAAAT TCAGAGGTGA GAATTTTTCA AGGACAGTTT GGTGGSCAGG CCTAGGGAAT 600 

40 GGATQCTGCT GATTGGCTAG GGATGCAATC ATAGGGGTGT AGAAAAGTWC CTTGTGCACT 660 

GAGTCCACTT TTGGTGAGAG CTACCAAGGA GCTGCTQGTC TGCTGGTCCC QGTAGAGCCA 720 

TCTGGTGTCA GGAATGCAAA AGTG 744 

45 



50 



(2) INFORMATION FOR SEQ ID NO: 94: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 526 base pairs 

(B) TYPE: . nucleic acid 

(C) STRANDECNESS : ' double 
55 (b) TOPOUDGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 



60 



QCAGGGGAAT TCGGCCACGG AQGGGTTrCA ACAGGGCCCG TGGGGTGAGG TGCARACACA 



60 
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AAGCCCATAA GTGCTGGCCT GTTGGGACAA ATGAGAGAAA TCCCATAGGG TGGTGATGAC 120 

AGCGCAYTCA GCCATCYTAY TCCTGGGGAA AATGAAACTT GTGCTCCTAT CAAATGCTCA 180 

5 GTTGTAAAAC TGGAAAAAAA TTTTAGAAGA CATCTTGTCC AGCATCTGTG TTTATGTCTA 240 

TAAAATGTAG AAAACTAAAG CACAGAGATG TTAAATGTTT TGTCCAAQGT CCAACAGCTG 300 

GTTAGCARGC TTGGTCTGGT GACCTTTCTA CTGAACCACA GTGCCGCTGG GGGAAGTCCT 360 

10 

OVGCACAGAT GGCTGCTGCT ATAGCTGGGG TATGGGCAGT ATTAGTAGTT AACCAGTCAA 420 

CCXy^AGTTCC CATAGTCTAG GTTCTGCTTC AGCTGGAGGT TAGGGAAAAA CACAAGAAftA 480 

15 TCCCTTACCA CTCTACCAGT GCTGGGGGAT GTACTAAGAG ATCCCC . 526 



20 (2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 426 base pairs 
(B) TYPE: nucleic acid 
25 (C) STRANDEI^^S: double . 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIOM: SEQ ID NO: 95: 

30 GGCACAGGGC AGGAGAGACT TGGTCCATGG GGAGAAGCCT GCAGTATAGA TGGGAGCTCC 60 

AGGAGCCCAA GTAGCATAGA CCCTGCTGAT CCGGGGCCAT TGAGCCAGAG GATTTGGGCT 120 

GAATGTCCCC AGAGACAAAA GGGAAAGGTA GATCCTTTCC CTTAAAGATG AAAGCCATCG 180 

35 

CCCGGGCTTG CTTATTGCTC TCTCTCCTGG TCCTTCCACA TGTTGTTTCT GAACATTTGT 240 

TCTGGCATCA CAATCCCCGT CATCCTGTCA TCTGGCCCTT CCCACCTTTC CACCTTATCT 300 

40 CITQCAGTGT CTCCGCGTCG ACCTGGCACC TGGGTGAARG CTTGCTCTTG CTGGTGCCCA 360 

TAGCCCCCAG TGTATGGTCT TGAMCTCCCC AGCCATATQG ARACCCACCT' CAGGAQGGCC 420 

CCTCGA 426 

45 



(2) INFORMATION FOR SEQ ID NO: 96: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
55 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

GGCACAGCGG CACGAGATAG GAAGCTTGGC AGGGGCAGCT CCCCCAGTGC GCATTGCCCT 60 
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GTAACTCGAG CGCCTGGGAG TGGGGAGAGG CTTGGAAATG GAGCAGGGTG GTGGACCTCG 120 

TCTTCTCCTG GTCATCCCAG GCCTCCTCCA TAACACCTAC CTAGCACGGC CTGGGGACTT 180 

CCCAGCCCAA GGAACAACTG AGAATACTGA GTGCCAGGGT AGCCCTAGCC CCATTTCACA 240 

CCTGGGCAAA GTC3AGGTCAC TGGATTCAAA CACTCAGATT TAAACCTCCT CTGTGTCTGC 300 

AGCACCTGTA TATAACTGCC AQCCTCTGCT GCCCCTCTCC AAAAAGTCTC TGCCCTTGTC 360 

TTTGGCACCT GTCTCTGTCC TCCCCATTCT CTGCTCCTCC TTTCTCCAAC TCAGANTCAC 420 

CCTGTTAGTT CAGCAAATGT TCATCGAGCT CCATAATGTA GCAGGACAGG NCTGTCTAAC 480 

15 AGATTCTGGN CTTGCAAGGG TGAGACAAGT ACTCTCCATC TTTCTCTCAT CTTCACAGAT 540 

GGTCTGCTCA ACAACTTTGC ACTGAAITGT AAATAATTGA TACTGCATAA AACATTGATG 600 

TTCTTTAAGG GTAGTCCAGC AAGGTGGCAA GTCTTATAAT GATAACTGCT CAAGGATCTC 660 

20 

TCAGTGAAGC ATTTGGGGST GCTAGCTCTG CTTATGGGTG AGGTCAGCTA TCTCACGCCA 720 

TCTACTTCCA CNTGCCCCCC CATGCCAGGC TCACCCTGAG CTGAGATGCC TGAGCAGGTG 780 

25 GCAGAAAGGA GCCACCTGGT TTATGCTTCG GGACXIACAAA CTCCTCTATC CAGANGACAG 840 

TTTT 844 



30 



(2) INFORMATION FOR SEQ ID NO; 97: 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 1985 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

(D) TOPOLOGY: linear 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

AQCCCTQCTG AAGTACAQGT TCTTCTATCA GTTTCTGTTG GGCAATGAAC GAGCAACAGC 60 

AAAGGAGATC AGGGATGAAT ATGTQGAGAC GCTGAGCAAG ATTTACCTGT CTTACTACCG 120 

45 

CTCTTACCTG GGGCGGCTCA TGAAGGTGCA GTATGAGGAA GTCGCTGAGA AAGATGATCT 180 

AATGGGTGTG GAAGATACAG CAAAGAAAGG ATTCTYCTCA AAGCCATCGC TCCGCAGCAG 240 

50 GAACACCATT TTCACCCTAG GAACCCGCGG CTCTGTCATC TCCCCCACTG AACTTGAGGC 300 

CCCCATCCTG GTGCCTCACA CAGGGCAGCG GNAGAGCAGA OGTATCCATT TGAGGCCCTC 360 

TTCCGCAGCC AGCACTACGS CCTCCTAGAC AATTCCTGCC GCGAATACCT TTTCATCTGT 420 

GAATTTTTTG TTGTGTCTGG CCCAGYTGCA CACGACCTGT TCCATGCTGT CATQGGCCGT 480 

ACACTCAGCA TGACCCTGAA ACACCTGGAT TCTTATCTAG CTGACTGCTA CGATGCCATT 540 

60 GCTGTTTTTC TCTGTATCCA CATTGTTCTC CGGTTCCGTA ACATTGCAGC AAAGAGGGAT 600 
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GTTCCTGCCC TGGACAGGTA CTQGGGAACA GGTGCTTGCC TTGCTATGGC CACC3GTTTGA 660 

ACTGATCCTG GAGATGAATG TTCAGAGCGT CCGAAOCACT GACCCCCAGC GCCTAGGGGG 720 

5 

GTTGGATACT CGGCCCCACT ATATCACACG CCGCTATGCA GAGTTCTCCT CCXOTCTTGT 780 

CAGTATCAAC CAGACAATTC CTAATGAACG GACCATGCAA TTGCTGGGAC AGCTGCAGGT 840 

10 GGAGGTGGAG AATTTTGTCC TCCGAGTGGC AGCTGAGTTC TCCTCAAGGA AGGAGCAGCT 900 

TGTGTTTCTG ATCAACAACT ATGACATGAT GCTQGGTGTG CTGATGGAGC GGGCTGCAGA 960 

TGACAGCAAA GAGGTTGAGA GCTTCCAGCA GCTGCTCAAT GCTCGGACAC AGGAATTCAT 1020 

15 

TGAAGAGTTG CTGTCTCCCC CTTTTGQGQG TTTAGTGGCA TTTGTGAAGG AGGCTGAGGC 1080 

TTTGATTGAG CX3TGGACAGG CTGftGCGACT TCGAGGGGAA GAAGCCCGGG TAACTCAGCT 1140 

20 GATCCGTGGC TTTGGTAGTT CCTQGAAATC ATCAGTGGAA TCTCTGAGTC AGGATGTAAT 1200 

GCX3GAGTTTC ACCAACTTCA GAAATGGCAC CAGTATCATT CAGGGAGCGC TGACCCAGCT 1260 

GATXXAGCTC TATCATCQCT TCCACCGGGT GCTGTCCCAG CCGCAGCTCC GAGCCCTCCC 1320 

25 

TGCCXX3GGCT GAGCTCATCA ACATTCACCA CCTTATGGTG GAGCTCAAGA AGCATAAQCr 1380 

CAACTTCTGA TGTGCCAGAA ACCGCCCTGA GATCTGCCGG TCATCTCCAT GGACTTCTGC 1440 

30 ACCCXaVTTCC ATACCCTTCT TCACCTQGGG TACCCCTTCC AGTTTTCCCC TTGCTTCCCA 1500 

GGCCCTTGAC ATGGCTTACC TGCCTTCACT CCCAGCACCT TGCCCAACAG GATAAGCTGG 1560 

ATCCCCTTGG CCTTCTGAAT ATCCCAGTGT CTTCAGGTTT CCCAAGACCA CTTCCCTGTG 1620 

35 

GGCTTCCAAA ATGGCCTTTA TCATTTCTCC AGTCTGTCAC CXTTCCTTTCC TGCTCCCATA 1680 

CACCCAAGGC TTGTTTCTTC CCCTGTAAAA ACCACTGCCT CAATCTCTGG TTCACTCAAC 1740 

40 TAGTCACCAT GTCCTGAGGC ATGAAGCCTC CTCASCTCTT GGAATTGCTG GCAAC3GGGTG 1800 

ACTGCCTCTG AGTCATTGTG TTTTTCAAAG TOATTTCTTT TCTGTAGCTT TTTGACCTAA 1860 

GAITCTCAGCA ATTTGAACAC TAACrTCTCC CXTTCCTOGCT CAAGAATTAC TCCGAAGTCA 1920 

GTCTGCAGAA AATAAATATT TASTATGACA TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1980 

AAAAA 1985 



45 



50 



(2) INFOPMATION FOR SEQ ID NO: 98: 



55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1416 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

ATATGAAGGG AAAGAATTTG AITATGTTTT CTCAATTGAT GTCAATGAAG GTGGACCATC 60 

5 ATATAAATTG CCATATAATA CCAGTGATGA CCCTTGGTTA ACTGCATACA ACTTCTTACA 120 

GAAGAATGAT TTGAATCCTA TGTTTCTGGA TCAAGTAGCT AAATTTATTA TTGATAACAC 180 
AAAAGGTCAA ATGTTGGGAC TTGGGAATCC CAGCTTTTCA GATCCATTTA CAGGTGGTGG ' 240 

10 

TCGGTATGTT CCGGGCTCTT CQGGATCTTC TAACACACTA CCCACAGCAG ATCCTTTTAC 300 

AGGTGCTGGT CGTTATGTAC CAGGTTCTGC AAGTATGGGA ACTACCATGG CCGGAGTTGA 360 

15 TCCATTTACA GGGAATAGTG CCTACCGATC AGCTGCATCT AAAACAATGA ATATTTATTT 420 

CCCTAAAAAA GAGGCTGTCA CATTTGACCA AGCAAACCCT ACACAAATAT TAGGTAAACT 480 

GAAGGAACTT AATGGAACTG CACCTGAAGA GAAGAAGTTA ACTGAGGATG ACTTGATACT 540 

20 

TCTTGAGAAG ATACTGTCTC TAATATGTAA TAGTTCTTCA GAAAAACCCA CAGTCCAGCA 600 

ACTTCAGATT TTGTQGAAAG CTATTAACTG TCCTGAAGAT ATTGTCTTTC CTGCACTTGA 660 

25 CATTCTTCGG TTGTCAATTA AACACCCCAG TGTGAATGAG AACTTCTGCA ATGAAAAGGA 720 

AGGGGCTCAG TTCAGCAGTC ATCTTATCAA TCTTCTGAAC CCTAAAGGAA AGCCAGCAAA 780 

CCAGCTGCTT GCTCTCAGGA CTTTTTGCAA TTGTTTTGTT QGCCAGGCAG GACAAAAACT 840 

30 

CATGATGTCC CAGAGGGAAT CACTGATGTC CCATGCAATA GAACTGAAAT CAGGGAGCAA 900 

TAAGAACATT CACATTGCTC TGGCTACATT QGCCCTGAAC TATTCTGTTT GTTTTCATAA 960 

35 AGACCATAAC ATTGAAGGGA AAGCCCAATG TTTGTCACTA ATTAGCACAA TCTTGGAAGT 1020 

AGTACAAGAC CTAGAAGCCA CTTTTAGACT TCTTGTGGCT CTTGGAACAC TTATCAGTGA 1080 

TGATTCAAAT GCTGTACAAT TAGCCAAGTC TTTAGGTGTT GATTCTCAAA TAAAAAAGTA 1140 

40 

TTCCTCAGTA TCAGAACCAG CTAAAGTAAG TGAATGCTGT AGATTTATCC TAAATTTGCT 1200 

GTAGCAGTGG GGAAGAGGGA CGGATATTTT TAATTGATTA GTGTTTTTTT CCTCACATTT 1260 

45 GACAT6ACTG ATAACAGATA ATTAAAAAAA GAGAATACGG TGGATTAAGT AAAATTTTAC 1320 

ATCTTGTAAA GTGGTGGGGA GGGGAAACAG AAATAAAATT TTTGCACTGC TGAAAAAAAA 1380 

AAAAAAAAAA AAAAGGAAAC TCGAGGGGGG GCCCGG 1416 

50 



(2) ZNFORllATION FOR SEQ ID NO: 99: 

55. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1935 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
60 (D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

NTCTACCCTA ATCAAGATGG GGACATACTT CGCGACCAGG TTCTTCATGA ACATATCCAG 60 

5 

AGATTGTCTA AAGTAGTGAC TGCAAATCAC AGAGCTCTTC AGATACCAGA GGTTTATCTT 120 

CGAGAAGCAC CATGGCCATC TGCACAATCA GAAATCAC3GA CAATAAGTGC TTATAAAACC 180 

10 CCCCGGGACA AAGTGCAGTG CATCCTGAGA ATGTGCTCTA CGATTATGAA CCTCCTGAGC 240 

CTGGCCAATG AGGACTCTGT CCCTGGAGCG GATGACTTTG TTCCTGTCTr GGTGTTTGTG 300 

TTGATAAAGG CAAATCCACC CTGTTTGCTG TCTACTGTGC AGTATATCAG TAGCTTTTAT 350 

15 

GCTAGCTGTC TGTCTGGAGA GGAGTCCTAT TGGTGGATGC AGTTCACAGC AGCAGTAGAA . 420 

TTCATTAAAA CCATCGATGA CCGAAAGTGA CCAAGACCAA GGCCCACCAA QGCAGCAGAC 480 

20 TGTTAATCAG ACAAACAGAT CTCTGAGAAG GTGCATCAGC TGCTTTGAAG GCTGAAGATT 540 

GTTTTGTATG ATACTGCACA GCATCAGGCA TTTTAAAGCA GATCTTTACT AAACAGGTTA 600 

ATGAGCTAAC AAGCAQGTTC TCTCGTCTTT GGGCTCTTTC CTTTCTGAGT TGCATATTCT 660 

25 

ATTTTCTTGT CCCCAAGTAG i«3ACTAGTAC TACAAAAftGG GACCACATTT TTCftAGTRTT 720 

TCTAAGTATA AAAAACAAAA. CAAAAATCTC TTAOGAAA.TG TCTASACCTC CATTCTTGGA 780 

30 TTCCCTTTCT TTCCTTTTAT TTTAAAAAAG AACAGTACCC CTCTTTTAAG ATQCTGTCTT 840 

ACATTAATGA GCATCTAATG GAAAGAAGGT ATGAGTTGCA CTGAGGATTA GAATAGTGGT 900 

GCGTTAGTGG CATTATCTAT AAATACACTC ACCTAAATTG AAAGCTAAGA AGGAAATGTA 960 

35 

AATATAATAT ATATTTATAT TTGATGTAAT ATGGACATCT GCAGATTCTA ATAAACAAGG 1020 

ACTATTGCTG ATAGTAGGCT GTGACATACT GTCTTGTGAA ATGGTTTCCT TGACAAAATT 1080 

40 TAAGCTGAGC TTAAAAGCAA AAAAACAAAA AGTACACAGA AATATTTATT AAAATGTAAT 1140 

ACAGTTTATT GAACTTTCTA GGTATGGAGT TTGATGGACA GGGCTGCCTY TAATGAGTGT 1200 

GAAGGTCACT AAGTCACTTA GACATCTCAC CGTGGAAGTT TGTGAGCCTG CATTAGGAGA 1260 

45 

TAGACTGATT ACCATACATG ACATAAAAAG GAACAGTGGA TAGCTCATAC TTTATGGTQG 1320 

TTCTTCTCCT CCGAAATAAT ATACTGCAGA AATCCCAGAC AGAGCTCCTT ACAAACCTTT 1380 

50 AATTGTAATA TATTTTTGAT GATTATTCAC ATTGAATGCA CAGACCAAGA ATTCAGTGAA 1440 

TGTCATTTTT TAAAAAACTA ATTTGTATTG TCTGCTCTAG TGATACAAGT TTTACTAGTG 1500 

ATAAACTATT TTAATCAACC ATACTATTCT TATQGAAAAA AATATCTATT TTGGCAQGTT 1560 

55 

TCTGriGOnT TATTTCCCTC TTCTGAAAAA AAGTCrQTGT TTTCATAGTT TGGTTTGCAT 1620 

TGTATATCAA TAATTAATCA GGAATGGGTT TTGGTGCCTG AAAAATTGGC CATGGAQGCA 1680 

60 CACCAAAGCT TCAAGCACAA GTCTTGTACA TGGGCCATCA CTGTCTGGTT TCACTTCGTG 1740 
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TGTTTCCTAA ACACATTTAG CTGCTTTTTT AACAAACTCA GCCCCATACT TGAGTCCCTT 1800 

GTTGTTGGGA GCATTTCCAG GCATCTTTTA AGGGAACTGT GACAAACAGC CTCGGGCAGA I860 

TGAACACGGA GGCTCTCTGT TGTCTGTCTC TGAGATCTTT GTGTCTGGGA ATGCCTAAAG 1920 . 

NTTTTGNTTT TTTTT 1935 

(2) INFORMATION FOR SEQ ID NO: 100: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 599 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEE^JESS: double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
GAATTCGGCA CGAGCGTCCA CGCflGCCGCC GGCCGGCCAG CACCCAGQGC CCTGCATGCC 60 
25 AGGTCGTTGG AGGTGGCAGC GftGACATGCA CCCGGCCCGG AAGCTCCTCA GCCTCCTGTT 120 
CCTCATCCTG ATGGGCACTG AACTCACTCA AGACTCCGCT GCCCCCGACT CCCTGCTGAG 180 
AAGTTCAAAG GGCAGCACGA GGGQGTCTTT GGCTGCTATT GTCATCTGGA GGGGGAAGAG 240 

30 

TGAGAGCCGG ATAGCCAAGA CCCCAGGCAT TTTCAGAGGT GGCGGGACCT TAGTCCTACC 300 
CCCAACACAC ACCCCTGAGT QGCTCATCCT CCCTTTGGGC ATAACGCTGC CCTTGGGGGC 360 
35 TCCAGAAACA GGCGGTGGGG ATTGTGCCGC TGAGACCTGG AAGGGCAGCC AGCGTGCCGG 420 
CCAGCTGTGT GCATTGCTGG CTTMTATGC AGGGCTTGGG GGGCTGTGGC CACATGCCCG 480 
GCAGGAGGTG AGTGAGGAGC CCTGTGGCGT QCTGGTGTQG GGATCGTGGG CATTTCAAAC 540 
GGGCTTGTCG TACCCTGAAC AATGTATCAA TAGAGAAAAA AAAAAAAAAA AAAACTCGA 599 



40 



45 



55 



(2) INFORMATION FOR SEQ ID NO: 101: 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 784 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTIOI: SEQ ID NO: 101: 

GAATTCGGCA CAGAAAAAAA AGAGAGACTG GGTCTTACTG TGTTGCCCAG ACTTGTCTTG 60 

AACTCCTGCC TCAGCCTCTC AAGTACTTGG GATTATAGGC CAAGAAGCCA CCATGCCTAG 120 

60 CTTCTTCCTG TCATTGATCC AGACTAATAC TCTGGGGTCA GCCTCATTTC TTCTCTTTCT 180 
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CACTTTGCAC ATCCACTTCT CACCAAATCK I«3TTCATTCT GCATCCTAAG TAAGTCCTTT 
GATTCCTCCA GTTGTTCATT AGTAATGTCT CAARTGTAAT TTTTTCTAGT AGTTTTCAGC 
CTGTGTTTCC KGCCTTCAGT CTTAACTTCT CXy\GTACATA KGCCACATTC TTGTCAGCAK 
GATCAWATTT TATTTAAAAA TACTTTACAW AKGTTTATKG CCAAATATTA GRAAATACAG 
ATTCATGGAA AGAAAAATCA CTGTCCCAAG GAGGTCACTG GCATGGTGAG GTTAAGGGGT 
GATTTTAATT TTTAAAAATG TATATTTTTT CXrTGTGTAGA GTAGTAACAC CCTTCAAAAC 
ACAWrcCCTT GTAAAGTCTC TAATIXTOTA CTCCX3CATCT AGSTGRTCTC TTCTTTCTCA 
GATATTTTAC AATTTCATTT ATCACXyVCCT TTCTCTAGCC TTTACCCGTC TCTTCAATAT 
TWACATATGC AGAAGTTTCT CCTAACAAAC ACCTGCCTCT GCCTCAGTTC TCCTACCACC 
CTGTTGCTTT CTTTCCTTTC ACAATCAAAT TTAAGAGTGT CAAAAAAAAA AAAAAAAAAC 
TCGA 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1035 base pairs 
(B> TYPE; nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTICaJ: SEQ ID NO: 102: 
AGAGGCCTGG CTGCGTTGCC CTATCTCCGT CTCCGCCACC CACTTAGCGT TTTAGGCATC 
AATTACCAGC AGTTTCTCCG CCACTATCTC GAAAATTACC CGATTCCTCC CGGCAGAATA 
CAAGAGCTTG AAGAACGCCG CAGTTQCGTG GAAGCCTGCA GAGCAAGGGA AGCAGCGTTT 
GATGCCGAAT ATCAGCGAAA TCCTCACAGG GTGGACCTCG ATATTTTAAC CTTTACGATA 
GCTCTGACTG CCTCTGAAGT TATCAAOCCT CTGATAGAAG AACTTGGT1G CGATAAGTTT 
ATCAATAGAG AATAGTTAGG TGGTGACACT ACTTCAAGAG AACCTCTGCA TTCCAGTCAT 
ACCAATCCTG CAACTTGATT TTCAGAAGTC AAGAGTATAT CGCGATAAGA CAGTCCACAG 
GTGGAQGGGA AAAAAAQGGG GAGGQGGAAG CTTATCTTGA AAAAGCATCA CAGAAGTAGA 
AAAAAATGTC GAAAGCATTA TAACTGTAAC GTTCTTTGAG TTTGTGATTG ATCCACATTT 
TTCCCCCTGC ATTATGGAAA ATGTCTCTCA GCATTGCTTT ATTACAAAGT AAAGGATGGT 
TTTATAAAAT TCAGACTGAT GAAACATCAA TACTAGAGCC CATGAGGATG AAAGAAATTA 
TCAAATAGTG CTGAACAGAA TAAGATGTTA AOGCTGAGTT ATTAGGACTG GAAGGCTATG 
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353 

AAAAGAACTT GAAATTGTCG GAATATGTGC TCTCTTCATG TCATATTCAA TAGAAGTTTC 780 

TAGTTTAAGA TTGATTTTGT GTTTTCTTAG GCATTTCAAG TGACAAGCAA AGTAAATGTA 840 

TATATTATGT GATAAATCAT GTTTTCAAGA ACGTCAAATT TCTGGACTTT TTTCTTTCAA 900 

TTTTTAATTT TTAAAGTTTT TTTGGTATTA AAA7ATCYAT TCACAAGCCA AAAAATWIMT 960 
WAAATWIVOl GCX3AAAAGCC AAAAAAAAAA AAAAMMAGGG GGGGCCGGGC CCCATCCCCC " 1020 

CAAQGQGGTC CNGNT 1035 

(2) INFORMATION FOR SEQ ID NO: 103: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2218 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
CD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

AGGTATTAGG CCCTTTTGTG GGAGCCCCAT GTTTTGTTTT TCTGAGTTGG TGGGGAGGGA 60 

SGGAGGGGGA GQGCTGAATT GTTTTGCAGA QGAAGATGGC ATCTGTGCTT TAAATTTCTC 120 

30 ATTACTQGGT TAGAAAACAA AGAGGGAKTG CCCTGCACAT TTTCTTTTGT GCTTTTAAAT ISO 

GTTTCTTAAG TTGGAACAGG TTTCCTCGGG CCTGTTTTGA CTGATTGCTG GAGTGCATTT 240 

GATAGTTAAA AATTACTAAT TGGTTTTATT TCCCTTCACA CTCTGCCTCC CCACTTCTCC 300 

35 

CCCCGTTACT GAAAAATAAC CA1TTTAGTG TCAGGCTAGA AATTGAATTG CTGAGTTTTG 360 

TGTATCCTTT AAATTAAAAA CCACAAGTGT TTATTGTAGT GGTTAAACTG TAGCATCTCA 420 

40 GCATCTGGGT GGAAGCTGCC TATATTTCTT CCCAGTTTAA CTGC5GGACCA TCTGTGAAAT 480 

TAATTTTCCA TCCAGACAGC TGCTGTGAGC AAATGAACAT AAATGCTCGC TQGAAATTTA 540 

CTAACCAGTT TTTATATTGA CCTGCAGTGT AAAAAGCACA TTTAATTATA AACAATATAT 600 

45 

TCAAAATGGG CAAATTTTAT TTTCAAATGC AGTGTAGAGC TAGATTAAAA GCAACTCTTT 660 

GCCACCTACT CTGCCCTTTT GGCAAAGTTA CCTTGAACAA AGAATCTTAA GGGTTTATTA 720 

50 AGAACTCTTT ATTTTCTTCA TACCCTGTTC TCTGCAGTGC TTTCTAACAG CTTCTGGGTG 780 

CAGATTTTCT TCGGCATCCT TTTGCACTCA GCTTATTACA GGTAGGTAGT GCTTAAGAAA 840 

AGTCATGGAG GACTAAAGCC TAAGTCCTTT TCACTTTTCC TCCATCTGAA GGTAGGTGAG 900 

TTCATCCTCT TCATAGTAAT GCTGTTTTAC CAAGACTTTA TAGCAGATGG ACCCAGAAAG 960 

AATTTTCTGC TATTGTGTTC ACTACAACAG GATAGGGACA TCAGACAGCC CCAGAAACCC 1020 



60 CTTCCauGATC TGATATQGGA CTATTAATTT TTATtSCTGTT AATTGGTATT CATTCACAAT 



1080 
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GCAGTTGAAG GGC3GAAGGCT CCACTCCATT CTTTGGCTAA GGCCTGAATC CTTCCTCATC 1140 

^ TGTAAGATCT MACTCGAGG TTrTGTTTTC CTTTTAAAAT TCTTTAGGGA GAGAGGGATC 1200 

GTTTCTGAGG GGTTCTGAAA C3TATGATTCA ATGTGCAACA TACAGGTAGG TCTTCAGCAT 1260 

AAGCTGAAAT ATATGCATGT AAAAACTTTG ACATCTTTTT rrTTAATTTT CCACTTTCTT 1320 

10 CTTAACTTTA CTTCTCTTTT TGTCCCCCCC CCATCTTACA GAACTTGAGG CCAAGGGAGA 1380 

ATGGTAGGCA CAGAAGAAAC ATGGCAAACT GCTCTGTGCT TTCAAACCAA AGTCTTCCCC 1440 

CCAACCCCAA ATTTGTCTAA GCACTGGCCA GTCTGTTGTG GGCATTGTTT TCTACAACCA 1500 

AATTCTGGGT TTTTTTCTTC TTTCTTTAAA CATAGftGGTA CCACCACAAG GGATCCCCTA 1560 

CTCTCTCGCA GCTCTTGAAA GCATCTGTTT GAQGGAAAGG TCTCTGGGCA AGCAAGTGGT 1620 

20 TATTTGGATT GCTTGCTTCC CTTTTTCCAC CTGGGACATT GYAATCATAA AATAACAGTA 1680 

AATTCCAAAC CTCAAAAACT ATTATGGCCT GAGCACAGCT GAAATCTAGC AGAGTTTAAC 1740 

TCTTCTGCCT CCATGTCTGT . CACTTATAAT TCAGGTTCTC CTGTTQGCTT CAGAACATGA 1800 

GCAGAAGAAT CGTTTTATGC TAGTTATTGC ATTCATGGTT GAAACTCAAC TTAGGGAAAG 1860 

GGTTCCAATG TATTAAGCAA TGGGCTGCTT CTCCCCAATC CTCCCTAACA ATTCGTTGTC 1920 

30 TGGACTTCTC ATCTAAAAGG TTAGTGGCTT TTGCTTQGGA TCAGTCCTCT CTATTGA3X3T 1980 

TCTTGCTGGT CTCCAGACAC ATTCCTGTTG CATTAAGACT TGAAAGACTT GTAGATOTGT 2040 

GATGTTCAGG CACAGGATGC TGAAAGCTAT GTTACTATTC TTAGTTTGTA AATTGTCCTT 2100 

TTGATACCAT CATCTTGTTT TCTTTTTGTA GGTATAAATA AAAACACTGT TGACAATAAA 2160 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 2218 

40 

. (2) INFORMATION FOR SEQ ID NO: 104: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANI^DNESS: double 

(D) TOPOLOGY: linear 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

CTTCACAGAC TGACAGAATG Grm^ ' XTn ' GTTrXX>TiTr GTTT i ' GTm ' GTTTTTGAGA 60 

55 TGGACTCTAG CTCTGTCACC CAGGCTGGAG TGCAGTGGTG CGATCTCGGC TCACTCCAAG 120 

CTCCGCCTCC CGGGTTCTCA CCATTCTCCT GCCTCAGCCT CCCGAGTAGC TGGGACTACA 180 

GGCGCCCACC ACCACGCCCG GCTAATTTTT TGTATTTTTT AGTAGAGACG GGGTTTCACC 240 

60 
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ATGTTAGCCA GGATC3GTCTC GATCTCCTGA CCTQGTGATC CGCCCGCYTC GGCXrTCCCAA 300 

AGTGCTGGGA TTACAGGCGT GAGCCACCGT GCCTGCCCCA GAA-KSGrTTT TAAAGCCACA 360 

5 GTTGAGARGC CACCCATTGC CCGGCGCCTG GACAGTGATC ATCTTGTTCA TCTTCTTCAG 420 

TCCTTTCTTG TGTCATTGGA ATTATTCATC CCCTTTGAAA GATGAGAAGG TTGAGATCCA 480 

AAGAGTCTAC CTTTCCAAGT TCTCACTGCT GGAAAGARCT AGAAGCACAG TTCAAAGTTC' 540 

TGCMTCrGG ACrCTGCAGT CXIAGGTOrCC CTTYTCCCAC TTGOCTACCC TCAATCCCAC 600 

ACTGTTTTTG AAGTGGCCCA TAACTTGAAG GRAAAGTTTA AAGACAGTTC AATTTAATCA 660 

15 TCAGRATGCA TTCTTTTTTT TTTCGGARAC GGAKTTTCAC TCTTCCTCCC CASGCTGGAG 720 

TGCAATGGTG CAATGATCTC GGCTCACTGC AACCTATGCC TCCTGGGTTC AAGNGATTAT 780 

CCAGCCTCAG CCTCCCGAGT AGCIGGGATT ATGGGCGCCC ACCACCATCC CCACSCTAATT 840 

.TTTGTATTTT TTTTTTTAGT AGAGATOGGG TTTCQCCAGG TTGGCCAGGC TCRTCTTOTC 900 

AAVTCCTGGC YTCAGGTCAT YTGCCCACYT CATCYTCCAA AAGTCCTCGG ATTACAGGCA 960 

25 TGAGCCACTG CGCCTGGCYT CAGAATGCAT TCTTACACAT CTATCCTAGA CATTTATAAG 1020 

CACTCTAATG GATAACAATC CAAGAATAAA TGATTGTAAA AGATCATCCC GAAGAGTTGA 1080 

TGTCAATCTT TTTTTCCTAA GAAAAAAAGT CCGCGAGTAT TAAATATTTA GATCAATGTT 1140 

TATAAAATGA TTACTTTGTA TATCTCATTA TTCCTATTTT GGAATAAAAA CTCACCTTCT 1200 

TTAATCATAT ACTTGTCTTT TOTAAATAGC AGCTITTGTG TCATTCTCCC CACTTTATTA 1260 

35 GTTAATTTAA ATTGGAAAAA ACCCTCAAAC TAATATTCTT GTCTCTTCCA GTCTTATAAA 1320 

TAAAACTTAT AATGCATGTA AAAAAAAAAA A 1353^ 



40 



45 



50 



(2) INFORMATICN FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2066 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doxjble 

(D) TOPOLOGY: linear 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
GGCACGAGGC GGCGGAGGGC CACAATCACA GCTCCGGGCA TTGQGGGAAC CCGAGCCGGC 

TGCGCCGGGG GAATCCGTGC GGGCGCCTTC OGTCCCGGTC CCATCCTCGC CGCGCTCCAG 120 

CACCTCTGAA GTTTTGCAGC GCCCAGAAAG GAGGCGAGGA AGGAGQGAGT GTGTCAGAGG 180 

AGGGAGCAAA AAGCTCAOCC TAAAACATTT ATTTCAAGGA GAAAAGAAAA AQGGGGGGCG 240 

60 CAAAAATGGC TGGGGCAATT ATAGAAAACA TGAGCACCAA GAAGCTOIX3C ATTOITO^ 300 
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GGATTCTGCT CGTCTTCCAA ATCATCGCCT TTCTGGTGQG AGGCTTGATT GCTCCAGGGC 360 

CXaCAACGGC AGTOTCCTAC ATGTCGGTGA AATGTGTGGA TGCCCGTAAS AACCATCACA 420 

5 

AGACAAAATG GTTCX?rGCCT TGGGGACCCA ATCATTGTGA CAAGATCCGA GACATTGAAG 480 

AOSCAATTCC AAGGGAAATT GAAGCCAATG ACATCGTGTT TTCTGTTCAC ATTCCCCTCC 540 

10 CCCACATGGA GATGAGTCCT TGGTTCCAAT TCATGCTGTT TATCCTGCAG CTGGACATTG 600 

CCTTCAAGCT AAACAACCAA ATCAGAGAAA ATGCAGAAGT CTCCATGGAC GTTTCCCTGG 660 

CTTACCGTGA 1GACGCATTT GCTGAGTGGA CTGAAATGGC CCATGAAAGA GTACCACGGA 720 

15 

AACTCAAATG CACCTTCACA TCTCCCAAGA CTCCAGAGCA TGAGGGCCGT TACTATGAAT 780 

GTGATGTCXT TCXTTTTCATG GAAATTGGGT CTGTGGCCCA TAAGTTTTAC CTTTTAAACA 840 

20 TCCGGCTGCC TGTGAATGAG AAGAAGAAAA TCAATGTQGG AATTGGGGAG ATAAAGGATA 900 

TCCGGTTGGT GGGGATCCAC CAAAATGGAG GCTTCACCAA GGTGTGGTTT GCCATGAAGA. 960 

CCTTCCTTAC GCCCAGCATC TTCATCATTA TGGTOTGGTA TTGGAGGAGG ATCACCATGA 1020 

25 

TGTCCCGACC CCCAGTGCTT CTGGAAAAAG TCATCTTTGC CTTTGGGATT TCCATGACCT 1080 

TTATCAATAT CCCAGTGGAA TGGTTTTCCA TCGGGTTTGA CTGGACCTQG ATGCTGCTGT 1140 

30 TTGGTGACAT CXXSACAGGGC ATCTTCTATG CGATQCTTCT GTCCTTCTGG ATCATCTTCT 1200 

GTQGCX3AGCA CATGATGGAT CAGCACGAGC GGAACCACAT TGCAGGGTAT TGGAAGCAAG 1260 

TCGGACCXIAT TGCCGTTQGC TCCTTCTGCC TCTTCATATT TGACATGTGT GAGAGAGGGG 1320 

35 

TACAACTCAC GAATCCCTTC TACAGTATCT GGACTACAGA CATTGGAACA GAGCTGGCCA 1380 

TGGCCTTCAT CATCGTGGCT GGAATCTGCC TCTGCCTCTA CTTCCTGTTT CTATGCTTCA 1440 

40 TGGTATTrCA GGTGTTTCGG AACATCAGTG GGAAGCAGTC CAGCCTGCCA GCTAT6AGCA 1500 

AAGTCCGGCG GCTACACTAT GAGGGGCTAA TTTTrAQGTT CAAGTTCCTC ATGCTTATCA 1560 

CCTTQGCCTG CGCTGCCATG ACTGTCATCT TGTTCATCGT TAGTCAGGTA ACGGAAGGCC 1620 

45 

ATTGGAAATG GGGGGGCGTC ACAGTCCAAG OXSAACAGTGC CTTTTTCACA GGCATCTATG 1680 

GGATGTGGAA TCTGTATGTC TTTGCTCTGA TGTTCTTGTA TGCACCATCC CATAAAAACT 1740 

50 ATGGAGAAGA CCAGTCCAAT GGAATGCAAC TCCCATGTAA ATCGAGGGAA GATTGTGCTT 1800 

TGTTTGTTTC GGAACTTTAT CAAGAATTGT TCAGCXKriTC GAAATATTCC TTCATCAATG 1860 

ACAACGCAGC TTCTGGTATT TGAGTCAACA AGGCAACACA TGTTTATCAG CTTTGCATTT 1920 

55 

GCAGTTGTCA CAGTCACATT GATTGTACTT GTATAOGCAC ACAAATACAC TCATTTAGOC 1980 

TrrAOCrCAA AATGTTAAAT ATAAGGAAAA AAGCGTCAAC AATAAATATT CTTGAGTATA 2040 

60 AAAAAAAAAA AAAAAAAAAA AAAAAA ^ 2066 
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5 (2) INFOPMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 1705 base pairs 

(B) TYPE: nucleic acid 
10 (C> STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTICW: SEQ ID NO: 106: 

15 AATTCGGCAK AQGGCAGCTG TCGGCTGGAA GGAACTGGTC TGCTCACACT TGCTGGCTTG 60 

CGCATCAGGA CTGGCTTTAT CTCCTCSACTC ACGGTGCAAA GGTGCACTCT GCGAACGTTA 120 

AGTCCGTCCC CAGCGCTTGG AATCCTACGG CCCCCACAGC CGGATCCCCT CAGCCTTCCA 180 

20 

GGTCCTCAAC TCCCGYGGAC GCTGAACAAT GGCCTCCATG GGGCTACAGG TAATGGGCAT 240 

CGCGCTQGCC GTCCTGGGCT GGCTQGCCGT CATGCTGTCC TGCGCGCTGC CCATGTQGCG 300 

25 CGTGACGGCC TTCATCGGCA GCAACATTGT CACCTCGCAG ACCATCTGGG AGGGCCTATG 360 

GATGAACTGC GTGGTGCAGA GCACCGGCCA GATGCAGTGC AAGGTGTACG ACTCGCTGCT 420 

GGCACTGCCG CAGGACCTGC AGGCGGCCCG CGCCCTCGTC ATCATCAGCA TCATCGTGGC 480 

30 

TQCTCTGGGC GTGCTGCTGT CCGTGGTGGG QGGCAAGTGT ACCAACTGCC TGGAGGATGA 540 

AAGCGCCAAG GCCAAGACCA TGATCGTGGC QGGCGTGGTG TTCCTGTTGG CCGGCCTTAT 600 

35 GGTGATAGTG CCGGTGTCCT GGACGGCCCA CAACATCATC CAAGACTTCT ACAATCCGCT 660 

GGTGGCCTCC GGGCAGAAGC GGG^TGGG TGCCTCGCTC TACGTCQGCT GGGCCGCCTC 720 

CGGNCTGCTG CTCCTTGGCG GGGQGCTGCT TTGCTGCAAC TGTCCACCCC GCACAGACAA 780 

40 

GCCTTACTCC GCCAAGTATT CTGCTGCCCG CTCTGCTGCT GCCAGCAACT ACGTGTAAGG 840 

TGCCACGGCT CCACTCTGTT CCTCTCTGCT TTGTTCTTCC CTGGACTGAG CTCAGCGCAG 900 

45 GCTGTGACCC CAGGAGGGCC CTGCCACGGG CCTkCTGGCTG CTGGGGACT6 GGGACTGGGC 960 

AGAGACTGAG CCAGGCAGGA AGGCAGCAGC CTTCAGGCTC TCTGGCCCAC TCGGACAACT 1020 
TCCCAAGGCC GCCTCCTGCT AGCAAGAACA GAGTCCACCC TCCTCTGGAT ATTGQGGAGG ■ 1080 

50 

GACGGAAGTG ACAQGGTGTO GTGGTGGAGT GGGGAGCTGG CTTCTGCTGG CCAGGATGGC 1140 

TTAACCCTGA CTTTGGGATC TGCCTGCATC GGTGTTGGCC ACTGTCCCCA TTTACATTTT 1200 

55 CCCCACTCTG TCTGCCTGCA TCTCCTCTGT TGCQGGTAGG CCTTGATATC ACCTCTGGGA 1260 

CTGTGCCTTG CTCAGOGAAA CCCGCGCCCA GGAGTATGGC TGAGGCCTTG CCCACCCACC 1320 

TGCCTGGGAA GTGCAGAGTG GATGGACGGG TTTAGAGGGG AGGGGCGAAG GTGCTGTAAA 1380 

60 
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CAGGTTTGGG CAGTGGTC3GG GGAGGGGGCC AGAGAGGCGG CTCAGGITGC CX»CXrrCTGT 1440 

GGCCTCAGGA CTCTCTGCCT CACCCGCTTC AGCCCAGGGC CCCTGGAGAC TGATCCCCTC 1500 

5 TGAGTCCTCT GCCCXOTCCA AGGACACTAA TGAGCCTGGG AGGGTGGCAG GGAGGAGGGG 1560. 

ACAGCTTCAC CCTTGGAAGT CCTGGGGTTT TTCCTCTTCC TTCTTTGTGG TTTCTGTTTT 1620 
GTAATTTAAG AAGAGCTATT CATCACTGTA ATTATTATTA TTTTCTACAA TAAATGGGAC ' 1680 

10 

CTGTGCACAG GRAAAAAAAA AAAAG 1705 



15 

(2) INPORMATIGN FOR SEQ ID NO: 107: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1167 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiable 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

25 

TGCAGGAATT CGGCAGAGGT TTTCOGCTAG ACTCTGGCAG TTGGTGAGCA TCATGGCAAC 60 

CGTTACAGCC ACAACCAAAG TCCCGGAGAT CCGTGATGTA ACAAGGATTG AGCGAATCGG 120 

30 TGCCCACTCC CACATCCGGG GACTQGQGCT GGACGATGCC TTGGAGCCTC GGCAGGCTTC 180 

GCAAGGCATG GTGGGTCAGC TGGCGGCACG QCGQGCGGCT GGCGTGGTGC TGGAGATGAT 240 

CCGQGAAGGG AAGATTGCCG GTCGGGCAGT CCTTATTGCT GGCCAGCCGG GCACGGGGAA 300 

35 

GACGGCCATC GCCATGGGCA TGGCGCAGGC CCTGGGCCCT GACACGCCAT TCACAGCCAT 360 

CGCCGGCAGT GAAATCTTCT CCCTCGAGAT GAGCAAGACC GAGGCGCTGA CGCAGGCCTT 420 

40 CCGGCGGTCC ATCGGCGTTC GCATCAAGGA GGAGACGGAG ATCATCGAAG GGGAGGTGGT 480 

GGAGATCCAG ATTGATCGAC CAGCAACAGG GACGGGCTCC AAGGTGGGCA AACTGACCCT 540 

CAAGAOCACA GAGATGGAGA CCATCTACGA CCTOGQCACC AAGATGATTG AKTCCCTGAC 600 

45 

CAAGGACAAG GTCCAGGCCG GGGACGTGAT CACCATCGAC AAGGCGACGG GCAAGATCTC 660 

CAAGCTGGGC CGCTCCTTCA CACGCGCCCG CGAACTACGA CGCTATQGGC TCCCAGACCA 720 

50 AGTTCGTGCA GTGCCCAGAT GGGGAGCTCC AGAAACGCAA GGAGGTGGTG CACACCGTGT 780 

CCCTGCACGA GATOGACGTC ATCAACTCTC GCACCCAGGG CTTCCTGGCG CTCTTCTCAG 840 

OTGACACAGG GGAGATCAAG TCA6AAGTCC GTGAGCAGAT CAATGCCAAG GTGGCTGAGT 900 

55 

GGCGCGAGGA GGGCAAGGCG GAGATCATCC CTGGAGTGCT GTTCATCGAC GAGGTCCACA 960 

TGCTGGACAT CGAGAGCTTC TCCTTCCTCA ACOQQGCCCT GGAGAGTGAC ATGGCGOCTG 1020 

60 TCCAGCAGGT CTATGGGGAT GCCGTGAGGG CTCTGGTAGC TGGTGCCCCG GATTCGCGTG 1080 



wo 98/54963 



PCTAJS98/n422 



359 



ATGCXACGGT TGGTGOXTC GTGCXX3AATT CCTGCAGCCC GGGGGATCCA CTAGTTCTAG 1140 
AGCGGCCGCC ACCGCGGTGG ANCTCCN 1167 

5 



(2) INFORMATION FOR SEQ ID NO: 108: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1907 base pairs 

(B) TYPE: nucleic acid 

. (C) STRANDEDNESS: double 
15 . (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

GGCACAGGGG AATCATCGTG TGATGTGTGT GCTGCCTTTG TGAGTGTGTG ' GAGTCCTCCT 60 

20 

CAGGTGTTAG GTACAGTGTG TTTGATCGTG C3TQGCTTGAG GGGAACCCTT GTTCAGAGCT 120 

GTGACTGCGG CTGCACTCAG AGAAGCTGCC CTTQGCTGCT CGTAGCGCCG GGCCTTCTCT 180 

25 CCTCGTCATC ATCCAGAGCA GCCAGTGTCC QGGAGGCAGA AGGTACCGGG GCAGCTACTG 240 

GAGGACTGTG CGGGCCTGCC TQGGCTGCCC CCTCCGCCGT GGGGCCCTGT TGCTGCTGTC 300 

CATCTATTTC TACTACTCCC TCCCAAATGC GGTCGGCCCG CCCTTCACTT GGATGCTTGC 360 

30 

CCTCCTGGGC CTCTCGCAGG CACTGAACAT CCTCCTGGGC CTCAAGQGCC TGGCCCCAGC 420 

TGAGATCTCT GCAGTGTGTG AAAAAGGGAA TTTCAACGTG GCCCATGGGC TGQCATGGTC 480 

35 ATATTACATC GGATATCTGC GGCTGATCCT GtCAGAGCTC CAGGCCCGGA TTCGAACTTA 540 

CAATCAGCAT TACAACAACC rPGCTACGGGG TGCAGTGAGC CAGCGGCTGT ATATTCTCCT 600 

CCCATTGGAC TGTQGGGTGC CTGATAACCT GAGTATGGCT GACCCCAACA TTCGCTTCCT 660 

40 

GGATAAACTG CCCCAGCAGA CCGGTGACCG TGCTGGCATC AAGGATCGGG TTTACAGCAA 720 

CAGCATCTAT GAGCTTCTGG AGAACGGGCA GCGGGCGGGC ACCTGTGTCC TGGAGTACGC 780 

45 CACCCCCTTG CAGACTTTGT TTGCCATGTC ACAATACAGT CAAGCTGGCT TTAGOGGGGA 840 

GGATAGGCTT GAGCAGGCCA AACTCTTCTG CCGGACACTT GAGGACATCC TGGCAGATGC 900 

CCCTGAGTCT CAGAACAACT GCCGCCTCAT TGCCTAOCAG GAACCTGCAG ATGACAGCAG 960 

50 

CTTCTCGCTG TCCCAGGAGG TTCTCCGGCA CCTQCGGCAG GAGGAAAAQG AAGAQGTTAC 1020 

TCrrGGGCAGC TT6AAGACCT CAGOGGTGCC CAGTACCTCC ACGATGTCCC AAGAGCCTGA 1080 

55 GCTCCTCATC AGTGGAATGG AAAA3CCCCT CCCTCTCCGC ACGGATTTCT CTTGAGACCC 1140 

AGGGTCflCCA GGCCAGAQCC TCCAGTQGTC TCCAAGCCTC TGGACTQGGG GCTCTCTTCA 1200 

GTGGCTGAAT GTCCAGCAGA GCTATTTCCT TCCACAGGGG GCCTTGCftGG GAAGGGTCCA 1260 

60 



wo 98/54963 



360 

GGACTTGACA TCTTAAGATG CXJTCTTGTCC CCTTGGGCCA GTCATTTCCC CTCTCTGAGC 
CTCGGTGTCT TCAACCTGTG AAATGGGATC ATAATCACTG CCTTACCTCC CTCACGGTTG 
TTGTGAGGAC TGAGTGTGTG GAAGTTTTTC ATAAACTTTG GATGCTAGTG TACTTAQQGG 
GTGTGCCAGG TGTCTTTCAT GGGGCCTTCC AGAOCCACTC CCCACCXrTTC TCCCCTTCCT 
TTGCCXX3GGG ACX3CCXSAACT CTCTCAATGG TATCAACAGG CTCCTTCGCC CTCTQGCTCC 
TGGTCATGTT CCATTATTGG GGAGCCCCAG CAGAAGAATG GAGAGGAGGA GGAGGCTGAG 
TTTGGGGTAT TGAATCCCCC GGCTCCCACC CTGCAGCATC AAGGTTGCTA TGGACTCTCC 
TGCCGGGCAA CTCTTGCGTA ATCATGACTA TCTCTAGGAT TCTGGCACCA CTTCCTTCCC 
TGGCCCCTTA AGCCTAGCTG TGTATCGGCA CCCCCACCCC ACTAGAGTAC TCCCTCTCAC 
TTGCQGTTTC CTTATACTCC AOXCTTTCT CAACGGTCCT TTTTTAAAGC ACATCTCAGA 
TTAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAGGG CGGCCGC 

(2) INFORMATION FOR SEQ ID NO:. 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 611 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
ATGAATTAAC GCCAAGCTNT NAATAGGGAC TCACTATGGG GGAAAGNTGG GTAACGCCTG 
CAGGTACCGT TCCGGAATTC CCGGGTCGAC CCAOGCGTCC GATGGGGCTT TAGTAAATCA 
GGCTTGCAGG CTCAAAGCTG CAATCTQCCC ACTCTCAGGT ACTGAGACTT TGTQGGCCTC 
AGACACCAGG AASAAAGTTG GGATACAGTC ATTTGAGTTA AAAAGGGAAT GACCCCTCAG 
AAACCCGCAT TAQCAGTGTT ACTCTTQGAA GTGCCTTTAC TTTTAACGCT CTCTGTTCTG 
AAAAAGAGGT GTTTGGTTAC GTGTGAGCCA ACATCACGTT TTGTTAGCTG TGATTTACCT 
TTGTCCGTTT AAAAGACTTC ACGGAGCCAT TCTGTATACA AGGTGTGCTC TTTCCAATGT 
AGAAGGGGTT ATGGAAAAGG GTGCGATCCT TTGCTGTAAA CTGGAGAGAC CAGTCCCAAA 
CAGAGGQGAA TTTTAAGCCC TTCTCATCAC CCAATTGGAT GTTTTTGCTT ATAQCAAATT 
CCTGCAAAAT AAATAAATAA ATATITGCAA AACTAAAAAA AAAAAAAAAA AAAAAAAAAA 
GGGGGCarcCN C 
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(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUE^K:E CHARACTERISTICS: 

(A) LENGTH: 2632 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 110: 

TCCCAGCTCT CAGGACAAGG GCCCTGGGCG ATCTTTTAAA AAAGCCGATT GGGTGTCTTT 60 

CTAAAAOTAC AACCAGTACT .TCATCGTCAA GTTTCTGGC5A AGGGAGTCCC CTCCAGATTC 120 

15 TCATGGAGTG ACAAATCTTG ACTCTTGCTC CTGGAATTTT TCAGGCCCAA ACTAGCGTTT 180 

CTACAATGAT TTATTTGGCA AATTTGTCTT GATTATGGGT GGCTGATGAG GAACGTGCTT 240 

TTC3TTAQGAA CCGAAACTGG GCGGCGGTGA GGGCGTGTAC GCAATGAGTC CGGAAGAGGG 300 

20 

TGAAATGCTT TCGGTAGGCA CTCCACGGCT GTGAAGATGG OGGCGGCTGC GTGGCTTCAG 360 

GTGTTGCCro TCATTCTTCT GCTTCTQGGA GCTCACCCGT CACCACTGTC GTTTTTCAGT 420 

25 GCGGGACCGG CAACCGTAGC TGCTGCCGAC CGGTCCAAAT GGCACATTCC GATACCGTCG 480 

GGGAAAAATT ATTTTAGTTT TGGAAAGATC CTCTTCAGAA ATACCACTAT CTTCCTGAAG 540 

TTTGATGGAG AACCTTGTGA CCTGTCTTTG AATATAACCT GGTATCTGAA AAGCGCTGAT 600 

30 

TGTTACAATG AAATCTATAA CTTCAAGGCA GAAGAAGTAG AGTTGTATTT GGAAAAACTT 660 

AAGGAAAAAA GAGGCTTGTC TGGGAAATAT CAAACATCAT CAAAATTGTT CCAGAACTGC 720 

35 AGTGAACTCT TTAAAACACA GACCTTTTCT GGAGATTTTA TGCATCGACT GCCTCTTTTA 780 

GGAGAAAAAC AGGAGOCTAA GGAGAATGGA ACAAACCTTA CCTTTATTGG AGACAAAACC 840 

GCAATGCATG AACCATTGCA AACTTGGCAA GATGCACCAT ACATTTTTAT TGTACATATT 900 

40 

GGCATTTCAT CCTCAAAGGA ATCATCAAAA GAAAATTCAC TGAGTAATCT TTTTACCATG 960 

ACTGTTCSAAG TGAAGGGTCC CTATGAATAC CTCACACTTG AAGACTATCC CTTGATGATT 1020 

45 TTTTTCATGG TGATGTGTAT TGTATATGTC CTGTTTGGTG TTCTGTQGCT GGCATQGTCT 1080 

GCCTGCTACT GGAGAGATCT CCTGAGAATT CAGTTTTGGA TTGGTGCTGT CATCTTCCTG 1140 

GGAATGCTTG AGAAAGCTGT CTTCTATGCG GAATTTCAGA ATATCCGATA CAAAGGARAA 1200 

50 

TCTGTCCAGG GTGCTTTGAT CCTTGCAGAR CTGCTTTCAG CAGTGAAACG CTCACTGGCT 1260 

CGAACCCTGG TCATCATAGT CAGTCTGGGA TATGGCATCG TCAAGCCACG CCTGGAGTCA 1320 

55 CTCTTCATAA GGTTGTAGTA GCAGRAGCCC TCTATCTTTT GTTCTCTGGC ATGGAAGGGG 1380 

TCCTCAGAGT TACTGGGGCC CAGACTGATC TTGCTTCCTT GGCCTTTATC CCCTrGGCTT 1440 

TCCTAGACAC TGCCTTGTGC TGGTQGATAT TTATTAGCCT GACTCAAACA ATGAAGCTAT 1500 

60 
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10 



15 



20 



25 



30 



35 



TAAAACTTCG GAGGAACATT GTAAAACTCT CTTTGTATCG GCATTTCACC AACACGCTTA 1560 

TTTTGGCAGT GGCAGCATCC ATTGTGTTTA TCATCTGGAC AACCATGAAG TTCAGAATAG 1620 

TGACATQTCA GTCQGACTGG OSGGAGCTGT GGGTAGACGA OXSCCATCTGG CGCTTGCTGT 1680 

TCTCCATCAT CCTCTTTGTC ATCATGGTTC TCTGGCXyUrC ATCTGCAAAC AACCAGAGGT 1740 
TTGCXTTTTTC ACCATTGTCT GAGGAAGAGG AGGAGGATGA ACAAAAQGAG CXTTATGCTGA ' 1800 
AAGAAAGCTT TGAAGGAATG AAAATGAGAA GTACCAAACA AGAACCCAAT GGAAATAGTA . 1860 

AAGTTAACAA AGCACAGGAA GATGATTTGA AGTQGGTAGA AGAGAATGTT CCTTCTTCTG 1920 

TGACAGATGT AGCACTTCCA GCCCTTCTGG ATTCAGATGA GGAACGAATG ATCACACACT 1980 

TTGAAAGGTC CAAAATGGAG TAAGGAATGG GAAGATTTGC AGTTAAAGAT GGCTACCATC 2040 

AGGGAAGAGA TCAGCATCTG TGTCAGTCTT CTGTACGGCT CXZATGGGATT AAAGGAAGCA 2100 

ATGACATCXT GATCTGTTCC TTGATCTTTG GGCATTGGAG TTGGCXaAGAG GTGTCAGAAC 2160 

AAAGAGAACA TCTTACTGAA AACAAGTTCA TAAGATGAGA AAAATCTACG AGCTTCTTAT 2220 

TTACAACACT GCTGCCCCCT TTCCTCCCAG ACTCTGACAT QGATGTTCAT GCAACTTAAG 2280 

TGTGTTGTTC CTGAACTTTC TGTAATGTTT CATTTTTTAA ATCTGACAAA CTAAAAAGTT 2340 

TAACGTCTTC TAAAAGATTG TCATCAACAC CATAATATGT AATCTCCAGG AGCAACTGCC 2400 

TGTAATTTTT ATTTATTTAG GGAGTTACAT AGGTGATGGG GGAAATTGTT AACTACCTTT 2460 

CATTTTCCTG GGAAGTCAAG GTTACATCTT GCAGAGGTTO TTTTCAGAAA AAAGGGC!CCT 2520 

TCTGAGTTAA GGAGCCATAG TTCTATCAAT GATCAAAAGA AAAAAAAAAA AACTCGATCXS 2580 

GCACXSAGGGG GGGCCCX3GTA CCXirAATTCGC CXTTATGCGAN TCGAATGAGA CC 2632 
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(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CKARACTEEaSTICS : 

(A) LENGTH: 2249 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEIXJESS : double 

(D) TOPOLOGY: linesir 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 
GAATTOQGCA CGAGCTCACC GTGCTGCGTG ACACAAGGCC AGCCTGOGCC TACGAGCCCA 
TQGACTTTKT RATGGCCCTC ATCTAOGACA TGGTACTGSW TGTGGTCACC CTQGGGCTQG 
CCCTCTTCAC TCTGTGOGGC AAGTTCAAGA GGTQ6AAQCT GAACGGGGCC TTCCTCCTCA 
TCACAGCCTT CCTCTCTGTG CTCATCTGGG TGGCCTGGAT GACCATGTAC CTCTTCGGCA 
ATGTCAAGCT GCAGCAGGGG GATGCCTGGA ACGACCCCAC CTTGGCCATC ACGCTGGCGG 



60 
120 
180 
240 
300 
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CXAGCGCTGG GTCTTCGTCA TCTTCCACGC CATCCCTGAG ATCCACTGCA CCCTTCTQCC 360 

AGCCCTGCAG GACAACACGC CCAACTACTT CGACACGTCG CAGCCCAGGA TGCX9GGAGAC 420 

5 

GGCCTTCGAG GAGGACGTGC AGCTGCCGCG GGCCTATATG GAGAACAAGG CCTTCTCCAT 480 

GGATGAACAC AATGCAGCTC TCCGAACAGC AGGATTTCCC AACGGCAGCT TGGGAAAAAG 540 

10 ACCCAGTGGC AGCTTGGGGA AAAGACCCAG CGCTCCGTTT AGAAGCAACG TGTATCAGCC 600 

AACTGAGATG GCCGTCGTGC TCAACGGTGG GACCATCCCA ACTGCTCCGC CAAGTCACAC 660 

AGGAAGAMAC CTTTGGTGAA AGACTTTAAG TTCCAGAGAA TCAGAATTTC TCTTACCGAT 720 

15 

TTGCCTCCCT GGCTGTGTCT TTCTTGAGGG AGAAATCGGT AACAGTTGCC GAACCAGGCX: 780 

GCCTCACAGC CAGGAAATTT GGAAATCCTA GCCAAGGGGA TTTCGTGTAA ATGTGAACAC 840 

20 TGACGAACTG AAAAGCTAAC ACCGACTGCC CGCCCCTCCC CTGCCACACA CACAGACACG 900 

TAATACCAGA CCAACCTCAA TCCCCGCAAA CTAAAGCAAA GCTAATTGCA AATAGTATTA 960 

GGCTCACTGG AAAATGTGGC TGGGAAGACT GTTTCATCCT CTGGGGGTAG AACAGAACCA 1020 

25 

AATTCACAGC TQGTGGGCCA GACTGGTGTT GGTTGGAGGT QGGGGGCTCC CACTCTTATC 1080 

ACCTCTCCCC AGCAAGTGCT GGACCCCAGG TAGCCTCTTG GAGATGACCG TTGC3GTTGAG 1140 

30 GACAAATGGG GACTTTGCCA CCGGCTTTGC CTGQTGGTTT GCACATTTCA GGGQGGTCAG 1200 

GAGAGrrTAAG GAGGTTGTGG GTGQGATTCC AAGGTQAGGC CCAACTQAAT CX5TGGGGTGA 1260 

GCTTTATAGC CAGTAGAGGT GGAGGGACCC TGGCATGTGC CAAAGAAGAG GCCCTCTGGG 1320 

35 

TGATGAAGTG ACXATCACAT TTGGAAAGTG ATCAACCACT GTTCCTTCTA TGGGGCTCTT 1380 

GCTCTAGTGT GTATGGTGAG AACACAGGCC CXX3CCCCTTC CCTTGTAGAG CCATAGAAAT 1440 

40 ATTCTGGCTT GGGGCAGCAG TCCCTTCTTC CCTTGATCAT CTCGCCCTGT TCCTACACTT 1500 

ACGGGTGTAT CTCCAAATCC TCTCCCAATT TTATTCCCTT ATTCATTTCA AGAGCTCCAA 1560 

TGGGGTCTCX: AGCTGAAAN5 CCCTCCX3GGA GGCAGGTTGG AAGGCAGGCA CX:ACGGCAGG 1620 

45 

TTTTCCGCGA TGATGTCACC TAGCAGGGCT TCAGGGGTTC CCACTAGGAT GCAGAGATGA 1680 

CCTCTCGCTG CCTCACAAGC AGTGACACCT CGGGTCCTTT CCGTTGCTAT GGTGAAAATT 1740 

50 CCTGGATGGA ATGGATCACA TGAQGGTTTC TTGTTGCTTT TGGAGGGTGT GGGGGATATT 1800 

TTGTTTTGCT TTTTCTGCAG GTTCCATGAA AACAGCCXTTT TTCCAAGCXX: ATTGTTTCTG 1860 

TCATGGTTTC CATCTGTCCr GAGCAAGTCA TTCCTTTGTT ATTTAGCATT TCGAACATCT 1920 

55 

CGGCCATTCA AAGCXXXCAT GTTCTCTGCA CTGTTTGGCC AGCATAACCT CTAGCATOGA 1980 

TTCAAAGCAG AGTTTTAACC TGACXX3CATG GAATGTATAA ATGAGGGTQG GTCTTTCTGC 2040 

60 AGATACTCTA ATCACTACAT TGCTTTTTCT ATAAAACTAC CXATAAGCCT TTAACCTTTA 2100 
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AAGAAAAATG AAA^AGGTTA CTSTTTGGGG G-CCGGGGGAG GACTGACCGC TTCATAAGCC 2160 

AGTACGTCnG AGCTGAGTAT GmCAATAA ^rCTTTTGAT ATTTCTCAAA AAAAAAAAAA 2220 

AAAAAMCCCG GGGG3GGC-::C CQSCCT^G 2249 

(2) INFC3MATIGH FC?. SEQ IT TO: 112: 



(i) S2QUZNCE C-3i?ArrE?^S7ICS : ' 

(A) Z^KTTr.: 2153 bcLse pairs 
15 (3) TYPE: nucleic acid 

O) "iroPCLOSV; linear 



(xi) SECUENCS CESCrJPTlCN: SEQ ID NO: 112: 

GATACTATAA GCSCAAGTGAC TCACGGGCGC GCCGTTAGAC TAGTGGATCC CGGGTGCAGG 60 

AATTCGGCAG AGCGCCGCCilG GAGCCGAAGT GCTGGCGCCC CCGCGGCCGC TGCCTCCGCG 120 

25 GANCCCAAAA TCATGAA;CT CACCGTG?JiJ5 ACCCCGAAGA AAAGGAGGAA TTCGCCGTGC 180 

CCGAGAATAG CTCCGTCCyS CAETITAAGG iAGAAATCTC TAAACGTTTT AAATCACATA 240 

CTGACCAACT TGT(?rTa-.rA TTTGCTGGAA AAATTTTGAA AGATCAAGAT ACCTTGAGTC 300 

30 

AGCATGGAAT TCATGATGSA CXT.CTGTTC ACCTTGTCAT TAAAACACAA AACAGGCCTC 360 

AGGATCATTC AGCTCAGCiA ACAiATACAG CTGGAAGCAA TGTTACTACA TCATCAACTC 420 

35 CTAATAGTAA CZCTACACCT GGr?:CTGCTA CTAGCAACCC TTTTGGTTTA GGTGGCCTTG 480 

GGQGACTTGC AGG?CTG?J3T AGCTTGGGTT TGAATACTAC CAACTTCTCT GAACTACAGA 540 

GTCAGATGCA GCGACAACTT TTGTCTAACC CTGAAATGAT GGTCCABATC ATGGAAAAWC 600 

40 

CCYTTGTTCA a^iGCATGCTC iJTCiAATCC? GACCTGATGN AGACAGTTAA TTATGGCCAA 660 

TCCACAAATG C^JGCAGTr-:^ TACAGAGAAA TCCCAGAAAT TAGTCATATG TTGAATAATC 720 

45 CAGATATAAT GAGACAAA::G TTGGAACTTG CCCAGGAATC CAGCAATGAT GCAGGAGATG 780 

ATGAGGAACC AGGACCGA3C TTTGAGCAAC CTAGAAAGCA TCCCAGGGGG ATATAATGCT 840 

TTAAGGCGCA TCTACACAGA TAITCAGGAA CX^ATGCTGA GTCCTGCACA AGAGCAGTTT 900 

50 

GGIGGrrAATC CATTTGC?rC CTTSSTGaGC AATACATCCT CTGGTGAAQG TAGTCAACCT 960 

TCCCGTACAG AAAATAGiGA TCCCTACCC AATCCATGGG CTCCACAGAC TTCCCAGAGT 1020 

'55 TCATCAGCTT CC\GCGGCAC TGCCAGCACT GTGGGTGGCA CTACTGGTAG TACTGCCAGT 1080 

GGCACTTCTG GGCAGAGZAC TACTGCGCCA AATTTGGTGC CTGGAGTAGG AGCTAGTATG 1140 

TTCAACACAC CAGGAATC-CA GAJ5CTTGTTG CAACAAATAA CTGAAAACCC ACAACTTATG 1200 

60 
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CAAAACATGT TCTCTGCCCC CTACATGAGA AGCATGATGC AGTCACTAAG CCAGAATCCT 1260 

GACCTTGCTG CACAGATGAT GCTGAATAAT CCCCTATTTG CTGGAAATCC TCAGCTTCAA X320 

5 GAACAAATGA GACAACAGCT CCCAACTTTC CTCCAACAAA TGCAGAATCC TGATACACTA 1380 

TCAGCAATGT CAAACCCTAG AGCAATGCAG GCCTTGTTAC AGATTCAGCA GGGTTTACAG 1440 
ACATTAGCAA CGGAAGCCCC GGGCCTCATC CCAGGGTTTA CTCCTOGCTT GGGGGCATTA " 1500 

10 

GGAAGCACTG GAGGCTCTTC GGGAACTAAT QGATCTAACG CCACACCTAG TGAAAACACA 1560 

AGTCCCACAG CAGGAACCAC TGAACCTGGA CATCAGCAGT TTATTCAGCA GATGCTGCAG 1620 

15 GCTCTTGCTG GAGTAAATCC TCAGCTACAG AATCCAGAAG TCAGATTTCA GCAACAACTG 1680 

GAACAACTCA GTGCAATGGG ATTTTTGAAC CGTGAAGCAA ACTTGCAAGC TCTAATAGCA 1740 

ACAGGAGGTG ATATCAATGC AQCTATTGAA AGGTTACTGG GCTCCCAGCC ATCATAGCAG 1800 

20 

CATTTCTGTA TCTKGAAAAA ATGTAATTTA TTTTTGATAA CGGCTCTTAA ACTTTAAAAT 1860 

ACCTGCTTTA TTTCATTTTG ACTCTTGGAA TTCTGTGCTG TTATAAACAA ACCCAATATG 1920 

25 ATGCATTTTA AGGTGGAGTA CAGTAAGATG TGTGGGTTTT TCTGTATTTT TCTTTTCTGG 1980 

AACAGTQGGA ATTAAGGCTA CTGCATGCAT CACTTCTGCA TTTATTGTAA TTTTTTAAAA 2040 

ACATCACCTT TTATAGTTGG GTGACCAGAT TTTGTCCTGC ATCTGTCX:AG TTTATTTGCT 2100 

30 

TTTTAAACAT TAGCCTATGG TAGTAATTTA TGTAGAATAA AAGCATTAAA AAGAAGCAAA 2160 

AAAAAAAAAA AAAAATTCCT GCX3CCCGCGA ATTCTTCT 2198 

35 

(2) INFORMATION FOR SEQ ID NO: 113: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1043 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOIjOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 
CTGAAGTGTA TGTQGTGAGG AAGAAGAGGC TCCTACTGTA GACAGCCTTG TTCTACAGAT 60 
50 CCTCCCAGAA ATCTCTQGGC CAGGTGGAAC CCAGGGTCAG AGAGGGATGG GAGAGAGGTT 120 
TAATTTTCCA TGATAAATAA AAATCTATAA AATAATAAAC AAGAGAAAAG AGATTGGAAA 180 
CAGCCAGGTT GGAGCAGTGA GrPGAGTAAGG AAACCTGGCT GCCCTCTCCA GATTCCCCAG 240 

55 

GCTCTCAGAG AAGATCAGCA GAAAGTCTGC AAGACCCTAA GAACCATCAG CCCTCAGCTG 300 
CACCTCCTCC CCTOCAAGGA TGACAAAGGC GCTACTCATC TATTTQGTCA GCAGCTTipr 360 
60 TGCCCTAAAT CAGGCCACCC TCATCAGTCG CTGTGACTTG GCCCAGGTGC TGCAGCTQGA 420 
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GiWnGACTGG CTGTGCCTGG CTTTTQTGGA 480 

AAATGCAGAT GGAAGCTTTG ACTATQGSCT 540 

CRATTATAAG AGTTACTCGG AAAACCTTTG 600 

CAACCTTCTT GCAGGCATCC ACTGCGCAAA 660 

CAACTQGGTT AGAATGGAAG KTTGCACTGT 720 

GGATGCCGCC TGAGATKAAA CARGGTGCGG 780 

TCCTCACTCA RGGATTCTTC ATTTCTTCTT 840 

CCCTTCCCAT TTACAACTAA AACTGACCAG 900 

CCTCCTTTACT CCCATCTGGA CCCAGTCCCC 960 

ACCACAATAA AGAAATCTTT ATATTTATCG 1020 

1043 

25 

(2) INFOFMATICSN FOR SBQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 703 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

GAATTCQGCA CGAGTGCQCG GGCACCACGG 0CX3TTTTTCG ACGCTGGCGG TGGACGCAGG 60 

CAGCATGGAC CACGGTTGCT GGGCGGATGG OGAGCGTCTA TGGTCAGTTG CCTTAGAAGT 120 

40 

GGTGAGATGG GAAGCTGCAG TTGGAAGACC CTGGAGGATG CCTGACAAGG GGATGTCTGA 180 

CACATGATTG GAGCTCTTTT TGAAATGTTT CTTGCCCTTC CTGGAGCAGA GGAGCCATTA 240 

45 TTTATGCAGG TACATCGAAG TCTTTTGACC TCCATACAGT GATTATQCTT GTCATCGCTG 300 

GTQGTATCCT GGCGGCCTTG CTCCTGCTGA TAGTTGTCGT GCTCTGTCTT TACTTCAAAA 360 

TACACAACGC GCTAAAAGCT GCAAAGGAAC CTGAAGCTGT GGCTGTAAAA AATCACAACC 420 

50 

CAGACAAOGT GTGGTGGGCC AAGAACAGCC AGGCCAAAAC CATTGCCACG GAGTCTTGTC 480 

CTGCCCTGCA GTGCTGTGAA QGATATAGAA TGTGTGCCAG TTTTQATTCC CTGCCACCTT 540 

55 GCTGTTGCGA CATAAATGAG GGCCTCTGAG TTAGGAAAGG TGGGCACAAA AATCTTCATG 600 

AGCAATACTT. CTTAGTAGAT TGTTTTGTTA TTCAAATCAA GTTCTAGTGT TTTTATGTGA 660 

GATTATATAA TTTACAGTGT TGTTTTATAT ACTTTTGAAT AAA 703 



RGACTTGGAT QGGTTTGAGG GTTACTCCCT 
AAGCAAGTTC AACATATCAA AGAOWAATGA 

5 

CTTCCAGATC AACAGCCACT ACTGGTGCAA 
CCACGTAGAC TGTCAAGATC TGCTGAATCC 
10 AAGGATTGTG TCCGGAGCAC GGGGGATGAA 
TCAGGCCGGC CACTCTTCTA CTGGCTGACA 
GTGCACCGTG GARTCATTCC AAGACTCCTG 

15 

CCTACTGCCT CCACTTCATG TTATTTTCTT 
AGCCCCAQGA ATAAATGGTT TTCTTGGCTT 
20 TGGTTCCTGT CTGTTATTTG TAAACTGAGG 
AAAAAAAAAA AAAAAAAACT CGA 
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(2) INPORMATi™ FOR SEQ ID NO: 115: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3684 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
10 (D) T0P0U5GY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

GGCAGAOGGG GCATGAGCAG GAGGAGGATT ACCGCTACGA GGTGCTCACG GCCGAGCAGA 60 

15 

TTCTACAACA CATGGTGGNA ATGTATCCGG GAGGTCAACG AGGTCATCCA GAATCCAGCA 120 

ACTATCACAA GAATACTCCT TAGCCACTTC AATTGGGATA AAGAGAAGCT AATQGAAAQG 180 

20 TACTTTGATG GAAACCTGGA GAAGCTCTTT GCTGAGTGTC ATGTAATTAA TCCAAGTAAA 240 

AAGTCTCGAA CACGCCAGAT GAATACAAGG TCATCAGCAC AGGATATGCC TTGTCAGATC 300 

TGCTACTTGA ACTACCCTAA CTCGTATTTC ACTGGCCTTG AATGTGGACA TAAGTTTTGT 360 

25 

ATGCAGTGCT GGAGTGAATA TTTAACTACC AAAATAATGG AAGAAGGCAT QGGTCAGACT 420 

ATTTOGTGTC CTQCTCATGG TTGTGATATC TTAGTQGATG ACAACACAGT TATGCGCCTG 480 

30 ATCACAGATT CAAAAGTTAA ATTAAAGTAT CAGCATTTAA TAACAAATAG CTTTGTAGAG 540 

TGCAATCGAC TGTTAAAGTG GTGTCCTGCC CCAGATTQCC ACCATGTTGT TAAAGTCCAA 600 

TATCCTGATG CTAAACCTGT TCGCTGCAAA TGTQGGCGCC AATTTTGCTT TAACTGTQGA 660 

35 

GAAAATTGGC ATGATCCTGT TAAATGTAAG TGGTTAAAGA AATGGATTAA AAAGTGTGAT 720 

GATGACAGTG AAACCTCCAA TTGGATTGCA GCCAACACAA AGGAATGTCC CAAATGCCAT 780 

40 GTCACAATTG AGAAGGATGG TQGTTGTAAT CACATGGTCT GTCGTAACCA GAATTGTAAA 840 

GCAGAGTTTT GCTGGGrTGTG TCTTGGCCCA TGGGAACCAC ATOGATCTGC CTGGTACAAC 900 

TGTAACCGCT ATAATGAGGA TGATGCAAAG GCAGCAAGAG ATGCACAGGA GCGATCTAGG 960 

45 

GCAGCCCTGC AGAQGTACCT GrTCTACTGT AATCGCTATA TGAACCACAT GCAGAGOCTG 1020 

CGCTTTGAGC ACAAACTATA TGCTCAGGTG AAACAGAAAA TQGAGGAGAT GCAGCAGCAC 1080 

50 AACATGTCCT GGATTGAGGT GCAGTTCCTG AAGAAGGCAG TTGATGTCCT CTGCCAGrTGT 1140 

CGTGOCACAC TCATGTACAC TTATGTCTTC GCTTTCTACC TCAAAAAGAA TAACCAGTCC 1200 

ATTATCTTTG AGAATAACCA AGCAGATCTA GAGAATGCCA CAGAGGTGCT CTCGGGCTAC 1260 

55 

CTTGAACGAG ATATTTCCCA AGATTCTCTG CAGGATATAA AGCAGAAAGT ACAAGACAAG 1320 

TACAGATACT GTGAGAGTCG ACGAAGGGTT TTGTTACAGC ATGTGCATGA AGGCTATGAA 1380 

60 AAAGATCTGT GGGAGTACAT TGAGGACTGA GAATGGCCCT GCATAAAATG AACTCTGAAA 1440 
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ACTTTACCAT CTAGAGTGCT CATGCAATTA AAACAAAACA AACACAAACA AGGAGGCACT 1500 

^ AAGCCTATTC TGACACCACT GGTCTGTAGT ACCAGAATTG TTTTGTTAAT GGAAAGTTTA 1560 

AGTAAATTAT ATICTAATAA AAAGGTAGAT AAACCATTGT ACAACAGTAT TCTAGGCCGC 1620 

CAACAAAAGT GTGACAGACA CACTAAAAGC CCTCCAACTT TAACTTGTAA CGTAGCTTCA 1680 

10 TTCTCAAAGC TGACTCCTTT TTTTTCTTTT TCCTTTTCCT GAGTGTAGTA CAGTTAAAAT 1740 

TTCAAACAGC TCCTTGACAC TGCTTTTCAT' GTTCAAACCA GCCATTTTGT TGTACTTTGG 1800 

TAAAGGACCT CTTCCCCTTC CTCCCCTACA CATACAGATA CACCCACACA CAGACTGACT 1860 

CrCTTTCTCT CATACCXICAA GGTCATGAGT GAATGATGCT TAGTTCCTTG TAAAGAAAAT 1920 

CTTGGGATGG GGAAAGGGGT AGGCAGCAAG AGGATTCAAC AAACGAAAAA CATAAAAACT 1980 

20 TTGTATATGA CTTTTAAAAC AAGAGGACAA CACAGTATTT TTCAAAATTG TATATAGCGC 2040 
ATATGCATGG ACAAAGCAAG CGTGGCACGT GTTTGCATAA TGTTTAATTA CAAAAAAATA ■ . 2100 

TTTATTCTTT AAAAATCTTC AAGATTATGT CTATTTGCTG TGCATTTTCT TTCAGTTTGC 2160 

25 

TTATCTTTCC CGGGTTGGGG TTGGGATAAA GGTGTGTCGG TTTAGCACCT CTGGAAGACC 2220 

TATCTAGAGC TCTTTCACTT TCCTGAGGTT ATTTTGCXXY TTCTGGTGTT GGTATGTCTG 2280 



2340 



30 TTGCXX3GCCA TGGGCTNCAY GCXHTCAATT CX:TGCTCTTG ATCAGQGACA AGGGAGGTCA 

AGCTCTGACT AATGCX^TGA CCTGATTAAG QGGTACAGCA GGGAGTTTTG TTGCTACAGC 2400 

TCATGAATTA ACXTTGTCCCA AOCTAATCCC CCTCCATGGC ATCATGCCTC TACCCAAGCC 2460 

35 

TTTGTGTGCC CATGTTATGC ACACAGCTGT AGGCATTCTT AAGTCCCCTG TCGCATCCAG 2520 

TGGAAGCATT TTAAAATTTC TTTTACTTTT TGGTTTTCCC TTAATTGCTG CTTTTCAGAT 2580 

40 TTTAGTTATG GCTCGTCTGC TCACCCCTTC TCTACATTAG GGTGTCAAAG AGAATGTTTT 2640 

GCTTTAAATA TAAATAGCCA TTCATTTAGT CTCAGATTGT GAATTTAAAA TGGTGGATAC 2700 

CGAAATTGCT TGTGTGTGTT GCTGTGGGTT TQGTTTGAAG GCAAACACCC CTAGAACATG 2760 

45 

ATATTCCCAT CTAGTGCATT TAAATAGAAA TCACTGAGTT TGCTGCTTTT TTATTGTCAG 2820 

.CftGATAGGAG AATTAATAAT GCATTTTAGC TGTGATGTCC ATTTTTATGA AATTCCTACT 2880 

50 AAGAGCTATG TTAAAAGTAA AGGATGGTGG TGGTTGTATT AACTATATAC CTGTTTAGQC 2940 

CATTCTGGCT GTGGTATTTT TCAATAGGTC AGCATCTGTA AATCTGTCAG TTTTATACAG 3000 

GAGTGCAGAG TGAACTAGGC AACTAGATTA AGAGGTCTAA ATATGAAATA CCAGTTGAQG 3060 

CTGAQGACCT CTTCGTCTTC CTTTAAATGT CTTTPGCCTA GGGAGTGTTT ACCATTTGTG 3120 

AGGCAGCITT GTCTQCTCTT ACACTGTACA TCCTATTACT CCATTGGGAA GTAGGTTCAC 3180 

60 TTTCCrCPGG CCTTTTGCCT AAGTTAGGCT TTGCPGAATC AACCCTACTT TTCCTTTTAG 3240 
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AAAAGGTTGT TACAGGAGAT TTACTGGCAA 
GTTTGCtGAG TATAAATGCT GCTTCCTTAA 
TGTACCTTTT CTCCTTTCCT CCCTTGCCAC 
TTCTTCCATT TTCTCGTCTC TCTCCCCTCT 
TTCAAATGAC AGCATCAATC TTAAAAAGAT 
AAAGCCTGAA TAAGTTCCTT TCCCTGGTAA 
ATATATGTGG CTCCTTTAAA ATGCPTTGTG 
TTCGGGGGGG GGCCCQGTNC CCAT 



CTGrrrCTTTT CCCATCAAAA ATCAGTGAAT 
ACCACTTCTC GCTTTAOGAT CAACTTTACC 
CTCAQGTGCA AATCTGAACT CAGTGTCTGC 
TCCCCCATTA TCCATATGAC ATTATTTTAC 
ATACATTAAA ACTAAGGAGT TTTTTTAAAG 
CTTTGAAAAG CAGTCAGAGT TGCTATATAG 
TATGTGTGGT GTTTAAAAAA AAAAAAAAAA 



(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1965 base pairs 

(B) TYPE: nucleic acid 

(C) STHANOEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 
AAGAAAGGGT ATTAAAATTC TAGATCACAT ATGGACCCGG GAAGGTTTTT NACCCTCTGT 
TAGTGACATC GAGTCTCCCA CTAGACAAAA TAGGTGGAAA AATCTCTCGA GGGCTCACAT 
TGTTTPGTCA TCTTCAGGAA AAACACCACC AGGCCATACC ACAGCCTGCC CAGTGAGGOG 
GTCTTTGCCA ACAGCACCGG GATQCTQGTG GTGGCCTTTG GGCTGCTGGT GCTCTACATC 
CTTCTGGCTT CATCTTGGAA GCGCCCAGAG OCGGGiSATCC TGACCGACAG ACAGCCCXTTG 
CTGCATGATG GOGAGTGAAG CAGCAGGAAG GGGCTCCCAA GAGCTCCTGG TGGTGCAGCC 
TGTGCTCCCC TCAGAAGCTC TGCTCTTCCC AGGGCTCCCG GCTGGTTTCA GCAGGCGACT 
TTCTTCCAAT GCTQGGCCCA GACTTCTTGC CTGGGTGCTG GCCTGCCCTC TCCGGNCCGC 
TTGCTGCCTG TCTQCTTTCC TTGGTGGYTT TGCTQGGTGC TGGGCCTGCC CTCTCCQGCC 
GCTTQCTGCC TOlVia.TiT CCTTC3GTGGC TTTGCTGGGT GCTGGGCCTG CCTTCTCTGG 
CTGCTTGCTG CTTGTCTGCr TTCCTTQGTG GCTTTGGCTT CTGCACTCCT TGGCGTCASC 
TCTCAGGTCC TCCATTCACA CGAGGTCCTC CTOGCTCTGG CCGCTCTTGC TGCTCCTGTC 
TGAAGAWATC AGACTGATTT CCTCTTAAGA CTCCTAGGGA TGTGGTGAAG AGCTGG6ACT 
CAAGTGCAGT CCACGGTGTG AAACATGAGG GARGTGAGGT GTCCGTCCAC TTCCCCCATA 
AAQGTGTGCA TTTCAGTTAG GCTGCCCCGC CACAGAGCAG GCTTCATCTG CTCTGCCATC 
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CAGCCCCATC TGGATGTGAG GTGGGGTGGA GACATCATGG GGTGATTGCA GAAAGGGGGA 960 

GTGGCGGCCC ACGCAGCTTC TGCTGAGGAG CTGACCGCTC TGAGCTGTTC TGTTTCGTAT 1020 

TGCTGCTCTG TGTCTGCATG TATTGTGACC GTGCGGCTCC ACCTCTTCCA GCTGCTGCTA 1080 

CAGCTGAGGC CTQGATCCCG GCCTTTCCCT GTGACTTACG TGTCTGTCAC CGGCANGCAG 1X40 

CCCTACAAAT CCTGGTGACC TGCTCTCCCA AGAACAGAGC CIGTCCCCAG ATGTCCCAGT 1200 

AGCXJATGAGT AACAGAQGTG GCTGTGGACT TCCTCTACTT CTCCTTGCTG GATCAGGGCC 1260 

TTCCTGCCTC CCGCTGQGCA GGTCTQGCCT TGCTCTCTTG GCAGGGCCCC AGCCCCTCTG 1320 

15 ACCACTCTGC AGCTCACCAT GCAGCTGATG CCAAAGTTGT GGTGTCCAGT GTGCAGCAGC 1380 

CCTGGGAGCC ACTGCCACCT TCAGAGGGGT TCCTTGCTGA GACCCACATT GCTTCACCTG 1440 

GCCCCACCAT GGCTGCTTGC CTGGCCCAAC CTAGCGTTCT GTGCCATGCT AGAGCTTGAG 1500 

20 

CTGTTGCTCT TCTTCAGGGG AGGAAATAGG GTGGAGAGCG GGAAGGGTCT TGCTCCTAAG 1560 

TGTTGCTGCT GTGGCTTTTT TGCCTTCTCC AAAGACGCAC TGCCAGGTCC CAAGCTTCAG 1620 

25 ACrGCTGTGC TTAGTAAGCA AGTGAGAAGC CTGGQGlTrG GAGCCCACCT ACTCTCTGGC 1680 

AGCATCAGCA TCCTACTCCT GGCAACATCA GGCCAACGTC CACCCCAGCC TCACATTGCC 1740 

AGATGTTGGC AGAAGGGCTA ATATTGACCG TCTTGACTGG CTGGAGCCTT CAAAGCCACT 1800 

30 

GGGATGTCCT CCAGGCACCT GGGTCCCATG ACCAGCTCCC CGTCTCCATA GGGGTAGGCA 1860 

TTTCACTQGT TTATGAAGCT CGAGTTTCAT TAAATATGTT AAGAATCAAA GCTGTCTTTG 1920 

35 TTCAQGCTGC TATAACAAAA ATATAATAGC CTGGGTGGCT TAAAC 1965 



40 (2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

50 AGTGATCCCC TTGCCTCGGC CTCCCAAAAT GCTGGAATTG TAAGCGTGGG CCTCTGCACC 60 

CGGCCTQGTC CGCAATTTAA AAACGCACAG CCACCATTCC CTYTCCAGAA AGCACCCAGA 120 

TGCCTTTGGG AGAACCAGCC TCCTCCATGG AGGAAAGdT GGGATCTGCC TTCCCACCTG 180 

55 

GGGAGGAGAG GGATCTGTQG AAAATCCTTC TGACGGACTT CCCCTCAGTG CCTGATCCAT 240 

ACTCAATAGT AGAAAAAGTA AGAAATATAC AAAGATAGCA GATACACGGA GACAGTTQCC 300 

60 CAAATAGCTG AGCGAWTAGC GCAGAAGCAA TATTGAAGAC CTAATAGCTG AGACATTTCC 360 
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AGAACTGATA AAGTGCATCC AGCCACAGAT CAAGCAGCCC AGAAAATTCC AGGCAGCATC 420 
AACAAATAAA TAGCXXXACA TGCACCC3GTG AAAATGCAGA AGACCAAACA AAAAAGTCCX3 480 
GTCAACAGCC AGAGTTAAAG AGG 503 



10 

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1133 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEIMISS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

20 

C3GCACAGCTT GGAATGAACC CCTGTGGATA AGGGGGACTA TTAGATAGAA TAAACATCAA 60 

TAAATGCTTG ATGAATAAAC GCTAATCCTA CCTTCCCAGC CTGACACCTC CCAGTGGACA 120 

25 CCACACTTCA CTTGAAGCCT TAGAAACCTT TCCCACCCAT GCTTCCAGCC CTGGCTTCAT 180 

GTTGCCATTT CTCACCCCCA GAACAGGCCG CCCGCCTGAA GAAACTACAA GAGCAAGAGA 240 

AACAACAGAA AGTGGAGTTT CGTAAAAGGA TGGAGAAGGA GGTGTCAGAT TTCATTCAAG 300 

30 

ACAGTGQGCA GATCAAGAAA AAGTTTCAGC CAATGAACAA GATCGAGAGG AGCATACTAC 360 

ATGATGTQGT GGAAGTGGCT GGCCTGACAT CCTTCTCCTT TGGGGAAGAT GATGACTGTC 420 

.35 GCTATGTCAT GATCTTCAAA AAGGAGTTTG CACCCTCAGA TGAAGAGCTA GACTCTTACC 480 

GTCGTGGAGA GGAATQGGAC CCCCAGAAGG CTGAGGAGAA GCGGAACMTG AAGGAGCTGG 540 

CCCAGAGGCA ANGAGGAGGA GGCAGCCCAG CAGGGGCCTG TGGTGGTGAG CCCTGCCAGC 600 

40 

GACTACAAGG ACAAGTACAG CCACCTCATC GGCAAGGGAG CAGCCAAAGA CGCAGCCCAC 660 

ATGCTACAGG CCAATAAGAC CTACGGCTGT KTGCCCGTGG CCAATAAGAG GGACACACGC 720 

45 TCCATTGAAG AGGCTATGAA TGAGATCAGA GCCAAGAAGC GTCTGCGGCA GAGTGGGGAA 780 

GAGTTQCCGC CAACCTCCTA GGCGCCCCGC CCAQCTCCCT TTGACCCCTG GGGCAGGGCA . 840 

GQGGGCAGGG AGAGACAAGG CTGCTGCTAT TAGAGCCCAT CCTGGAGCCC CACCTCTGAA 900 

50 

CCACCTCCTA CCAGCTGTCC CTCAGGCTGG GGGAAAACAG GTGTTTGATT TGTCACCGTT 960 

GGAGCTTGGA TATGTGCGTG GCATGTGTGT GTGTGTGTGA GAGTGTGAAT GCACAGGTGG 1020 

55 GTATTTAATC TGTATTATTC CCCGTTCTTG GAATTTTCTT CCCATGGGGC TGGGGTACTT 1080 

TACATTCAAT AAATACTGTT TAACCCAAAA AAAAAAAAAA AAAAGAAAGA AGN 1133 

60 
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(2) INPORMATIGN FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 1101 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doiible 

(D) TOPOLOGY: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

GGGCACAGCT GAAGCTGCAG ACCTCCCCAG GGGATGGCTC CTCTCCCCCA GGAGCCCCGA 60 

GGCAGGGGAG GCAGAAAGCC TGGGCTCTGG GGGGTGGCCT GCGGACAGCT GTGCICTGGG 120 

CCGGGGGCTG GGCCTGTCCC ACAGGGNCGT GGAGCTOSTG GTTCTGAGCA GCCAGCTCGG 180 

TGGTGTCTGG GGATAGCTGG GAGGCACAGC GGCTGCCATG TGGGACTGGG ACTCGAGXX3C 240 

20 TCCCTGGTCT TGGCCTCTGT GGCTCAGCCT TGCTCTGGTC TGCCTGAGTG CAGGGGCCAA 300 

GGGGCACAGG GCCAGTGAGG CCGGCCACGC TCGGGCCCTC ACCTGTGAGA •TCGGGTCGGA 360 

ATTTKACACA GCCTANQGCT TGGTTCTTGG TKGTNGAMCG TGGACTYCTK AGAACGGGAG 420 

TGCTGGTCCT .GAAAGGOGTG GTTGGAGACC AGCTGCTTTT CTCGCTGTTT TTCTCTTAGG 480 

AGATTAAACA AAAACAGAAA GCACAAGACG AACTCAGTAG CAGACCCCAG ACTCTCCCCT 540 

TGCCAGACGT GGTTCCAGAC GQGGAGACGC ACCTCGTCCA GAACGGGATT CAGCTGCTCA 600 

ACGQGCATGC GCCGGGGGCC GTCCCAAACC TCGCAGGGCT CCAGCAGGCC AACCGGCACC 660 

ACGGACTCCT GGGTGGCGCC CTGGCGAACT TGTTTGTGAT AGTTGGGTTT GCAGCCTTTC 720 

CITACACGGT CAAGTACGTG CTGAGGAGCA TCGCGCAGGA GTGAGGCCCA GGCGCCGAGA 780 

CCCAAGGCGC CACTGAGGGC ACCGCGCACC AGAGCGTGAC CTCGGCAGGC TGGACACACT 840 

40 GCCCAGCACA GGCAGACCCA CCAGGCTCCT AGGTTTAGCT TTTAAAAACC TGAAAGGGGA 900 

AGCAAAAACC AAAATGTGTG ACTGGGCTTT GGAGGAGACT GGAQCCTCAG CCCTOTCCTC 960 

GCCAOGGGCC GCTGGGQCT6 GTGTGQGTGG QCCITGTGTG CTGGATTTOT AGCTTATCTT 1020 

45 

CCGTGTTGTC TTTQGACCTG TTTTAGTAAA CCCGTTTTTC ATTTTAAAAA AAAAAAAAAA 1080 

AAACTTTGGG GQGQQGCCCC N HOI 

50 



(2) INFORMATION FOR SEQ ID NO: 120: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 282 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECHESS: double 

(D) TOPOLOGY: linear 

60 



25 



30 



35 
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10 



15 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

AGCTTCTCTG TCCAGTCTTG AACTCTGGGS TCTCTTGGAA CTTTCCTCAC CCCTCTCAGC 60 

CTGAATATTC CTTCCATGGA TTCCACTCAA CCAGACTTTG GATCTGTGCC TACTTAATCA 120 

ACCTTATCTT TGCAATATGT TCGGGCCCAC CTTCCACTCC TTGGTTCTTG TTCCTCCTTC 180 
GCCTAACTTG TCCCTTCTCC ACTTCACATC CCCGGTQGGA CAGCATTCCT CCTTCCTCCC " 240 

AACCTCCCTC CGTCTCAFAA AAAAAAAAAA AAAAAAAAAA TT 282 

(2) INFORMATION FOR SEQ ID NO: 121: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2635 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



60 
120 
180 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 
TAAGGGGGTG TGTGCTCACC TCCTCCTGAC CCTTAACACT CCTGTCCTGC CCAGACCAAC 
AGAGAGAGCT GTCCCTGAGA CCCOGGAGAG AAGCAGCTGC OGAAAGCTGC AGCCTTTCCG 
30 CACTCTGAGA (iCATGATCTT CCTCCTGCCA GGGGAGftGCC ACCCACAGGC CATCTCCAGC 

CCCACTTCCC TCAGCCCCCA GGGYTTCCTT CTGGCCCCTC TGAGGATTCC CTAGGGCTGC 240 

CCCGCAGAGG GGYTTCCCCA AGCTCTGTTT TGAAGCCTGC AATGTGGAAA AGTGAGAAGT 300 

CAGAGGGAAC AGGACAGGTG CAGCCGGGCT CTGAGGCCAC ACCTCACACC TCGCTGTTCC 360 

CCAACATCCC CTGAGCAGTG TGAGCTCATC TCACCAGATG AGAAGAGGCC CTGTCCATTT 420 

40 YTTTTGTTTG TTTGTTGCTG TmCCCCCA. CCCATCCAGT TCTCCTCAGC AAAGCAAATT 480 

CCTTAACACC TTTGGTQGAG AATTTCTTAC OCAGACTTGG QGCTGTGATG CCCTTCAGTG 540 

CGTQGTGAGT GCAGCGTGTG TGCGTGTGCC TGTGTGTGAA CCTGGGOGCC ATCCTGGTGG 600 

45 

CCTGGGAGCG TGAGGAGAGG CCCCCTGTGT QCTQGGTGAG TGGTQGGTGT GGGGTCAATG 660 

CAGTGAGGCT CTCTGGGTGA GGCTCCCAAC CTGGCAGTCC CCAGCCTCCC" AGCATCTGTC 720 

50 AGCGTCTGTT GGACTTTACA GAAGAGCCTC ATCCYGTCTG CCCCTCACTC TGCCCTGGAA 780 

TCAACATCTT CCGAGTCCTT CTTGGGGGAA ATAGCAGAGC CCCACTTAAC TCCATAAACT 840 

GCTTCCCATT CCGCAGCCCA GTTCTGATTG TTGAGGTGTC GCGTCGTTCC AGGTCCCCCA 900 

GTCCCCTCTT TCTCCTGTCC TCTCTCTGTC CTTCACCTCC CCACTCCAGC CCCGGCTCAG 960 

TTCAGGGAAA TGCTGTTCCA YATCAGCCCT CTGCTCTCTG AGGCAGCCGC GCCTCTGACT 1020 

60 CGGAGCTACT TGAAACTTCT GCTCTTGCTA GGATTGGAGT CTACCTATCT CTTCCATTTG 1080 



wo 98/54963 



374 



PCTAJS98/11422 



TCCCAGCTGG AGTTCTGGAA CTTTCCTCCT CGGGGTGGC3G GTGGGG3??G TTAJ^iSGATGC 1140 



1200 
1260 

1320 
1380 



^ TGGGGGGCCT GGGGAAGGAA GGAGTTCAGA GGAAGGGTGT CCCCTSTXT CrT3SiTC?rC-. 
CCCTCCGCTC CTGGGACACX5 TGCTCTCTCT QTCTCTGGGT CTTCTGGr?S TGCiCCSTrTG 
TGTGTCCTTG TAAATATGTT TTAGGAAGAA AGCWkii^GGG ACTGAACT.-JS CCr-rTGCTAG 
10 GATTGCAGGG GTCCAGCCTT GCCTGTTTCC GAAGCCCCCA CACTGCCTTTT CGCCCCACTS 

AGACTGGTCC CCTCAAAAGG TAGACAAAAC AGCiGCICCC TGTGQ^r-rTG AA33GCGGCC 1440 
TCAAAGTGGC TTTTTGTTAG ACAAGGTTAA GGTTTCCTCA TGPiGOJ-J^yv TGZ^jGATCGG 1500 
TCCTTCCTCA GCTCCTTGAT TTGTGACCTT GACG^AGGGG CCTGCCACCC AGCCCCTCCA 1560 
GTGCCCTCTC CTCGATGCCT CGCTCCTTCC TGCCCCCACT CCCCTGGrTT AGGIAGGTAG 1620 

20 GGGAATTAGG GCCATGCTGG AAGAAGCTTA ACC A T G T G TT CAAAGAATGG TT??ITGCIT 1680 
GCTTGGTCCT GGAACTCCCC TTGGCTGCCC CAGGCCTCCT TGGCCCATC-G G1X:-CTC-GGGG 1740 
AGGTGGATGT CAGATCTGGT AGGTTGCAGC AGAGAA?ATA AATCTGCrTT GAG.iGACCAC 1800 
TCAGAGAGGG TCCAAGGGTG ATGGAGAAGG AAGG:.TG3CC TCGGAGCTTG GAA3GQARGG 1860 
GTGGTGQGTC GCX3GCATCTT GACTQCCCCC TOTTSTCCCA CACGTGGSGG GTGC?rCACCC 1920 

30 OfCTTCACTC CAGCCCGCCT GCCTTCAGCC 'TTCCATGAGC TTCACCTGCT TCCAACTTCA 
CTTTGGAGGG GGTQGGGTCC GTTGGCATCA ACACGGGGAC CCTCTGCTTC ACCAAAfiCCC 
GAGCCCTCAG CCXrCTGGQGA GAACAAATGG CTGAGCTTTG ATACCTG3C-G TCt^TCGAGAG 
GCTGCGGGCT GGCGGCAGTC CCAGGGGAGA GACACC=JCAG AAG-3Aa-^CC Aa-^iCCCCG 2160 
AGGAAGTTCC CAGCAGAGCA AACTGCTTTC CAGCCTGAAG CCTGCTTAAA CT^IGTGATG 2220 

40 TGCAATAACT GAGCTTAGAG TTAGGAATTG TGTTCAiCTG CTTGGA7?rC CGmOTAGA 2280 
TTTAACTGCT GAAATTGTAT CTCTCAGTAA TTTTAGATGT CTTITAAAAA AT^GAAAAAC 2340 
AAAGTGTTAG ACTGTGTGCG TGTGCGTTGA TGGGCACTCA AGAGTCCZTT GAGTCATCCA 

45 

GCCCTGCCTT TOCCTTGCGC CCCCATCCTC TCACGTCCCG CCC/GCC?CC ACTIGGGGAC 
CCTGCCTCGT GTCGTCTTTA TCTGCCTATT ACTCAGCCTA AGGAAACX^JG TACACTCCAC 2520 
50 ACATGCATAA AGGAAATCAA ATGTTATrTT TAAGAAAATG GAAAAT.V^JA AC7ITATAAA 2580 
CACCAAAAAA AAAAAAAAAA. ACCa«3QGGG GGGGCCGGTA ACCCATTTCG CCTAA 2635 

55 

(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 994 base pairs 



1980 
2040 
2100 



2400 
2460 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 
GAATTCGGCA GAGGTTCGGC GAAGATAGGG AATAAGGAAG CACAGGAGTA GGGGAGAAGCi 
AAGCACAGGA GTAGGGGAGA TATACAGCGG TCAGGATAAG GGC3GAAAGGG CGGTGGTTGC 
SCAAGAGGTG AAACAAGATG TGAGAGACAA GQGGTAGGGA AGAAATGGGG CAGCGGTTAG 
GTTCAGAAGC GCATAGACCG TGGCGGACGG GCAATGCGAG GGGCACAGAA AGGAACTCAG 
GGGTGGGCTA TTTTAAKGGA GATGGTCCTT CAGCCCTCTT YTTTTCTGCG TAGTTCTCCT 
CCTCCAGGCC GCGCGCGGAT ATGTCGTCCG GAAACCAGCC CAGTCTAGGC TGGATGATGA 
CCCACCTCCT TCTACGCTGC TCAAAGACTA CCAGAATGTC CCTGGAATTG AC3AAGGTTGA 
TGATGTCGTG AAAAGACTCT TGTCTTTGGA AATGGCCAAC AAGAAGGAGA TCCTAAAAAT 
CAAGCAAGAA CAGTTTOTGA AGAAGATTGT TGCAAACCCA GAGGACACCA GATCCCTGGA 
QGCTCGAATT ATTGCCTTGT CTGTCAAGAT CCGCAGTTAT GAAGAACACT TGGAGAAACA 
TOGAAAGGAC AAAQCCCACA AACGCTATCT GCTAATGAGC ATTGACCAGA GGAAAAAGAT 
GCTCAAAAAC CTCOGTAACA CCAACTATGA TGTCTTTGAG AAGATATGCT GGGGGCTGGG 
AATTGAGTAC ACCTTCCCCC CTCTGTATTA CCGAAGAGCC CACCGCCGAT TCGTGACCAA 
GAAGGCTCTG TGCATTCGGG TTTTCCAGGA GACTCAAAAG CTGAAGAAGC GAAGAAGAGC 
CTTAAAGGCT GCAGCAGCAG CCCAAAAACA AGCAAAGCGG AGGAACCCAG ACAGCCCTGC 
CAAAGCCATA CCAAAGACAC TCAAAGACAG CCAATAAATT CTGTTCAATC ATTTAAAAAA 
AAAAAAAAAA AAAAAAAAAA AAAAAGGGGA GGGG 



(2) INFORMATICS FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1542 base pairs 
(6) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123; 
GGCASAQCCA CCTCGGCOCC GQGCTCCGAA GCQGCTCGGG GGCGCCCTTT CGGTCAACAT 
CGTAGTCCAC CCCCTCOCCA TCCCCAGCCC CCGGGGATTC AGGCTCGCCA GCGCCCAGCC 
AGGGAGCCGG COGGGAAGCG CGATGQQGGC CCCABCCGCC TCGCTCCTGC TCCTCCTCPT 



GCTGTTCGCC TQCTGCTCGG CGCCCGGCGG GGCCAACCTC TCCCAGGACG ACAGCCAGCC 
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CTGGACATCT GATGAAACAG TGGTGGCTGG TGGCACCGTC GTGCTCAAGT GCCAAOTCAA 300 

^ AGATCACGAG GACTCATCCC TGCAATGGTC TTAACCCTGC TCAGCAGACT CTCTACTTTC 360 

GGGAGAAGAG AGCCCTTCGA GATAATCGAA TTCAGCTGGT TAMCTCTACG CCCCACGAGC 420 

TCAGCATCAG CATCAGCAAT GTGGCCCTGG CAGACGAGGG CGAGTACACC IXSCTCAATCT 480 

10 TCACTATGCC TGTGCGAACT GCCAAGTCCC TCGTCACTGT GCTAGGAATT CCACAGAAGC 540 

CCATCATCAC TGGTTATAAA TCTTCATTAC GGGAAAAAGA CACAGCCACC CTAAACTCTC 600 

AGTCTTCTGG GAGCAAGCCT GCAGCCCGGC TCACCTQGAG AAAGGGTGAC CAAGAACTCC 660 

ACGGAGAACC AACCCGCATA CAGGAAGATC CCAATGGTAA AACCTTCACT GTCAGCAGCT 720 

CGGTGACATT CCAQGTTACC CXX3GAGGATG ATGGGGCGAG CATCXJTGTGC TCTGTGAACC 780 



50 



55 



60 



840 



20 ATCAATCTCT AAAGGGflGCT GACAGATCCA CCTCTCAACG CATTGAAG1T TTATACACAC 

• CAACTGCGAT GATTAGGCCA GACCCTCCCC ATCCTCGTGA GGGCCAGAAG CTGTTGCTAC 900 

ACTGTGAGGG TCGCGGCAAT CCAGTCCCCC AGCAGTACCT ATGGGAGAAG GAGGGCAGTC 960 

TGCCACCCCT GAAGATGACC CAGGAGAGTG CCCTGATCTT CTCTTTCCTC AACAAGAGTC 1020 

ACAGTGGCAC CTACGGCTGC ACAGCCACCA GCAACATGGG CAGCTACAAG GCCTACTACA 1080 

30 CCCTCAATGT TAATGACCCC AGTCCGGTGC CCICCTCCTC CAGCACCTAC CACX3CCATCA 1140 

TCGGTGGGAT CGTGGCTTTC ATrGTCTTCC TGCTGCTCAT CATGCTCATC TTCCTTOGCC 1200 

ACTACTTGAT CCGQCACAAA GGAACCTACC TGACACATGA GGCAAAAGGC TCCGACGATG 1260 

CTCCAGACGC GGACACGGCC ATCATCAATG CAGAAGGCGG GCAGTCAGGA GGGGACGACA 1320 

AGAAGGAATA TTTCATCTAG AGGCGCCTGC CCACTTCCTG CGCCCCCCAG GGCCCTGTGG 1380 

40 GGACTTGCTG QGGCCGTCAC CAACCCGGAC TTGTACAGAG CAACCGCAGG GGCXicSCCCr 1440 

CCCGWTTGTT CCXX»GCCCA CCCACCCCXTT TGTTACAGAA TGTYTKGTTT QGGGTGCGGT 1500 

TTTGTWATTG GTTTNGGATN GGG6AAGGGA GQGANGGCGG QG 1542 

45 



(2) INFORMATIOT FOR SBQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1390 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 
CAAGCTCTAA TACGACTCAC TATAGGGAAA GCTGGTACGC CTGCAQGTAC CGGTCCQGAA 60 
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TTCCCGGGTC GACCCACGCG TCOSGGCCTC AGGGTGGACG CATGGTTCTG CACTGAGGCC 120 

CTCGTCATGG TGGCGCCTGT GTGGTACTTG GTAGCGGCGG CTCTGCTAGT CGGCTTTATC 180 

5 CTCTTCCTGA CTCGCAGCCG GGGCCGGGCG GCATCAGCCG GCCAAGAGCC ACTGCACAAT 240 

GAGGAGCTGG CAGGAGCAGG CCGGGTGGCX: CAGCCTGGGC CCCTGGAGCC TQA<3GAGCCG 300 
AGAGCTGGAG GCAGGCCTCG GCGCCGGAGG GACCTGGGCA GCCGCCTACA GGCCCAGCGT ' 360 

10 

CGAGCCCAGC GGGTGGCCTG GGCAGAAGCA GATGAGAACG AQGAGGAAGC TCTCATCCTA 420 

GCCTAOGAGG AGGAAGGTGT CX3AGAAGCCA GCGGAAAYTC ACCTGTCGGG GAAAATTGGA 480 

15 GCTAAGAAAC TGCGGAANNT GGAQGAGAAA CAAGCGCGAA AGGCCCAGCK Ta^GGCAGAG 540 

GAGGCTGAAC GTGARGWGCG GAAACGACTC GAGTCCCAGC GCGAATCAGT GGAAGAAC5GA 600 

GGAQGAGCGG CTTCGCCTGG AGGAGGAGCA GAAGGAGGAG GAGGAGAGGA AGGCCCGCX5A 660 

20 

GGAGCAGGCC CAGCGQGAGC ATGAQGAGTA CXTTGAAACTG AAGGAGGCCT TTGTGGTGGA 720 



55 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1288 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEIKESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 
60 GGCGCGOGGG TGAAAGGCGC ATTGATGCAG CCTGCGGCGG CCTCQGAGCG CGGCGGASCA 



GGAQGAAGGC GTAGGAGAGA CCATGACTGA GGAACAGTCC CAGAGCTTCC TGACAGAGTT 780 

25 CATCAACTAC ATCAAGCAGT CCAAGGTTGT GCTCTTGGAA GACCTGGCTT CCCAGGTGGG 840 

CCTACGCACT CAGGACACCA TAAATCGCAT CCAGGACCTG CTGGCTGLAGG GGACTATAAC 900 

AGGTGTGATT GACGACCGGG GCAAGTTCAT CTACATAACC CCAGAGGAAC TGGCCGCCGT 960 

30 

GGCCAACTTC ATCCGACAGC GGGGCCGGGT GTCCATCGCC GAGCTTGCCC AAGCCAGCAA 1020 

CTCCCTCATC GCCTGGGGCC GGGAGTCCCC TGCCCAAGCC CCAGCCTGAC CCCAGTCCTT 1080 

35 CCCTCTTGGA CTCAGAGTTG GTGTGGCCTA CCTGGCTATA CATCTTCATC CCTCCCCACC 1140 

ATCCTGGGGA AGTGATQGTG TGGQCAGGCA GTTATAGATT AAAGGCCTGT GAGTACTGCT 1200 

GAGCTTGGTG TGGCTTGGTG TGGCAGAAGG CCTGGCCTAG GATCCTAGAT AAGCAQGTGA 1260 

40 

AATTTAGGCT TCAGAATATA TCCGAGAGGT GGGGAGGGTC CCTTGGAAQC TGGTGAAGTC 1320 

CTGTTCTTAT TATGAATCCA TTCATTCAAG AAAATAGCCT GTTGCAAAAA AAAAAAAAAA 1380 

45 AAAAACTCGA 1390 

50 (2) INFORMATION FOR SEQ ID NO: 125: 



60 
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15 



GACGCTCACC ACGTTCCTCT CCTCGGTCTC CTCCGCCTCC AGCTCCGCC3C TGCCCGGCAG 120 

CCGGGAGCCA TGCX5ACCCCA GGGCCCCGCC GCCTCCCCGC AGCGC5CTCCG CX3GCCTCCTG 180 

5 

CTCCICCTCC TGCrcCAGCT GCCCGCGCCG TCGAGOGCCT CTGAGATCCC CAAGC3GGAAG 240 

CAAAAGGCGC ATCCGGCAGA GGGAGGTGGT GGACCTGTAT AATGGAATGT GCTTACAAGG 300 

10 GCCAGCAGGA GTCCCTGGTC GAGACC3GGAG CCCTGGGGCC AATGGCATTC CGGC3TACACC 360 

TOGGATCCCA GGTCXXXSATG GATTCAAAGG AGAAAAGGGG GAATGTCTGA QGGAAAGCTT 420 

TGAGGAGTCC TGGACACCCA ACTACAAGCA GTGTTCATGG AGTTCATTGA ATTATGGCAT 480 

AGATCTTCGG AAAATTGCGG AGTGTACATT TACAAAGATG CGTTCAAATA GTGCTCTAAG 540 

AGTTTTGTTC AGTGGCTCAC TTCGGCTAAA ATGCASAAAT GCATGCTGTC AGCGTTGGTA 600 

20 TTTCACATTC AATGGAGCTG AATGTTCAGG ACCTCTTCCC ATTGAAGCTA TAATTTATTT 660 

GGACCAAGGA AGCCCTGAAA TGAATTCAAC AATTAATATT CATCGCACTT CTTCTGTGGA 720 
AGGACTTTOT GAAGGAATTC GTGCTQGATT AGTGGATGTT GCTATCTGGG TTGGCACTTG 780 
TTCAGATTAC CCAAAAGGAG ATGCTTCTAC TQGATGGAAT TCAGTTTCTC GCATCATTAT 840 
TGAAGAACTA CCAAAATAAA TGCTTTAATT TTCATTTGCT ACCTCTTTTT TTATTATGCC 900 
30 TTCGAATCGT TCACTTAAAT GACATTTTAA ATAAGTTTAT GTATACATCT GAATGAAAAG 960 

CAAAGCTAAA TATOTTTACA GACCAAAGTG TGATTTCACA TGTTTTTAAA TCTAGCATTA 1020 

TTCATTTTGC TTCAATCAAA AGTGGTTTCA ATATTTTTTT TAGTTGGTTA GAATACTTTC 1080 

TTCATAGTCA CATTCTCTCA ACCTATAATT TGGGAATATT GTTGTGGTCT TTTGTTTTTT 1140 

CTCTTAGTAT AGCATTTTTA AAAAAATATA AAAGCTACCA ATCTTTGTAC AATTTGTAAA 1200 

40 1GTTAAGAAT TTTTTTTATA TCTGTTAAAT AAAAATTATT TCCMACAACC TTAAAAAAAA 1260 

AAAAAAAAAA AAAAAAAAAA AAAAANAA ^288 



25 



35 



45 



(2) INFORMATION FOR SEQ ID NO: 126: 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 1517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: double 

(D) TOPODOGV: linear 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID m: 126: 

AGTCGCTTAA AQGCATCGTT TTAGGGATTA CTGGGAAGTA TCTTCAAAGT AATACATGAG 60 
AAACATTCCT TCCTAAATCC TTTATTATAT TGAATATCGT ATTAATTGGT TTTCAGAGGT 120 

60 
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TAAATTAACC ATGTATTCCT GCAATAAATG TCACTTGIOT CTTGTATATA ATCTTTTTTA 180 

TATATTACrCG GATTGATTCA TTAGTATTTT GTTGAGGATT TTTGTGTCTA TATTCATAAG 240 

5 AGATCCTGGT CTGCAGTTTT CrTTTTTTGT GATAATCTGG TTTTTGTATC AGTAATACAG 300 

GCCCCATCAA ACGAGTTGGG AAGTGTTCAC CTCTCTTGTA TTTTTTCAAG AGTTTGTGAA 360 
GAATTCCTAT TAATTCTTTA AATGTTTGGT AGAATCTACX: ATTGAAATCA TGTGTCCTGG ■ 420 

10 

GCTTTTTTTT GAGGGAAGTG TTCTGATAAC TAATTCAGTA TCTACTTTTT ATAGCTCTGT 480 

TCAGATTTTG CTTCTTCCTC AGTTAGTTTT GGTAATTTGT GTATCTCTAS GARTTTGTCC 540 

15 ATTTCATTTA TCTCATTTCT TGGCATAAAT TAAACTAAAT TTGGCXrTGAG CCTACCTGTA 600 

TATCTTGAGT CCCTCTGTAA GGAACTGTAG CCTAACTTGT ACATAAACAA ACTGAAATCC 660 

TAAATTAGGA ATGTAGTTTT TGTAACAGCT CCTGAGTCTC AGGCAGTCAC AGCAGYCAAG 720 

20 

TCTGTCAATT GCAGGCIGCT AACTAAGCAG CCCATGSTCA AATGAGGCAA AAACCTTTGC 780 

TTTTAACACA TAGTATAGCT TTGTAATCCT TTTCTTGCAC ACTCGGGTAA TTTCTTCCTT 840 

25 TTTCATTCCC KGWATTTTCC AKGAATATGA RTCTYCCTTT TTTCCCCTCC TGTCAGrPCTA 900 

GCTAATGGTT TCTCIAATTTT GTTGATCTTT TGAARAftCAA ACXnTTOSTT (XACTTTCTT 960 

GTTGCATATG CTGARTATTC TCATAATTGG AGTGGAAAGC TGATCTTTGA TTACTTATTT 1020 

30 

TACTTAGGGC TGAGGAGTTC ATGGACTTCG CAAARCCTCC TTGAATCTAA ATTGCATCTT 1080 

CTTTCCTCGT TTCTGGGCTG AAACATGTTT TTTCCCATCT WANAWACCCT TGGTCTTTTC 1140 

35 ATKGQCGATT AAGACTAGAG AAAGTTCTAG A1JK:CTTGTC CTTTTATGCT GTCATTTTGT 1200 

TTAAAGGCTT TCTATGTAGT AAAACTATCT ATATAGACAA AATAGAGCCT TGAGTTGTGG 1260 

TCTTCAATTT GATCAACATG ATTTACCACA TTCTGTACTG GATATTTCTT CACCTGCTGC 1320 

40 

TACTGTAAAC CATTTTATTC TTGGATCTTC TGTAGAGTAT ATTATCACAG GTACTTTTTA 1380 

* CAGGGGTC'TC TAATCTTTTG GCTTCCCTGG GCACATTGAA AGAAGAAGAA TTGTCTTGGG 1440 

45 CCACACATCA AATACGCTAA CACTAATAAT AGTTGATGAG CTAAAAAAAA AAAAAAAAAG 1500 

QCAAAAAAGN CCCAAAA 1517 

50 

(2) INFOPMATION FOR .SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 1073 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

60 (xi> SEQUENCE DESCRIPTION: SEQ ID NO: 127: 
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TGAATCTATT CTTTGAACAT TCTACAACAA GAATTACATT ATACTGTTAT ACCAGAGTAC 60 

TTCTCCAGTG TGAAATAGAT TOGTTTGGAA AATGAACCTG GCTTTGCTAT AAATTACATT 120 

Ci.CAGGCCTT TTTGCAAATG TGTAACTTGC CTATCAAAGT AGTTTGTAGG GCAAATGCAG 180 

AATATATGTC TCCATCTGGT AAAGTACCTT WTAYTCATGT GGGAAATCAA GTAGTATCAG 240 

AACTTGGTCC AATAGTCXIAA TTTGTTAAAG CCAAGGGCCA TTCTCTTAGT GATGGGCTGG 300 

AGGAAGTCCA AAAAGCAGAA ATGAAAGCTT ACATGGAATT AGTCAACAAT ATGCTGTTGA 360 

CTGCAGAGCT GTATCTTCAG TGGTGTGATG AAGCTACAGT AGGGRMGATC ACTCATGMTA 420 

GGTATGGWTC TCCTTACCCT TGGCCTCTGW WTCATATTTT GGCCTATCAA AAACAGTGGG 480 

AAGTCAAACG TAAGNTGAAA GCTATTGGAT GGGGAAAGAA GACTCTGGAC CAGGTCTTAG 540 

AGGATGTAGA CCAGTGCTGT CAAGCTCTCT CTCAAAGACT GGGAACACAA CCGTATTTCT 600 

TCAATAAGCA GCCTACTGAA CTTGACGCAC TGGTATTTGG CCATCTATAC ACCATTCTTA 660 

CCACACAATT GACAAATGAT GAACTTTCTG AGAAGGTGAA AAACTATAGC AACCTCCTTG 720 

CTTTCTGTAG GAGAATTGAA CAGCACTATT TTGAAGATCG TGGTAAAGGC AGGCTGTCAT 780 

AGAGTTATGT GTTAGTCTCA GGAGTCTTAA CTTTTGAAAT ATGTTTTACT TGAATGTTAC 840 

ATTAGATATT GGTGTCAGAA TTTTAAAACC AAATTACTGC TTTTTGAAAC CTCAAATTAT 900 

ATAATGTATC TTATGTATGT GCTTTATATT GTTATTTGTG TATACATTAA AATAATTCTG 960 

AATTATTTAA TCTGATATGT TGTATTCTGT ATCTTGAAAT TTTTGTTTCC TTGAAACATG 1020 

CATGCATTTA AAAATAAAGC TTAAACAACT GTAAAAAAAA AAAAAAAAAA CTC 1073 

(2) INFORMATIC»I FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 base pairs . . , 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGV: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 128: 

CAACCCCTGC CTTTTTTrTG TTTTCCATTT GCTTGGTAGA TCTTCCTCCA TCCCTTTATT 60 

TTGAGCCTAT GTGTGTCTCT GCCCGTGAGA TGAGTCTCCT GAATACAGCA CACTTACTQG 120 

TCTTGACTCT GTATCCAATT TGCCAGTCTG TGTCTTTCAT TTGGAGCATT TAGCCCATTT 180 

ACATTTAAQG TKAATATTGT TATGTGTGAA TTTRATCrPR TCATTATGWT GTTAGCTGGT 240 

TATTTTGCTT GTTAGTTGAT GCAGTTTCTT CCNGGCATCA ATGGTCTTTA CAAMTTGGCA 300 
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(2) INFORMATION FOR SEQ ID NO: 129: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1275 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
10 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

GGCAGAGCCT-GTCCCTGCTG CCCCTGCAAA AAAAACCCCC TCTGGTGTGA C3CAGGATGGT 60 

15 

TCGAGGTTAT GTGAGCTCCT TCTCCTTTCC TCCAGTTTCC TCTTCCCTTC TCCTCCCTGC . 120 

CTCTTTTGCT TTTCCCTTTC TTCCTGGTAC CCCCTGCCCA TTCCTGTATT TTCTCCCATC . 180 

20 GCCATTCTCC CCTCTCCCAC TGTCCCTAAC CCGTTCAAAC TCTTTCCTCT TAAATGGTTG 240 

AGATTTTCTC TCACCAAGCA CACCCCAGTA TTAATTAAAC TAGCTGCAAA CAGGCAGCAA 300 



GTGGTCTACC ATGACAGATG GGTTTTGTGT GTGTGTGTGT GTGTGTAATT GTAATAAAAC 360 

25 

ATATTGARTC ACTCAATAAA CACAGAGTGT CTACTACATG TATCARGCAC TATCATAGAT 420 

GCTAATTAAC GAAACTGAAA TGGCCAGGCC CTCACAGTGG CTCATGCCTA TAATCCCAGC 480 

30 ACTTTGGGAG GATGAGGCAG GAGGATCACT TGAGGCCGGG AGTTCAAGAC CAGCCTGGGC 540 

AACATAGTAA GACTCCATCT CTACAAAAAA AAAATTTTTT TTATTATACT TTAAGTTTTG 600 

GGTTACATGT GCAGAACGTG TAGTTTTGTT ACATAGGTAT ATACGTGCCC TGGTAGTTTG 660 

35 

CTGCACCCAT CAACCCATCA CCTACATTAG GTATTTCTCC TAATGTTACC CCTCTCCTAG 720 

CCCCCCACCC CGTGACAGGC CCTGGTGTGT GATGTTCCCC TCCCTGTGTC CATGTGTTCT 780 

40 CATTGGTCAA CTCTCACCTA TGGAGTGAGA ACATGTQGTA TrTGGTTTTC TGATCTTGTG 840 

ATAGCTTCCT GAGAATGTKG GTTTCCAGCT TTATCCACGT CCCTGCAAAG QGCATAAACT 900 

CATCCCTTTT TATQGCTGCA TAGTGTTCCA TQGTGTATAC GTGCCACATT TTCTTAATCT 960 

45 

ATCATTGATC GACAAGTTTT GCTATTGTGA ATAGTGCCAC AATAAACATA CGTGTGCGTG 1020 

TCTCTTTATA GCAGCATCAT TTATAATCCT TTGGGTATAT ACCCAGTAAT GGGATCACTG 1080 

50 AGTCAAATGG TATTTCTCGT TCTAGATCCG TAAGGAATTG CCACACTGTC TTCCACAATG 1140 

TTTGAACTAA TNTACACTCC CACCAACAGT GTAAAAGTGT TTCTATTTTT CCACAACCTC 1200 

TCCAACATCT GTTATTTCCT GACTTTTTAA TGAACGTCAT TCTAACTGGC GTGAGATGGT 1260 

55 

ATCTCATTGT GGTTT ^275 



60 
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(2) INFORMATION FOR SEQ ID NO: 130; 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 472 base pairs 
5 (B) TYPE: nucleic acid 

(C> STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

10 

CNGAAACCCC GTGAACCCTC CCCGGGTTAA AAAGCCCCCC CTAAATGGGG GGAACGCYTC 60 

ACACGTTATA AAAAAjGCACT AGAATGTTTT GAAAGCGAGA AACAACAGCT GTGTAGGGTA 120 

15 GCTAGCAGTT AGTGTTGTAC AGAAGACAGA TATTTGTGCA TTTYTGCATT TTCTAAGTTT 180 

GCTGCAATGA GCATGTATTA CTTTCATAGT TATAAAACAC ATGCAAAATG CCCTTTTAAA 240 

ATGAAAAAAA ATCCATGAGT GTAAGTGATA TATATGCTTT QGAAAGCCTG GGACGGTCAT 300 

20 

TCTTTACTCT CAATAGTATG T G TTT G CCTT TG?rCTTTTTG AGACATTTTG TTTTAATCTG 360 

TTGATGACAA TAACCTGTTG ATAATATAAC TTGATAACAA ATAAAATGAC TTATGATTGA 420 

25 AWMAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA NN 472 



30 (2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1950 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 
40 ACCTGTCAGA ATCTTCTCTC AGCAACCTGA GTCTTCGCCG TTCCTCAGAG OGCCTCAGTG 60 

ACACCCCTGG ATCCTTCCAG TCACCTTCCC TGGAAATTCT GCTGTCCAGC TGCTCCCTGT 120 

GCCGTGCCTG TNATTCGCTG GTGTATGATG AGGAAATCAT GGCTQGCTGG GCACCTGATG 180 

45 

ACTCTAACCT CAACACAACC TGCCCCTTCT GCGCCTGCCC CTTTNTGCCC CTGCTCAGTG 240 

TCCAGACCNT TGATTCCCGG CCCAGTGTCC CCAGCCCCAA ATCTGCTGGT GCCAGTGGCA 300 ' 

50 GCAAAGATCC TCCTGTOCCT GGTQGTCCTG GCCCTGTGCT CAGTGACCGA AGCTCTGCCT 360 

TGCTCTGGAT GAGCCCCAGC TCTGCAACGG GCACATQGGG GGAGCCTCCC GGCGGGTTGA 420 

GAGTQGQGCA TGGGCATACC TGAGCCCCCT GGTGCTGCGT AAGGAGCTGG AGTCGCTGGT 480 

55 

AGAGAACGAG GGCAGTCAGG TGCTGQCGTT QCCT6AACTG CCCTCTGCCC ACCCCATCAT 540 

CTTCTGGAAC CTTTTGTGGT ATTTCCAACG GCTACGNCTG CCCAGTATTC TACCAGGCtT 600 

60 GGTGCTGGCC TCCTGTGATG GGCCTTCGMA CTCCCAGGCC CCATCTCCTT GGCTAACCCC 660 
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TCATCCAGCC TCTGTTCAGG TACGGCTGCT GTGGGATGTA CTGACCCCTG ACCCCAATAG 720 

CiraXACCT CTCTATGTGC TCTGGAGGGT CCACAGCCAG ATCCCCCAGC GGGTGGTATG 780 

5 

GCCAGGCCCT GTACCTGCAT CCCTTAGTTT GGCACTGTTG GAGTCAGTGC TGCC5CCATGT 840 



TGGACTCAAT GAAGTGCACA AGGCTGTGGG GCTCCTGCTG GAAACTCTAG GGCCCCCACC . 900 



10 CACTGGCCTC CACCTGCAGA GGGGAATCTA CCGTGAGATA TTATTCCTGA CAATGGCTGC 960 



TCTGGGCAAG GACCACGTQG ACATAGTGGC CTTCGATAAG AAGTACAAGT CTGCCTTTAA 1020 

CAAGCTGGCC AGCAGCATGG GCAAGGAGGA GCTGAGGCAC CGGCGGGCGC AGATGCCXIAC 1080 

15 

TCCTAAGGCC ATTGACTGCC GAAAATGTTT TGGAGCACCT CCAGAATQCT AGAGACCTTA 1140 



AGCTTCCCTC TCCAGCCTAG GGTGGGGAAG TGAGGAAGAA GGGATTCTAG AGTTAAACTG 1200 



20 CTTCCCTC3TT GCCTTCATGG AGTTGGGAAC AGGCTGGGAA GGATGCCCAG TCAAAGGCTC 1260 



CAAGCGAGGA CAACAGGAAG AGGGATCCAC TGTTACCAAA AGTCCTGATT CCCCCATCAC 1320 



CAACCTACCC AGTTTGTTCG TGCTGATGTT GGGGGAGATC TGGGGQGAGT TGGTACAGCT 1380 

25 

CTGTTCTTCC CTTGTCCTAT ACCGGGAACT CXTCCTCCAGG GTACCCACAG ATCTGCATTG 1440 



CXXrrGGTCAT TTTAGAAGTT TTTGTTTTAA AAAACAACTG GAAAGATGCA GAGCTACTGA 1500 



30 GCCTTTGCCC TGAA3X3GGAG GTAGGGATGT CATTCTCCAC CAATAATGGT CCCTCTTCCC 1560 



TGACGTTCCT GAAGGAGCCC AAGGCTCTCC ATGCCTTTCT ACCTAAGTGT TTGTATTTTA 1620 



TTTTAAATTA TTTATTCTGG AGCCACAGCC CCCTTGCTTA TGAGGTTCTT ATGGAGAGTG 1680 

35 

AGAAAGGGAA GGGAAATAGG GCACCATGGT CCGGTGGTTT GTAGTTCCTT CAAAGTCAGG 1740 



CACTGGGAGC TAGAGGAGTC TCAAGCTCCC CTTAGGAAGA ACTGGTGCCC CCTCCAGTCC 1800 



40 TAATTTTTCT TGCCTGCCCC GCXrTTGGGGA ATGCCTCACC CACCCAGGTC CTGACCTGTG - 1860 



CAATAAQGAT TGTTCCCTGC GAAGTTTTGT TGGATGTAAA TATAGTAAAA GCTQCTTCTG 1920 



TCTTTTTCAA AANAAAAAAA AAAAAAAACT 1950 

45 



(2) INFORMATION FOR SEQ ID NO: 132: 

50 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 990 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
55 (D) TOPOIjOGY: linear 



(Xi) SEQUENCE DESCRIPTION: S£Q ID NO: 132: 
TQGAAGATTT AAAATAGGTT TCATATTTCT CTTGAATATG AATATATAAG CTTGAATAAG 60 

60 
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CTTGAGTCCr TATTATTATG A.--rtTTTTCCT TATTATTTCT ACCAATGCTT CTTATATTAA . 120 

AGCCTGATCj TTTrCATATT ASTATATGTA CATTAGCTGC CTGTGGATTA ACATTTCCAT 180 

5 GAAATOTAT? TTTGCAriGT TTGATCTTAA ACITTTTGTG TCTTTATATA AGGTATGCTY 240 

CrTmAASC-. TGATA.r?rTT AACCAGAATA GTTGAAAGAC AATCTYCACC TTTTACTTCT 300 

ATAZTTAJ^AT GTAJ^TGCAAT TTTTGATGCA TATTACGTCT TATTATTTAA CXIAACCTATT 360 

10 

TTATTTTATC TAG-3GCATTT TTCAGAAAGC CTTATTITCT TGTATTAATC AAATATTTTT 420 

AYC-.TTGTAr TTTCCfrTAT TAOT'AGKAA TACGKTACYC YAAATATATA TTGTGGSTAT 480 

15 TTTCAGAATT GCAATAT-SCC TCCTTAATTT ATTAGAGGCT AACCTAAATT ATTACTTTTA 540 

CCACTTACTT GAAAATTCTG GA-.CTTTAGA ACATTTATTG TTTTATGCAT TTTAATTCTA 600 

CTTOTATTTT TACTACTCCT AAACATTATT ATTGTTTTAG ACAAGCCAAA ATATATWTTG 660 

20 

TTATTAICTT AT/CTCCATT TCrrTTCTGTA' TTTTTATGCC ACTATGTATG CTCAATTTCC 720 

TTCTATGrrCA 1GAACC7AAT TCCTACTTT TGTTTTTTAA TCTGTGCAGG TAGCCTGGCC 780 

25 ATOAATTTT TA L ' lTI ' lG gT T?3CTGAAAA AATTGTGTTT ATTTCTATAT GCATACTTAT 840 

GCATATASAA T>XrrAG3r??S AC^TATTTTT AGTATTTATA AATGTAAAGT CATTWATTKG 900 

GCTTCTA-TCA TTTC:<G^<GA GAAATCAATT GTCAGCCCAA TAGTTTTTCA TTTTAAATTA 960 

30 

CNG.-ArmT TCAOxrrcrcT GGVITTAGGA 990 



35 

(2) BirOPI^ATICN FCR SEQ ID NO: 133: 

(i) SzQXjUTZZ CHAPACTERISTICS: 

(A) LSNGTr!: 1720 base pairs 
40 (3) TYPE: nucleic acid 

(C) STRAKDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUSiCS DESCRIPTION: SEQ ID NO: 133: 

45 

GTCTGATAAG Ca^unCTGGT TATTCCCCTA AAGTTTACTT CAGCACTAAC ACTAGTGCTT 60 

CCGCTGGAGT TTGCAGTrTT CCAGCTTTAT ACAGGATTTT CXMTTGACTG GAAGAGTCAA 120 

50 GGATATAGAG ACICAAZAGT GACATTTATT GTACAACATC AAGGGGAATA GGATACTCAT 180 

CAAACTQQGA TTATTCTTAT CAAAACATGG TCTTCTTTGA ATAAGAAAAA TACATAGTTG 240 

GT7ATTATOG ACTTAAAACT GCTTAAATG GATATTCTGA TAAAATATTT GCTGCTCTGT 300 

55 

AGAGTGTGGA AAATCTGAGA ATATTAQCTT TACTCATCTT GAGCTTTGAG GATGTTCTCT 360 

GTACGCCGAT GGOTTCATAT T^ACTAAAAA AGCTGGGTAT TGTAAAATCT CATTTATAAA 420 

60 AACTO^GATC AGAAGAAAAT TTTCTTTGAT GGTGAGACTG TTGTCTTAGT TCAGGAAA1T 480 
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ATTTAATAAT CCTTTGTTAC CTGTGAATGA AGGAACTTTG TAATTCTGAT TTATCGTAAA 540 

ACATGAGCCT TTCCAGAGTC AGCTTAGACA CTGTTGTCGC AAATAGCCAT GCTTTOXTT 600 

5 

ATCCCAAGGA QGCCCAGAGG GAGGGCCTAG TCTTCCTCTG TTGCTGTACA TATATTGAAA 660 

TGCTTTTTTT TTTTATTTTG CATTTGTTAT CTATAATGAG CTTTCTGAGC CCTGATATTA 720 

10 TOTGAGACAA ACAGGAGTTA TTGATGTTAT ACACTCCCTT CCATTCAGGA TTTTCTGCTT 780 

GGAGGGAAAT ATGTTGACCT TAGAGAATTG TGAATATTGT TGCAATTCTT GAATATATTA 840 

CCATGTGAAT AATAGAGACT GTGTTGCTCT CTAGTATAAG CTATATTTAT TTTTGATTCA 900 

15 

TTTCAATTAC TAGTTATAAC TGGAGAAATT TTGTTACCTC TATCCTGGCT TGCCTGACTG 960 
GCTGTATAAT AGCAGCAGCC TCTTTTAGAG CATCTTAATG AAAACATQGA TGAAAGGAAT ■ 1020 

20 TAATGATCAT ATCTGCASAC TGCGTAGAAA ATGGCTTTTG TTCCCAGCGT TAACATTTTC 1080 

TTCTCAATCA CATTTCAATG TTTGTGGAGA GTGGCAGATT CACACCAGAA ACACTAQGTG 1140 

TTCATATCCA TAGCATGGAT GCAGAATAAG CAGTTGGGAG AGAAGCTTCT TCCTACCTGG 1200 

25 

TACTCCICCC ATTCACCTCA GCCCAGCCCC AC5ACAGGCGT TAGCATTCAG TGTQGGCCCT 1260 

CAGGCAGCCC TCAAGCCTGG CTGGGTCATC AGATGQGGGC AGCCTGTGAC GGGCACCAGC 1320 

30 GGCCTCATTC CAGGGAAGAG TTCCTGGAGG GTQTTGGCTG TTTTTGTTAG CTCAGTnTT 1380 

TICTGGGCTC CACCATTCCT AACTCCAQGT AGACAAGATA GATGTCACAC ACAACAATTT 1440 

TAAAOTATTT TGCTTAGTGC ATTTTGTTTA TGATTGCAGT GTTTGTTTCT TATTTAATAG 1500 

35 

GCTTTTTACT TCATTCTATT AAATTTTAGT GTTTAGAAGA GGCGGGTACT GTCACTGTGT 1560 

AAAATATGTA ATATTTTATA TGTTATACCA TGTCATATAT ACTTGCAATA TCAGACCTTG 1620 

40 CATTCAATAT ACAATGCAAT TGACTCTTTG CAiSACCTGCA TTTTTCAGTG AACAATAAAA 1680 

AGATTGTCTG GCACTOCAAA AAAAAAAAAA AAAAAAAAAA 1720 



45 

(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 705 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doiible 

(D) TOPOLOGY: linear 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

GGCAOSAGGC CATCIGGGCT CATTCAGCAG GAAATAATGG AAAAAGCTGC AATATCCAGG 60 
TOTTTACTAC AATCTOGAGG CAAGATCTTT CCTCAGTATG TGCTGATGTT TGGGTTGCTT 120 

60 
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GTGGAATCAC AGACACTCCT AGAGGAGAAT 
TTAAATATAG CACCTTTTAT TAACCAGTTT 
5 TCCTCATTGC CCTGTATACC TTTAAGCAAG 
ACTCCXSTATT TGAACACCTC TAACAGAGAA 
GACTTGACTG CTATTCCATT TTGQGTATCA 

10 

GATACTTCAA GTGAAGCCTC CCACTGGAAA 
CAGGTTGAAA TGGGAGAGGA ACTTGTACTC 
15 ATCACAGTAA AGCAATGAAG AGCAGTTTTC 
AAGTACAAAA TTCTTGTCTT AATTAGTGGG 
TATTTTTTAA AATTGACATT AATAAAGCAT 

20 



GCTGTTCAAG GAACAGAACG TACTCTTGGA 180 

CAGGTACCTA TACGTGTATT TTTGGACCTA 240 

CCAGTGGAAC TCTTAAGACT AGATTTAATG 300 

GTAAAGGTAT ACGTTTGTNA AATCTGGGAA 360 
TATGTACCTT GATGAAGANG ATTAGGTTGG " 420 

CAAGCTGCAG TTGTTTTAGA TAATCCCATC 480 

AGCATTCAGC ATCACAAAAG CAATGTCAGC 540 

CAATGAAAAC TGTGTAAATA GAGCATCAAC 600 

GGTATATAAA AATTCCTTGT AATGGTCAAA 660 

ATTTTAAAAG TTTCT 705 



25 



35 



(2) INFORMATION FOR SEQ ID NO: 135: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
30 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

AGCACACACC TCCTTTAGTT GCTCCTAAGG TCATGTTCAA CATTCGTGGA GTGCATTTTC 60 

TGCTCAGGGA GCTTTCCCAG ACCCGGAATG TTTGGTGCTC ACAGACYCTG GCAAGGATCG 120 

GTATTCCTGT TCCTCAGTTT TCCCTGGG6A AATGGAGGST CAGTGACGTT CAGTGACGTG 180 

40 CCCAGftGTCA TGCCATTGGC GGGTGGCCCA GKGMTCCAGG TCTCCAGCAC CCCTCGGCCC 240 

CCTCCTCACC AGC?rCACATC ATCTCCTGGA TTAGAATCTG CTCACATAGT CIGTCCTGAA 300 

AQ6AAAAAAA AAAAAAAAAA AAC . ^^3 

45 



50 



60 



(2) INFORMATION FOR SEQ ID NO: 136: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 582 base pairs 

(B) TYPE: nucleic acid 

. (C) STRANDEDNESS: double 
55 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 
GGACGGAATG GIGCAACCCT CCTWAIfPTTT CTKGKQCTGT TGACAACAGA GGGAGGGAGG 



60 
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GAAAACATTT TTyGTGGGAG AATCCTACYT CTGCAGSGGA GCCCTTAAGC GATKGATTTT 120 
GAATCTKGAC CCTTTACCAA CTAATTTTGA AGGAAGATAC CTTGGAAATA TTTGGCATTC . 180 

•5 AGTOGGTTAC TGAAACAGCA TTAGTGAATT CATCTAGAGA ACTCTTTCAT TTATTCAGGC 240 

AACAACTGTA CAACTTGGAA ACCTTGTTAC AGTCCAGTTG TGATTTTGGG AARGTATCAA 300 
CTCTACACTG CAAAGCAGAC AATATTAGGC AGCAGTCTGT ACTATTTCTC CATTATGTTA ' 360 

10 

AAGTTTTCAT CTTCAQGTAT CTGAAAGTAC AGAATGCTGA GAGTCATGTT CCTGTCCATC 420 

CTTATGftGGC TTTGGAGGCT CAGCTTCCCT CAGTGTTGAT TGATGAGCTT CATGGATTAC 480 

15 TCTTGTATAT TGGACACCTA TCTGAACTTC CCAGTGTTAA TATAGGAGCA TTTGTAAATC 540 

AAAACCAGAT TAAGGTTTGA CTGGTTTCAT TTGATTTTTA AG 582 

20 

(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 
25 ' (A) LENGTH: 1021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

TTCGGCAGAG CCCTTGCGCG CTCTTGAATA CCTGCKTTCT GTAQCGCTAG TTCTCTTCAA 60 

GATTTGCTTA GTGTCATTTC ATTTCGGTTT CTTTTCTCGC CATGTTTTTC TGTCGGAATT 120 

35 

ACGGTTCGTT TOXSGTTCTAT GTACTCTCTA AAATGTTATC GTTTTTCATT TGTCTACTAA 180 

TTTTCGTGCA TTTGTTACTA CTGAGTTTCT TAATATCTGA CTGGCCTCCG CCCACGGGCT 240 

40 CTGCAGANCA TAAAATACTC AGGCTGATGG TAGTGCAGAG ACTCTCCCTC CTTGATCAGC 300 

GCAAACGTTG GTCTGAGGCT TGAGGGATGG AGCAACATTT TCTTGGCTGT GTGAAGCGGG 360 

CTTGGGATTC CGCAGAGGTG GOGCCAGAGC CCCAGCCTCC ACCTATTGTG AGTTCAGAAG 420 

45 

ATCGTGGGCC GTQGCCTCTT CCTTTGTATC CAGTACTAGG AGAGTACTCA CTGGACAGCT 480 

GTGATTTGGG ACTGCTTTCC AGCCCTTGCT GGCGGCTGCC CGGAGTCTAC TGGCAAAACG 540 

50 GACTCTCTCC TGGAGTCCAG AGCACCTTGG AACCAAGTAC AGCGAAGCCC ACTGAGTTCA 600 

GTTGGCOGGG GACACAGAAG CAGCAAGARG CACCCGTAGA AKARGTGGGG CAGGCAGARG 660 

AACCCGACAG ACTCAGGCTC CRGCAGCTTC CCTQ6AGCAG TCCTCTCCAT CCVTGQGACA 720 

55 

GACAQCRGGA CACCGAGGTC TGTGACAGCG GGTGCCTTTT GGAACGCOGC CATCCTOCTG 780 

CCCTCCAGCC GTGGCGCCAC CTCCCGQGTT TCTCAGACTG CCTGGAGTGG ATTCTTCGCG 840 

60 TTGGTTTTGC CGCGTTCTCT GTACTCTGGG CGTGCTGTTC ACGGATCTGT GGAGCTAAGC 900 
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AGCCTTAGAT AGCAGCAGAA QGCTTTTTGG ATTCTCCTCC TTGAAAAGAT TCTCAGTTAC 960 

CAAACGTCTC CACCTAGAAA ATAAAAATAC ATTAAGATGT TGANAAAAAA AAANAAAAAA 1020 

5 

A 1021 



10 

(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1777 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 

20 

CGGftAGATGA TQGCTTCAAC AGATCCATTC ATGAAGTGAT ACTAAAAAAT ATTACTTGGT 60 

ATTCAGAAOG AGTTTTAACT GAAATCTCCT TGGGC3AGTCT CCTGATCCTG GTGGTAATAA 120 

25 GAACCATTCA ATACAACATG ACTAGGACAC GAGACAAGTA CCTTCACACA AATTGTTTQG 180 

CAGCTTTAGC AAATATGTCG GCACT^GTTTC GTTCTCTCCA TCAGTATC5CT GCCCAGAGGA 240 

TCATCAGTTT ATTTTCTTTG CTGTCTAAAA AACACAACAA AGTTCTGGAA CAAGCCACAC 300 

30 

AGTCCITCAG AGGTTCGCTG AGTTCTAATG ATGTTCCTCT ACCAGATTAT GCACAAGACC 360 

TAAATGTCAT TGAAGAAGTG ATTCGAA^GA TGTTAGAGAT CATCAACTCC TGCCTGACAA 420 

35 ATTCCCTTCA CCACAACCCA AACTTGGTAT ACGCCCTGCT TTACAAACGC GATCTCTTTG 480 

AACAATTTCG AACTCATCCT TCAT^TCAGG ATATAATGCA' AAATATTGAT CTGGTGATCT 540 

CCTTCTTTAG CTCAAGGTTG CTGCAAGCTG GGAGCTGAGC T6TCAGTGGA ACGGGTCCTG 600 

40 

GAAATCATTA AGCAAQQCGT CGTTGCGCTG CCCAAAGACA GACTGAAGAA ATTTCCAGAA 660 

TTX3AAATTCA AATATGTGGA AGAGGAGCAG CCCGAGGAGT TTTTTATCCC CTATGTCTGG 720 

45 TCrCTTCTCT ACAACTCAGC AGTCGGCCTG TACTGGAATC CACAGGACAT CCAGCTGTTC 780 

ACCATGGATT CCGACTCAGG GCAGGATGCT CTCCCACCCG GACCCCTCCA GCCAAGCAGC 840 

CCTTCAAGTT CTTTTATTTC TGGGTAACAG AAGTAGACAG ACAGGTTACT TGGTGTATCT 900 

50 

TCTGTTAAAG AGGATTOCAC GAGTGTGTTT TCCTCACACA CTTTGATTTG GAGAATTQGT 960 

GCTAGTTGGC AATAGATAAC TCAGCGTAGA TAGTATTGCA AAAAGGGGAG GAAATACACA 1020 

55 ACAATAATAA ATCTAAAAAC CTGCTATTCA ACATGCAGTT TTATTTCGAR GCCAAAAATC 1080 

TAGAGCTTTC CCAAGATCCT GTTGCCTTAG GCACATNCAC ACTTCAACAG TGCACACTAT 1140 

CCAACAGTGC ACACTATTCA ACAGTGCACA CTATTCAAAA GCGTAGACTA TTTTTTTGCA 1200 
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TGTTCAAGAT ATTTGTTTTG GTCTTATGTG TGTGTGAGAG AGAGAGATTC CTTTGACATT 1260 

AAGGAGCATC AATGAGAAAA GATGATGAGG CAGGAATTAA TAAAGAAATG AAGTCGTGTG 1320 

TOTTTGGTTG CCTCTCAGAG GGCACACAAT TTCATAAACA CCATGCCTGG ACAATTTGAT 1380 

ATTAATATTT AACACCTCTG CATCTTTTTC TTAAAAAAGA ATATGGGCCA GATACAGTGG 1440 
CTCyVCATTTC TAATCCCAGC ACTTTGGGGA GCCAAGTTAG CAGAATCCCT TGAGCACAGG " 1500 

AATCTCAAAC CAGCTTOGGC AACATAGTGA GATCCCATCT OTACAAAAAA CTTAAAAATT 1560 

AGCCASGCAT GATGGCACAT ICCTGTAGTC CTAGCTACTC AGGAGGCTAA GGTAGGAC3GA 1620 

15 TTGCCTGAGC CCAGGAGTTC AAGGCTGCAG TGAGCTAAGN ACGTGCCAGT ACACTCCAGC 1680 

CTGAGCCACA AAGTCAGACC CTGTCTCGCA AAAAAAAAAN TTAAAAAGTC GGGGGGGGGC 1740 

CCGGTACCCA AATCGCCGGA TATGATCSTA AACAATC 1777 

20 



10 



25 



35 



45 



(2) INFORMATION FOR SEQ ID NO: 139: 



(i) SEC20ENCE CHARACTERISTICS: 

(A) LENGTH: 643 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 
30 (D) TOPOUOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 

TTTTTTTTTT ' lTITl " n T r T ' ri T iTri ' lTl ' TTTTTTTGGG AATGAGAAAA TAACTTTATT 60 

TTCATTOIX3G GGAGCGGGCC GATGTCCAGC CTCAGAACTT CTGGAACTGC TTCTrGGTGC 120 

CGGCAGCCTT GGTGACCTTG AGCACGTTGA AGCGCACTGT CTTGCTCAGA GGCCGGCACT 180 

40 CGCCCACTGT GACGATGTCA CCGATCTGGA CGTCCCTGAA GCAGQGGGAC AGGTGTACAG 240 

ACATGTTCTT GTGGCGCTTC TCGAACCGGT TGTACTTGCG GATGTAGTGC AGATAGTCTC 300 

QGCGGATGAC AATGGTCCTC TGCATCTTCA TCTTGGGTCA CCACGCCAGA GAGGATCCGC 360 

CCTCGAATGG ACACATTACC AGTGAAQGGG CATTTCTTGT CAATGTAGGT GCCCCTCAAT 420 

AGCCrcCTTG GGGTGTCTTT GAAGCCCAGA CCXSATCTTCT TGTTAGTAAC CCGCGGGAGC 480 

50 TTCTCCTTCC CAGTTTCTCC CAGCAGGACC CTCTTCTTGT TTTGAAAGAT GGTCGGCTGC 540 

TTrraGTAGG CACGCTCAGT CTGAATGTCC GCCATCTTCT CGTGCCCaiAY TCCTGCAGCC 600 

CGGGQGATCC ACTAGTTCTA GfiGCGGCCQC ACCGCGGTG6 AGC 643 

55 



60 



(2) INFORMATION FOR SEQ ID NO: 140: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1220 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
5 (D) TOPOLOGY: linear 

(xi) SEQUENCE DBSCRIPTICMJ : SBQ ID NO: 140: 

GGCAOGAGGA TGATAGACCT ACTGGAGGAA TACATGGTTT ACAGGAAGCA TACCTACATR ' 60 

10 

AGGCTTGATG GCTCATCCAA GATCTCGGAG AGGCGAGACA TGGTTQCTGA TTTTCAGAAC 120 

AGGAATGACA TCTTTGTGTT CCTGTTAAGC ACACGAGCTG GAGGACTQGG TATCAATCTC 180 

15 ACTGCTGMAG ACACAGTGCA TTTTCTATGA TAGCGACTGG AACCCCACTG TGGACCAGCA 240 

GGCCATGGAC AGGGCCCACC GCTTAGGGCA GACAAAGCAG GTTACTGTGT ACCGGCTCAT 300 

CTGTAAAGGC ACCATTGAAG AACGCATTCT GCAAAGAGCC AAGGAGAAGA GTGAGATTCA 360 

20 

GCGGATGGTG ATrTCAGGTG GGAACTTCAA ACCAGATACC TTGAAACCCA AAGAGGTGGT 420 

TAGTCTTCTT CTAGACGACG AAGAGTTGGA GAAGAAACGT ATGTACTCTA AACCTCTATA 480 

25 CACTCCCCTC ACGTATCTGA GAATQGAAGA GGTACTTGGS TGTGTGCCAA GGGTTAGGCA 540 

AAGCCAGAGG CTGTATTTAG GGAAAGTATT TTTGTGCTCA TATTTTATAT AAAAACCCAA 600 

ACAAGAATGT GTTTGTAGGC CAGGCGTGGT GGCTCGCGCC TCTAGTCTCA GCATTTCGGG 660 

30 

ARGCCAAAGT GGGCAGATCA CCTGARGTCA GGARTTTGAG TTTGARACCA GCCTGGCCMA 720 

CGTTGTGAAA CCCCACCTCT ACTARGARTA CSGAAAATTG GTTGGGCATG GTGGCGGGCA 780 

35 CCTGTAATTC CAGCACTTTG GGAGGCTQGG GCAGAANAAT TGCTTGAGCC CAGGAGGTQG 840 

AGATTGCGGT GAGCCGAGAT YGTQCCATTG CAECTCCAGCC SGGGCAATAA GAGTGAAAYT 900 

CCATCTTTTA AAAACAAACA AAAACAAAAA ACACAAGACG GCTCACACCT GTAATCCCAG 960 

40 

CACTTTGGGA RGCCGARGCA GGTGGATCAC GARGTCAGGA GTTCCAAGAC TAGCCTGGCC 1020 

AACCTGGTCA AGCCCCGTCT CTACTAAAAA TACMAATATT AGTCGGGCGT GGTGGTGGGC 1080 

45 ACGTGTAATC CCAGCTACTC GGGAGGCTGA GGCAGGAGAA TCCCTTGAAG CTAGGAGGCA 1140 

GAGGTTGCAG TGAGCCAGGA TCGTGCCATT GCACTCCAGC CTGGACAACA AGAGCAAGAT 1200 

TCCATCTCAA AAAAAAAAAA 1220 

50 



(2) INFORMATION FOR SBQ ID NO: 141: 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 721 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS: double 
60 (D) TOPOLOGY: linear 
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{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

AATTCGGCAC GAGCCAGGTT AGCCQGAAGG GCAGCTCTCC AGGCCCTGCC CACCCCflCAG 60 

5 

GGGGCTCCTT ATGCACAGCG GGGCGTCTCC TTGTGGCCAT AGAAACGGAA CTGGCTCTTT 120 

TCAACAGTGC TGCAAGAGGA TGGTTATTTA ACGCTGGCCC CCAAGGAGGA TUVGGCACAGA 180 

10 CYTTCCTCCC. TCCTCGAACA TCCAAGGGCA CTGGATCCTC TGTGTCCCTC TGAGATGGGG 240 

TGCCACTCCA GCAAGAGCAC CAOGGTGGCA GCTGAGTCCC AGAAGCTTGA AGAAGAGYQC 300 

GAGGGAAGAG AGCCAQGTCT GGAGACCGGC ACCCAGGCAG CAGACTGCAA GGATGCCCCG 360 

15 

CTGAAGGATC GAACCCCTGA GCCAAAGAGC TGAAATGCCT CTCTCCAGAG TCGGACCCTC 420 

ACCTCYTTCC TGGAACTGCC TTTGGCCCCA GAACCATGAG ACAATCCCCA CCCTGAGAAG 480 

20 CTCCGATCAC TGGGAGGAGA GAGAAAGCCT CCAGCTTTGG GATTCAQGCT TCAGAAGTTT 540 

TTAGCAGCCT TTGCTCATTG GAGAGGTGGG GAAAGGATAA AGTTCTTATA AGGAAATCCC 600 

TAATTTCCCC CAGCTCCTCC CCNCCNGAAG AAGGAACNAA AGAAAGTTCC TTCCACACGT 660 

25 

TTTGTTGGAA ACTTTTCCCT TGCCAACTTT CCTTGGATTG CCAGAACAAA GCCCTCCAGA 720 



30 



(2) INFORMATION FOR SEQ ID NO: 142: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1468 base pairs 
■ (B) TYPE: nucleic acid 

(C) STRANDECNESS: double 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTIC»J : SEQ ID NO: 142: 
ATCAATTAAT GTTTATAAAT GACTGTACTG AATTTAAAAC CGTACAGTTT CATTTGCATT 60 
45 TTGACATTAC TTTATTATAC ATTTTGCATT TAAAAGGCTG CACCAGTTGG CTTTTCTTCT 120 
GTTTTATTCT CAAAATATAG AGATTCTC5TG ATTTATTTGC CCTGTTTATG GATTAAAAAG 180 
AAAATTCTAA TATAAAGCAT TTCAATAGGA TGCATAQGTA TATTACGTTT TTTAAATGCT 240 

50 

TTAGATCTGT GATTCTTGAC TTACTATTTA TTTTATCCCC TTTAAGTCAG GGATGCTTTA 300 
TTCTATTTTA AAGCACTTAT GAGTTACATG TTGTAATCAA GTTTGCACAA TATATTTATC 360 
55 TATATGAGGA ACCCATAAAT GAATA(3CTAA TTTTTAAAAT GCCATTAAAA TGCATGAAAT 420 
KCTTATTAAA ACCTTACTAT ACTATTTCTT CAAGGCAAGT AAATTGACCA TGRGRAAAGR 480 
ACACAGTTAT TAAACACTGT TGACAGGAAA ATTCTCCTTG ATAACATAGG ACAATTAATG 540 

60 
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GAAAAAAAAA TTCTCATTAT TTGCAAAGAA TGAACAAGTT AATGAACAAA CAAACTAGAT 600 



TTCGTATGTT TTCAGCTTTt GTATCATGTT TAATTGTTTA ATTTGGTTGA AAAACTGCAG 660 



5 TTGAGAAATC AGATAGCAAT ATAGACATTC ACAGCAGCTC TGTGGATACC ATGTAATTGT 720 

CAGGTAATTT CAGAATGTTG AAAATTATTC AGTGCAGCCC TCATACTATC ATACTTGAAG 780 

AAATTGATTA CAGTTCCACT AAATTGTTGA AGATAAATTA TTTTTAAAGG TTATGAAAAC ' 840 

10 

TAAGTTATAT TAATTCATAT GTTTGATTTT TAAATCCCAC CTCCTCAAGC TATCCAATTT 900 



NCTGACTTTG AAAATAACCA TGACSAGATGC CACATTTCTC TCTGGGAAAC TACCACTCAA ' 960 
15 AGAATAATTG TTAAAAATTA AC3CTTTTAGG TATTAGAAGC TGTTATAAAG TATAAAATTA 1020 



AGATATAAGC AGATCACATG TAAATCATTC CTAAAGCACA AGAAAAGAAT GTGCCTTGAT 1080 

GTACATATAT TACTAAGTTG CCTCTCCCAG TTTACTTTAA AAATGGCTTT AAGGATAAAG 1140 

20 

AATAAATGTG ATAGCTGTGC ATGCATTATA TATTTGCATT TGCAAATTTC CCATrGTTTT 1200 

AACAGCTGTG TGGCTGACTT TCAATTOTAA GACGTGAATT GACATACAGC CCATAACTTT 1260 

25 ATAATCGCTG CTCATTTATC TTATCTTTCA GTTAGTGGAA AAACATTTCA ACCTGACTAA 1320 



AATTTGGAAT TGTGTCTTTT ATCTTCCATC CTCTGTTGTT ACTAGATTTA GTTTAAAAAT 1380 



TOTGTATGAC CATTAATGTA TGTCATAAAC ATGTAAATAA AAGATGTTGA ATCTTGTTGA 1440 

30 

AAAGCAWRAA AAAAAAAAAA AAACTCGA 1468 



35 

(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

45 

TGAATTTTTT GCCAAACTTA GTAACTCTGT TAAATATTTG GAGGATTTAA AGAACATCCC 60 
AGTTTGAATT CATTTCAAAC TTTTTAAATT TTTTTGTACT ATGTTTGGTT TTATTTTCCT 120 
50 TCTGTTAATC TTTTGTATTC RCTTATGCTC TCGTACATTG AGTACTTTTA TTCCAAAACT 180 



AGTGQGTTTT CTCTACTGGA AATTTTCAAT AAACCTGTCA TTATTGCTTA CTTTGATTAA ^240 
AAAAAAAAAA AAAAAAAAAA AAACCCCNAG GGGGGGGCCG GGTNCCCAAT CCCCCCCAAA 300 

55 



60 



(2) INFORMATION FOR SEQ ID NO: 144: 
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(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 2243 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

TGCCTCCCTT CCTGCAGATT GTGGACAGTA GTTCCTCAGC CTGCACCCTG GATXCCTTCT 60 

TCCCCTTCCT AGCTCCATGG GACTCGCCCC AAGACTGTGG CTTCAAQGAC CACCAGCCCC 120 

TTACTCTTCA AGCCCTGACT GTGGAGTTGG TAGATGCCTC TGATCCTCAG TATTCTCTCT 180 

15 GGCAATGTTC CACGGCTTCT CCTTCCTGGG AGCTGGCTCC ATAACTTGAT TTTCCCCAAA 240 

CGTGTTGCAA TCCCTGCTGC CCCTTAGCCA CCCAGQGTCT TGTGTGGGTA TGAGTGTAGA 300 

GGATGGGGGT ATGCCAGGCC TGGGCCGTCC CAGGCAGGCC CGCTQGACCC TGATGCTACT 360 

20 

CCTATCCACT GCCATGTACG GTGCCCATGC CCCATTGCTG GCACTGTGCC ATGTGGACGG 420 

CCGAGTGCCC TTYCGGCCCT CCTCAGCCGT GCTQCTGACT GAGCTGACCA AQCTACTGTr 480 

25 ATGCGCCTTC TCCCTTCTGG TAGGCTGGCA AGCATGGCCC CAGGGGCCCC CACCCTGGCG 540 

CCAGGCTGCT CCCTTCGCAC TATCAGCCCT GCTCTATGGC GCTAACAACA ACCTGGTGAT 600 

CTATCTTCAG CGTTACATGG ACCCCAGCAC CTACCAGGTG CTGAGTAATC TCAAGATTGG 660 

30 

AAGCACAGCT GTGCTCTACT GCCTCTGCCT CCGGCACCGC CTCTCTGTGC GTCAGGGGTT 720 

AGCGCTGCTG CTGCTGATGG CTGCGGGAGC CTGCTATGCA GCAGQGGGCC TTCAAGTTCC 780 

35 CGGGAACACC CTTCCCAGTC CCCCTCCAGC AGCTGCTGCC AGCCCCATGC CCCTGCATAT 840 

CACTOCGCTA GGCCTGCTGC TCCTCATTCT GTACTGCCTC ATCTCAGGCT TGTCGTCAGT 900 

GTACACAGAG CTGCTCATGA AQCGACAGNG GCTGCCCCTG GCACTTCAGA ACCTCTTCCT 960 

40 

CTACACTTTT GGTGTGCTTC TGAATCTAGG TCTGCATGCT GGCGGCGGCT CTGGCCCAGG 1020 

SCTCCTGGAA GGTTTCTCAG GATGGGCAGC ACTCGTGGTG CTGAGCCAGG CACTAAATGG 1080 

45 ACTGCTCATG TCTGCTGTCA TGAAGCATGG CAGCAGCATC ACACGCCTCT TTGTGGTGTC 1140 

CTGCTCGCTG GTQGTCAACG CCGTGCTCTC AGCAGTCCTG CTAOGGCTGC AGCTCACAGC 1200 

OGCCTTCTTC CTGGCCACAT TGCTCATTGG CCTGGCCATG CGCCTGTACT ATGGCAGCCG 1260 

50 

' CTAGTCCCTG ACAACTTCCA CCCTGATTCC GGACCCTGTA GATTGGGCGC CACCACCAGA 1320 

TCCCCCTCCC AGGCCTTCCT CCCTCTCCCA TCAGCAGCCC TGTAACAAGT GCCTTGTGAG 1380 

55 AAAAGCTGGA GAAGTGAGGG CAGCCAGGTT ATTCTCTGGA GGTTGGTGGA TGAAGGGGTA 1440 

CCCCTAGGAG ATGTGAAGTG TGQGTTTGGT TAAGGAAATG CTTACCATCC CCCACCCCCA 1500 

ACCAAGTTCT TCCAGACTAA AGAATTAAGG TAACATCAAT ACCTAGGCCT GAGAAATAAC 1560 



60 
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CCCATCCTTG TTGGGCAGCT CCCTGCTTTG TCCTGCATGA ACAGAGTTGA TGAAAGTGGG 1620 

GTGTGGGCAA CAAGTGGCTT TCCTTGCCTA CTTTAGTCAC CCAGCAGAGC CACTGGAGCT 1680 

5 GGCTAGTCCA GCCCAGCCAT GGTGCATGAC TCTTCCATAA GGGATCCTCA CCCTTCCACT 1740 

TTCATGCAAG AAGGCCCAGT TGCCACAGAT TATACAACCA TTACCCAAAC CACTCTGACA 1800 
GTCTCCTCCA GTTCCAGCAA TGCCTAGAGA CATGCTCCCT GCCCTCTCCA CAGTGCTGCT " 1860 

10 

CCCCACACCT AGCCTTTGTT CTGGAAACCC CAGAGAGGGC TGGGCTTGAC TCATCTCAGG 1920 

GAATGTAGCC CCTGGGCCCT GGCTTAAGCC GACACTCCTG ACCTCTCTGT TCACCCTGAG 1980 

15 GGCTGTCTTG AAGCCCGCTA CCCACTCTGA GGCTCCTAGG AGGTACCATG CTTCCCACTC 2040 

TGGGGCCTGC CCCTGCCTAG CAGTCTCCCA GCTCCCAACA GCCTGGGGAA GCTCTGCACA 2100 

GAGTGACCTG AGACCAGGTA CAGGAAACCT GTAGCTCAAT CAGTGTCTCT WTAACTGCAT 2160 

20 

AAGCAATAAG ATCTTAATAA AGTCTTCTAG GCTGTAGGGT GGTTCCTACA ACCACAGCCA 2220 

AAAAAAAAAA AAAAAAACTC GAG 2243 

25 

(2) INFORMATION FOR SEQ ID NO: 145: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

35 

(xi) SEQUENCE DESCRIPTIC»I : SEQ ID NO: 145: 
GCCAAGCTCT AATACGACTC ACTATAGGGA AAGCTGGTAC GCCTGCAGKT ACCGGTTCCG 60 
40 GGAATTCCOG GGTCGAOCXA CGCGTCCGCT TCCGTGTGTC AAAATCCTCA CCTCCTTCAT 120 
AACCATCTCC CACAATTAAT TCTTGACTAT ATAAATTTAT GGTTTGATAA TATTATCAAT 180 
TTGTAATCAA TTGAGATTTC TTTAGTGCTT GCTTTTCTGT GACTCAACTG CCCAGACACC 240 

45 

. TCATTGTACT TGAAAACTGG AACANCrTGG GAATGCCATG GGGTTTGATA ATCTGCCAGG 300 
GACATGAAGA GQCTCAGCTT CCTQGGACCA TGACTTTGGC TCAGCTGATC CTGNACATQG 360 
50 GAGAACAACC ACATTTTTCT TTGTGTGTCC TTCTAGCAGC TGTTCGGGAG GACCKTGACC 420 
CAAYftGTGTT CCCATGCTGT TTCTJGTGAA ATGCTCTCGG CTATGTAGCA GCTTTTGATT 480 
CCCTGCATAC CCTAQQCTGC TGCCCCTATC CTGTCCCTTG TTTATAACAT TGAGAGGTTT 540 

55 

TCTAGGGCAC ATACTGAGTG AGAGCAGTGT TGAGAAGTCG GGGAAAATGG TGACTACTTT 600 
TAGAGCAAGG CTQGGCATCA GCACCTGTCC AGCTCTACTT GTGTGATGTT TCAGGAACTC 660 
60 AGCCCCTTTT TCTGCCTAGG ATAAGGAGCT GAAAGATTAA CTTGGATCTY CTAATGGTCC 720 
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AAATCTTTTC GTCACAATAA AGAGTCTCCA AATTAGAGAC TGCATGTTAG TTCTGGATGG 780 

ATTTOGTCGC CrrGACA-TCAT ACCCKXTAG CTGTGAGGGG ACCCCGTTTT TA;^TGCAT 840 

5 

GGCCAAGCTC TCTGCAAATG GAAATGCTTA CACTGGGTGT TGGGGATGTT TGCTACCTCC 900 

TGCTATTTTT GTGGTTTTGG TTCTCCCACT ATGGTAGGAC CTCTGGCCAG CATTGrGGCT 960 

10 TGTCATGTCA GCCCCATTGA CTACCTTCTC ATGCTCTGAG GTACTACTGC CTCTGCAGCA 1020 

CAAATTTCTA TTTCTGTCAA TAAAAGGAGA TGAAAATAAA AAANAAAAAA AAAAAACTCG 1080 



NG 1082 

15 



(2) INFORMATION FOR SEQ ID NO: 146: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4313 base pairs 

(B) TYPE: nucleic acid 

• (C) STRANDEDNESS : double 
25 . (D) TOPOLOGy: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

CAAGCTGGTT TGAAACTAGG GGTCGGGCTC GGCCGTCGTC GTTGTTTGTC GCCGCATCCC 60 

30 

CGCTTCCGGG TTAGGCCGTT CCTGCCCGCC CCCTCCTCTC CTCCCTTCGG ACCCATAGAT 120 



CTCAGGCTCG GCTCCCCGCC CGCCGCAGCC CACTGTTGAC CCGGCCCGTA CTGCGGCCCC 180 
35 GTGGCCACCA TGTCCCTGCA CGGCAAACGG AAGGAGATCT ACAAGTATGA AGCGCCCTGG 240 



ACAGTCTACG CGATGAACTG GAGTOTGCGG CCCGATAAGC GCTTTCGCTT GGCGCTGGGC 300 

AGCTTCGTGG AGGAGTACAA CAACAAGGTT CAGCTTGTTG GTTTAGATGA QGAGAGTTCA 350 

40 

GAGTTTATTT GCAGAAACAC CTT T GACCAC CCATACCCCA CCACAAAGCT CATGTGGATC 420 



CCTGACACAA AAGGCGTCTA TCCAGACCTA CTGGCAACAA GCGGTGACTA TCTCCGTGTG 480 
45 TGGAGGGTTG GTGAAACAGA GACCAGGCTG GAGTGTTTGC TAAACAATAA TAAGAACTCT 540 



GATT TC TGTXj CTCCCCTGAC CTCCTTTGAC TGGAATGAGG TGGATCCTTA TCTTTTAGGT 600 

ACCTCAAGCA TTCATACGAC ATGCACCATC TQGGGGCTGG AGACAGGGCA GGTGTTAQGG 660 

50 

CGAGTOAATC TCGTGTCTCG CCACGTGAAG ACCCAGCTGA TCGCCCATGA CAAAGAGGTC 720 

TATGATATTC CATTTAGCCG GGCCQQGGGT GGCAGGGACA TGTTTGCCTC TGTGGGTGCT 730 

55 GATGGCTCQG TGCGGATOTT TGACCTCCGC CATCTAGAAC ACAGCACCAT CATETAOGAA 840 



GACCCACAGC ATCACCCACT GCTTCGCCTC TGCTGGAACA AGCAGGACCC TAACTACCTG 900 
GCCACCATQ6 CCATGGATGG AATGGAGGTG GTGATTCTAG ATGTCCGQGT TCCTGCACAC 950 

60 
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396. 

CTCTSGCCAG GTTAAACAAC CATCGAGCAT GTGTCAATGG CATTGCTTGG GCCCCACATT 1020 

CATCCTGCCA CATCTGCACT GCAGCGGATG ACCACCAGGC TCTCATCTGG GACATCCAGC 1080 

AAATGCCCCG AGCCATTGAG GACCCTATCC TGGCCTACAC AGCTGNAAGG WGAGATCAAC 1140 

AATGTCCAGT GGGCATCAAC TCAGCXXGAA VTGTCGCCAT CTGCTACAAC AACTGCCTGG 1200 

AGATACTCAG AGTGTAGTCT TGGTGC30GCT GTGCCCACGA QGCAGGGGCT TTrGTATTTC • 1260 

CTCCXrrCTGC CCCACCCCCA AAGTAAGAAG AAACATGTTT CCAGTGGCCA GTATGTCTTT * 1320 

CATTGCTTTC a«XCACTGT TACCAGAAGC TGCTCTAGGA GTTCCTGGCC AGTCACCCCA 1380 

15 TCGCCCTCTG TGGCAGACTC AGTGCTGTCT GGCXXTCTCCT CAGCCCAGGG CTGAGTTTTA 1440 

AGATTTTCTC TCCTTTCCTC TTCTCCTTTG GTTCCTCAAT TAAAAAATGT GTGTATATTT 1500 

GTTTOTCAGG CGTTCK3TTG AGGAGCAGTT CACGCACTGG CTGTGTCTAT TCCTCTGCCC 1560 

AQGTGTCTCT GTTTGCTGCC CAAKGVWKKT TTTCATGTCT CGTCCATCTC CATGTTCGTG 1620 

TTAGCACTWA CGTGGGAACA AATACCAATT TGTCTTTTCT CCTAGTATCA GTGTGTTTAA 1680 

25 C7U\ATTTTAA CTTTCTATAT TTGTTATCTA TCAGGCTAAT TTTTTTATGA AAAGAATTTT 1740 

ACTCTCCTGC TTCATTTCTT TGTCTTATAG TCCTCCCTCT TTGCACCTTC TTCTCTTCCC 1800 

TCAGTGCCTG GAGCTGGTAC TGGGCCCCTG GCCCCATGAG CAGTTTGCCT TCTTGAGTCA 1860 

CTGCCTGTGT AGTACATACC TGACCGGGAG TCCAAACCAC CTTGGTGCTC TGAAGTCCAC 1920 

TGACTCATCA CACCTTTCTT AGCCTGGCTC CTCTCAAGGG CATTCTGGGC TTGTAAACAG 1980 

35 ACATAGGAAG CCTCTGTTTA CCXTTGAAGCA CCACTGTCCA GCCCATTGGT TCCCACTGGC 2040 

AGCATCGTAG AGCTGAGAGA AACW3GCTCT CAGGGTACCT GACTTGAGGG GAATCGTTTC 2100 
ATGAAGCTGA ACTTCAAGCA TATTTCCAGT ACATTCTTTC AGAGTCTGTT TTTCCATCCA 2160 
AATATAAGCC CCAGGCCATT CCACTTAGTG TCTTTTCAAT GATAGGCAAG AATGATATCT ' 2220 
GAGTTGAACT TCGGTGCTTC TGTTGTTTGA GTTTACTGTG CCTGGTGGTA TATTGGGCAT 2280 
45 TCTTTGGATT GAGTGTTCTG AGQTGAGAGA GTCTTCCCGA GGCATCCTGT CTGTGCTTCC 2340 
AACCCTOAAC AAGACCTTAC ATGAGAGATG GACTGATGGA CTGCQGCAAT CCTGGGCTGT , 2400 
CAAGTCGATA GATAQTTAAA AAGCATTATA CTGTQQGTAA T6AAAAGGGA GGAAAAAAAA 2460 
AGAAGGAAAA GGAATTATAG ACCCCCAGGG TCAGCCAGTT AAGAGCTCTA CXZCACACCTG 2520 
TCAACXXXrrC TCTCCCCCAG TTTAGGTTCT GAGCAGTATT GGACTTGTAG CXTTGCAGTTG 2580 
55 TCTTTTGACT TCCAGGCCGC AGrCTCTTTC TGTTATGTGA ATGAGTTCCA TGGAGGGGCA 2640 
TATGTOIGAT TCCACCGTTA GATGAGCCCT TGGGGCAGGC AGTTTGGGAT GTGCTCTTQG 2700 
GGGAAAGTTG GCTGTTTCCT TGCQCTCTQC TCCTACCOGA AGTmTAAG TCCCTCTGAA 2760 
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TTCCTCATCT GAGATTAGTA GAGTAGCAGG CCTGAAGGAT GATGGTTTTG TCCTCTTTGG 2820 

TTCTCACCTG CTTGAGAAGT AAAACAGTAA CTTTGTTCTT CTGGGCCCTT AAGCTTTTTT 2880 

GOrTAACTCT TCCTTTTCAG AAGTAGATGT CATTATATGC CAAAAGTCTA GCTCTTTGCT 2940 

TTACCATACA GGGACCTCTC CCAAAGAAAA AGGCTCTTTT TTTAGCCAGC ATATTTCCCC 3000 
TTCTACCCTT TTACTTTGTT GTTCTGATTT TAGGACTCTG GCTGGCCATG TGCTTGTGGT ' 3060 

TGCXrrcrcCT GCATTTGCCA CTGGATTTGC ACTGCATCCT TTGGAGATAC AAAGCX3AGCA 3120 

GTTCTTCGTC AGAACCCTCC TCTGCTTTTC ATTGTGTTTG ATAATGGTTA CTGGGTCCTT 3180 

15 CTCTCAAGGG TAGCAAGGCC AAGCTGATGG CTGCTTGTTT AGGAGGCCAT CAGTTCCTTC 3240 

CTOTGGAGAA GGGTCTGAAA TGGAAGTCAG TGGTAGAAGG GGCTGGTCTG CTGGGCAGGG 3300 

CTTACATCCA CTCAGTTCTA AGATTCCTTT CXTTGATCTGC ACCTACGCCT GGTCTGTATG 3360 

GOXXSAATTTC TCAGCTGGAA CTCAGAAACA ACAACTTGAA AAAAAAATAA TAATTAGAAC 3420 

ATATTTGCAT AAGATAGCTA TTTACTCTGG AAACCAACAA CTTTTGAGAT TTCCCTTGCC 3480 

25 CTGTGGACGC CCAGCTCCTG TCATCCTTCC TTAGGTCCTG CAGTACAGTC TTCCCCTGAA 3540 

IGCCACCGGG GACCCAGQGG GACTCCACCC CCCTAAGCAA GCACACACAT ACTCACAGTT 3600 

GATGAGTTGC TGGTCTTTGA GTCCCAGCTC TCITACCCTC CCTTTACTCC ACCAGCCCGA 3660 

CGACCCATGA CTCAGGAGGG GATTTCTACA GTCTCAGGAT TTAGAAAGTC TGTAAGCCAT 3720 

OCATGCTCCA GAAAGCACCG ATCTGTTCTA GTTGCAAAAA GAACTCTGTA ATTTGTTGAG 3780 

35 GTTCTCAAAC TGACAGCCAG CGAGACTGQG TGGGAGQCCC TGGATCTGTT CTCCCTGACT 3840 
GCGGGAGGAG CAGCCACTAG GACTTTAGCA GGAAGCCCAC ATGGAGGCTC CGCCAGGCTG 3900 
TOGCCCAGCT GGTGATOGCC CTTTTGCTCC TQGCAGCCTG AGGCACAGCT GCXTGTATTG 3960 
TCCTCATCTC TTCTCACTGA AGGATGGAGG TGCTGAATAA ATTAGGCTTC AGGCNTCTAC 4020 
CACCAGAGAG CTCGAGAATG GGTCCACGTC ATTCAAGGAC CTGAATTTTT TATGCTCAGG 4080 
45 AGCATTGGAA TCCTCTTCTT CCAGGGAGGA ATTAGCCTGC AAGGTTAGGA CTTGAAGAGG 4140 
GAAGGfTATTT AATAACTCGG CGAGGATQGG TGTGGTGGCT CACACCTGTA ATCCCAGCAT 4200 
TTTGGGAGGC TGAQGTOGCC AGATCCCAAG GTCAGAAGAT CX5AGACCATC CTGGCTAACA 4260 
TGGTX3AAACC CCATCTCTAC TAAAAATACA AAATTAAATT GGCCGGGCGT GAA 4313 



30 



40 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 147: 



(i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: 1183 betse pairs 
60 (B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

5 

GQCAGAGCCT CAAGCTGACT TGGATTATGT GGTCCCTCAA ATCTACCGAC ACATGCAGGA 60 
GGAGTTCCGG GGCOGGTTAG AGAGGACCAA ATCTCAGGGT CCCCTGACTG TGGCTGCTTA 120 



10 TCAKWYGGGG AGTGTCTACT CAGCTGCTAT GGTCACAGCC CTCACCCTGT TGGCCTTCCC 180 



TTGCATGCQG AGCGCATCAG CCTTGTGTTC CTGCTTCTGT TTCTGCAGAG 240 



CTTCCTTCTC CTACATCTGC TTGCTGCTGG GATACCCGTC ACCAOCCCTG GTCCTTTTAC 300 

15 

TGTGCCATGG CAGGCAGTCT CGGCTTGGGC CCTCATQGCC ACACAGACCT TCTACTCCAC 360 



AGGCCACCAG CCT6TCTTTC CAGCCATCCA TTGGCATGCA GCCTTCGTGG GATTCCCAGA 420 
20 GGGTCATGGC TCCTGTACTT GGCTGCCTGC TTTGCTAGTG GGAGCCAACA CCTTTGCCTC 480 



CCACCTCCTC TTTGCAGTAG GTTGCCCACT GCTCCTGCTC TGGCCTTTCC TGTGTGAGAG 540 



TCAAGGGCTG CGGAAGAGAC AGCAGCCCCC AGGGAATGAA GCTGATGCCA GAGTCAGAGC 600 

25 

CGAGGAGGAA GAGGAGCCAC TGATGGAGAT GCGGCTCCGG GATGCGCCTC AGCACTTCTA 660 

TGCAGCACTG CTGCAGCTGG GCCTCAAGTA CCTCTTTATC CTTGGTATTC AGATTCTGGC 720 

30 CTGTGCCTTG GCAGCCTCCA TCCTTCGCAG GCATCTCATG GTCTGGAAAG TGTTTGCCCC 780 

TAAGTTCATA TTTGAGGCTG TGGGCTTCAT TGTGAGCAGC GTGG6ACTTC TCCTQGGCAT 840 

AGCTTTGGTG ATCAGAGTGG ATQGTGCTGT GAGCTCCTGG TTCAGGCAGC TATTTCTGGC 900 

35 

CCAGCAGAGG TAGCCTAGTC TGTGATTACT GGCACTTGGC TACAGAGAGT GCTGGAGAAC 960 

AGTGTAGCCT GGCCTGTACA GGTACTGGAT GATCTGCAAG ACAGGCTCAG CCATACTCTT 1020 

40 ACTATCATGC AGCCAGGQGC CGCTGACATC TANGACTTCA TTATTCWATR ATTCAQGACC 1080 



ACAGTQGAGT ATGATCOCTA ACTCCTGATT TGGATGCATC TGAGGGACAA GGGGGKCGGT 1140 



STCCGAAGTG GAATAAAATA GGCGGGCGTG GTGACTTGCA OCT 1183 

45 



(2) INFORMATION FOR SEQ ID NO: 148: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 734 base pairs 
{B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
55 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 



60 



GAATTCGGCA GAGTGAAGCA TTAGAAT6AT TCCAACACT6 CTCTTCTGCA CCATGAGACC 



60 
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10 



20 



25 



45 



55 



TTQCTTCTAT TTAA 



(2) INFOBMATION FOR SEQ ID NO: 149: 



399 

AACCCAGGGC AAGATCCCAT CCXMXTACAT CAGCXTTACCT CXXTCCTGGC TGCTGOXAK 120 

CSATGTCGCCA GCATTACCTT CCACTGCCTT TCTCCCTGGG AAGCAGCACA GCTGAGACTG 180 

QC3CACCAGGC CAOCTCTCTT GGGACCCACA GGAAAGAGTG TGGCAGCAAC TCOmSGCTG 240 

ACCTTTCTAT CTTCTCTAGG CTCAGGTACT GCTCdCCAT GCCCATGGYT GGGCCGTGGG 300 
GAGAAGAAGC TCTCATACGC CTTCCCACTC CCTCTCSGTTT ATAC3GACTTC ACTCCCTAGC ' 360 

CAACAGGAGA GGAGGCCTCC TOGGGTTTCC CCRRGGCAGT AGGTCAAACG ACCTCATCAC 420 

ASTCTTCCTT CCTCTTCAAG CX?nTCATGT TGAACACAGC TCTCTCCRCT CTCTTGTGAT 480 

15 riCTGAGGGT CACCACTGCC ARCCTCAGGC AACATAGAGA GCCTCCTGTT CTTTCTATGC 540 
TTGGTCTCAC TGAGCCTAAA GTTGAGAAAA TGGGTGCCAA GGCCAGTGCC AGTGTCTTGG 
GGCCCCTTTG GCTCTCCCTC ACTCTCTGAG GCTCCAGCTG GTCCTGQGAC ATGCAGCCAG 

GACTGTGAGT CTCGGCASGT CXMGGCCTG. CACCTTCAAG AAGTGGAATA AATGTGGCCT 720 

734 



600 
660 



30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEOOESS: double 

(D) TOPGIiOGY: linear 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 
GGCACAGIGG ACCCCAGACT CCCTCTCCGC CTTTCTCTGC CTGGGGAGAC CCACTGTGTG 60 
40 CATGGCATCA CTCACTCCCA TACCTCTGGC TATCAAAGGT TTCTGCCATG GCCACCCTGG 120 
AAGSAAACCA GAGGGAGGTA GACAGGGAGA TCAQGTCCCT TCTACTCTQG TTCCTGCTCT 180 
GTCAAATTGT CTCAGGCTGG CTGTGTCCAG AiW^STTCCCTG GTTCTCTCAR QGATGCCAAA 240 
TCTACAAGAA TCTCTCCTCT TCCAGTTCCT ATAACCTCTC CTTCCTTTTG TCTCTTTAGA 300 
CCTTGGAGTA GTAGCAGCCA GGTTCTTTCT ATCTCTGGGT TAGTGCATTA TCTCTGGTGG 360 
50 CTCCCTTACC CAGGACTTTG GGAATGGTCT TTTTGTAATA CATTCTCCTC AAATAATTCA 420 
ATTTTGAGTG TTCTCTATCT ATCCTGCTGG GAGGrrGTTA TATACAAATC ACTGTGCCCG 480 
TTTAGCAGAG AAGGAGACTG AAGCTCAGGG AGGTTAAGTG TCTTTCTCTA GGTCGTATTG 540 
TGGAGAAAGT GGCTCACTGG GGACTTGAAT GAGGTCOCTA GTTTCATGCT CGGAGGGCAA 600 
AGANGAATGT CCAATTCGCC TCAGATAAGC CTCTGGTAAA ATGTACTGTA CATAATAGGT 660 
60 AATCAATAAA TGTIGGCTCA TGACAAACAT GTTTTCTTTG TrCATTAGTT ATAGTGATTA 720 
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TCTTCTAAAT AACTCCMACA AGGAARTCAG CACATTTQGA ATATCAOTAT CTTTCCATGA 780 

TAATATCTTT CCMYGGAAAG AWAATGATAT TCCMAACTGG GAGTGTCCCW AGCARATCTG 840 

5 

ANTCTGTCTA TTGGCCCTGG GGTGGGCCAG CCCCTTAGAC TCTATGGTCT CATTCTCTTT 900 

GTTTACAAAA TTGAGATAAG GCCTTATTCT CTCCCCACCC CACCCATCCA TATTGTITTG 960 

10 AGAATAAAAT GAGAGGATGT GTGTCAAGGG TGTATnTGG CAATAGTCTC TGAGCCATTT 1020 

TCTCAGCACC TCCATACTGT TGACACTCAA GTAATATTTC ATCAGCATTC CATTCAGG^r^ 1080 

CCTCCCTTAA TGAGGTGTGC GATGTACAAG AGTVGTGAGG TGGCAAAGGA TGGGCTCCTG 1140 

15 

AGGAAACACT TAGGAAACTG GGCTTTCTGC CATTAAAAGA GACAAACCTT TGTGGTGACC 1200 

TAATTAAAGT TTTTAAAATT CAATTTQGAA AGTTAGCAAG CTAGCTCCTK TCCAGGWAAA 1260 

20 ATAAGGAGTC AGTCCATGAC CTAACCX3GTC CCGGGCTGCT TGCCATTCCA AACAACTGCA 1320 

GTAAGTTTAT CACOTTCTTT CAGGGACTGA GGTTTCCAGG CACAGACTTG GATAAGGAAG 1380 

GATGTCCTAT GGGGTCACAT TGATG 140S 

25 



(2) INFORMATION FOR SEQ ID NO: 150: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2890 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: double 
35 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 150: 

TTATATGCTA CAGCTACAGT AATTTCTTCT CCAAGCACAG AGGANCTTTC CCAGGATCAG 60 

40 

GGGGATCGCG CGTCACTTCA TGCTGCTGAC AGTGGTCGTG GGAGCTGGAC GTCATGCTCA 120 

AGTCGCTCCC ATGATAATAT ACAGACGATC CAGCACCAGA GAAGCTGGGA GACTCTTCCA 180 

45 TTCGGGCATA CTCACTTTGA TTATTCAGGG GATCCTGCAG GTTTATGGGC ATCAAGCAGC 240 

CATATCGACC AAATTATGTT TTCTGATCAT AGCACAAAGT ATAACAGGCA AAATCAAAGT 300 

AGAGAGAGCC TTGAACAAGC CCAGTCCCGA GCAAGCTGGG CGTCTTCCAC AGGTTACTGG 360 

50 

GGAGAAGACT CAGAAGGTiSA CACAGGCACA ATAAAGCGGA GGGGTGGAAA GGATGTTTCC 420 

ATTGAAGCCG AAAGCAGTAG CCTAACGTCT GTGACTACGG AAGAAACCAA GCCTGTCCCC 480 

55 ATGCCTGCCC ACATAGCTGT GGCATCAAGT ACTACAAAGG GGCTCATTGC ACGAAAGGAG 540 

GGCAGGTATC GAGAGCCCCC GCCCACCCCT CCCGGCTACA TTGGAATTCC CATTACTGAC 600 

TTTCCAGAAG GGCACTOCCA TCCAGCCAGG AAACCGCCGG ACTACAACGT GGCCCTTCAG 660 

60 . 
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AGATCGOSGA TGGTCGCACG ATCCTCCGAC 
CATGGC5CATC CCACCAGCAG CAGGCCTGTG 
5 TCTGACCCGC GCCTCGCCCC YTATCAGTCC 
GAACAAGTTT CTCCTGTTTG AGGCACAGAC 
GGAGAGCACA AGAAGACGTC CTGAGCATTG 

10 

GGACCAGTTT GCCTCCTTCC CTGCCTTAAA 
CTTTCCCCTT TGCATGTGAA ATACTGTGAA 
15 TGCTTGAAAT GCACAGTGCA GCAATCTTCG 
ACAGTATCAT TCCAAATTCC AAGATCATCA 
TCAATGCCTG GAAQGATTTT TTTTAATCTT 

20 

TGATCTCATT GGGATAATGA GAAAAGCTAG 
CAAGGAAGAC AAAGAAAAAC AAT6AAATCC 
25 AATGTCCTCC TTTTAAAAAA AAAAAAATGA 
ATATCCATTT AATGATTACA GTATTATTTT 
CTGAAAAACC AAATATGCCG GACAGGGTGT 

30 

TTGTGACXrr QGCTTCrCAT GTCCTTCTGG 
TATGAAATGT TAGCCAATTA ATACCAASAC 
35 GTTCTTCTGT AAAACTGTTT GCACATQGCC 
TCTGAGCXOT ATGGAGGCAG GACqpTGTCA 
GATGGCAAAC CCCATTTTTA AGTTATATTT 

40 

TTTTGTTrTT TGTTTTTTTG TrTTTTTTTA 
AGTGAAAGCA GCTTGGGATG TTQGAGCTAA 
45 AGCCTCCCTT TATTGAATTG GCMTAGGGA 
CAAAAACCTG GTTAGAiCJVrG CCAGCCTTTG 
CCAAGTGGCT TTATGGAGGC TGCATATAGA 

50 

CAGCTGCTAT TAACCCTATA ATGACTGAAA 
TGCACAGACT CCGGAAAAGT GAAGGCTGCC 
55 GCTGGTCTTG GATTTTTTTT CCATTAAATT 
TAAATAGCTT CAA^TmAA AAGTGGAATT 
CAGTGCTTTA TTTAATAATT CTCTTCTGTA 

60 



40! 

ACAGCTGGGC CTTCATCCGT ACAGCAGCCA 720 

AACAAACCTC AGTGGCATAA AYCGAACGAG 780 

CAAGGGTTTT CCACCGAGGA GGATGAAGAT 840 

TTTTCTGGAA GCAGAGCGAG CCACCTGAAA 900 

GAGCCTTQGA ACTCACATTC TGAGGACGGT 960 

AGCAGCATCG GGSTTCTTCT CCCCTTCTTC 1020 

GAAATTGCCC TGGCACTTTT CAGACTTTGT 1080 

AGCTCCCACT GTTGCTGCCT GCCACATCAC 1140 

CAACAAGATG ATTCACTCTG GCTGCACTTC 1200 

CCTTTTAGAT TTCAATCCAG TCCTAGCACT 1260 

CCATTGAACT ACTTGGGGCC TTTAACCCAC 1320 

TTTGAGTACA GTGCTTGTCC ACTTGTTTAC 1380 

GTTTAAAGAT TTTGTTCAGA GAGTAAATAT 1440 

AAACCTTAAG TAGQGTTGCC AGCCTGGTTT 1500 

GGCCACACCA AGAAGACGGG AAGACCTGGC 1560 

TCTGACCCGC GAAGTGCCCT ATCCTGGAAG 1620 

ACCTCATCTG CTCCTTCCCC AGTGGATQGG 1680 

AGGGGAGGGA ACTAGGACCC TTGTOTCCTG 1740 

TTGGCGGATG TGTCCTGCTC CATTGAGATG 1800 

CTTTGATTTT TGTTAATTTA GAGGTGTAGG 1860 

AGAGAAACAT TTATAACTGG ATAGCATTGC 1920 

TGCCAGCTGT TTATACTGCT CTTTCAAGAC 1980 

ATAAACAAGC CTTTAAACGT . GATAAAAGAT 2040 
CAAGGCAGGT TAGTCACCAA AGACTAACCT . 2100 

GAAGGCCTAA GOCTAGCAAC CATCTGCTCA 2160 

TCACCCCTCC ACTCTATTTT TGTGTTGTTT 2220 

AATCTGAGTA GTACTCAAAT GTGAGGAACT 2280 

CAGCTGATCA TATTGATCAG TAGATAAACG 2340 

GCAGTGTTTT TTCACTGTAT CAAACAATGT 2400 

TCATQGCATT TGTCTACTTG CTTATTACAT 2460 
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TGTCAATTAT GCATTTGTAA TTTTACATGT AATATGCATT ATTTGCCAGT TTTATTATAT 2520 

AGGCTATGGA CCTCATGTGC ATATAGAAAG ACAGAAATCT AGCTCTACCA CAAC?rTGCAC. 2580 

AAATGTTATC TAAGCATTAA GTAATTGTAG AACATAGGAC TGCTAATCTC AGTTCGCTCT 2640 

GTCATCTCAA GTGCAGAATG TACAATTAAC TGGTGATTTC CTCATACTTT TGATACTACT 2700 
TGTACCTGTA TGTCTTTTAG AAAGACATTG GTGGAGTCTG TATCCCTTTT GTATTTTTAA * 2760 

TACAATAATT GTACATATTG GTTATATTTT TGTTGAAGAT GGTAGAAATG TACTATGTTT 2820 

ATGCTTCTAC ATCCAGTTTG TACAAC3CTGG AAAATAAATA AATATAACAT AAAAAAAAAA 2880 

15 AAAAAAAAAA 2890 



10 



20 (2) INFORMATION FOR SEQ ID NO: 151: 



25 



30 



35 



40 



45 



50 



55 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2399 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
' (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

GAACTTTTCC ATCTGGCAAA CCGGAAACTC CATCCCCATT AAACCAACTC CCCCTTTTGG 60 

TTTCCCCCCC AGNGGAATAG AATTTGGACN CCCATATAAA TCCAGGAAAC CACCTAAATT 120 

CTTTAGTNGT TTGTGTTTGC AAGATCTAAG GTCATGGTAA ACATTAAGTT CTTAAAATTT 180 

TTCGGAGGGA CCAGTGCACC TCTCCCTCTG AATTGTTCNC CAATTTAAAA TTGGAGTAAG 240 

GTTTTAAAAT GTCTNATTCC ATTGGAAGGG TWrGTTATTT CATTTTGAGC CCAGAGGGGA 300 

GAGGCACATT TTAAATATCA GAATTAGATT AGCTTTGAGT TTGTACAATT GGGAACATAA 360 

TAGATTITCA TAAATTATGT GTGCCTTGTT GGAAGTGTCA ACTGTCTTTA TGTCTGCTTG 420 

TAAAAGTTTC AAAATATGTT TTCCCTCAAA AAGGCAACGT TACTTCATTT GCTTGAATAT 480 

TATGATAQGA ATGCTTACTG ATATTACTTG ATAGTCATAT ATAGCCTAGG AAATTTAACA 540 

TATATATAAC TATAGCAGTA TTAATAATGA TAGTTGTACT TCTTTAAAAC ATTAAATTTG 600 

AGGAAACTTT AATGCTGTCT CGTGTACATT GCTTTACTAC AGTGAGGGGG AATATCCTTT 660 

AGATTGAGCC TCAATTTACT GGTTAGTAGT ATGTGAACTC TGGTATAAAA ACGTAAACTA 720 

GACAGTAGAG CCGATGAATT AAAATTGTAA ATTGCTACAT TGGCATTTTC TACCTCCTTT 780 

TCTGTCAGAG TATTACTTTT TCCAGCATTT ATTCTTATTT GTGAGTAAAG AGGAAATQGG 840 

AACCTGAGGT TAAAATTGAC ATTTTTGTTT CATTGAGAAT TTAAGCAC3TA GGTACAGGAG 900 

AAGTGACTTC TCACATTAAT TTGGTCCCTA AATCTGTAAC TACAAGTTGT GATCGACATG 960 
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TACAAAATGT CTAAGAAAGG TCATATGCTG AATATTTTAC TTTTCCTGTA TAGTCTGCAT 1020 

GATTTGTTTC ATAAACCCAG CTTATTTCCT CCAAAAAC5CA AAATGGTCCT GTAATTTTTA 1080 

5 

AAGTAAAATA AACGrTGCCAT TTTGTCTGCA ATCTATAATT TCAGGAAGTT ATTGRAAGTT 1140 

CTGACTCAGG GCTTTTTAAC AGTTCAAGCA ATTGTGAGTT ATATTTTGGA AACTCCATCT 1200 

10 GTGTAATTCT CCAGTGCCTT GAAAGAATTA TTAACTTGGC AACACTATTA AAACTTTATA 1260 

AAAGATGGTC TTTAGTGCAC GTGTATCATT ATATACACGT TTTAAAGTCA TATTGCTTAG 1320 

CTTGTTAATA ATGATTCTGC ATGTOTGCTG QGTTTGQGTA ATTCTTTAAA GGAAGTTTTC 1380 

15 

TAGATTTC5CA CTTGATGTTT GTTTTTTAAA AACTGATTAT TTATGGCCGT GACACTGTTA 1440 

CCAGAAAAGT AATTCTAATT AAGTTATTAT GCAAAGTCAT CTATAAGTAG CATCTGGGAA 1500 

20 GAGGAGATSG AGGCCACAGT TTGCTATTTT AGTATGAAAG GAGGATCTGT TTGGGAAACA 1560 

TAGATTGTCT TCCCCTCAAA TGAGGGGAAA AAAAAAGACC CTTTGTTCAA ATGGATTCTG 1620 

TTGTAAAAAA TTATTTTTAA AGGAAATCAC AAATTGTATG TCATTCTTAA TGCTAGTCTT 1680 

25 

ATAGAATAAA TCCATAAAAT TGTTTTTATG TTCAGTATGT TTATGTCATT CTAAATGCAG 1740 

CAAATTCAAT GATAGCAGTT CAATTGACTC ATAGCAGTGT TTTGTATTTT TTCTAATTCT 1800 

30 TTAGCTTTCA ATATTGGATT AAAGTCTTGT TTGTGAATAT AGTTTCCGTA TGGCAAATGA ' 1860 

TTTCTTGCTT ATTAGCTTTT GTTAAAGAAT GCTTAGTAAG AGCTAAGCTT TTAAAACSTAA 1920 

TGCAAACATT TATOGTTAAT AAAACCTATG GTGTAATATC ATATAATQCT TTTCTTTGAT 1980 

35 

CTTTGGAGAA TTATTCTTTT ATAGTAGTAT ACATGAATTT TGATTTTTAA AGCATTTAAA 2040 

AACAAATCTC AATACATTAA AAAACCTGTT ATTGTTAAAA RGGAAATTAC CATGCCTTTA 2100 

40 AGAAACAAGG ATGTACATCT TCAATTCAGC ATRAGTGTCC ACATCTAGAA GGCTCTCATT 2160 

GCAGTTGTTT ACAGTTAAGG TACCTCTATC TAAAGGGCCA AAGAAGCATT. TCATAYTTTA 2220 

ACACCTCACA TTCTTTCAGG ATTAAGACAT ATGAAAATAG TCTGAATAGG ATAAATTTGG 2280 
ATAQGAAGTA ACTTAACCAG TCTQGGAAGA TTCAGC5CTTT TTCTATKAAA AAGCTTATTC . 2340 

CTCTTCACAA CTCNGGTGGT AGGOTTTCAT TTTTCAAGAG GGTAGATATT TTAAAGCCA 2399 



45 



50 



(2) INFORMATION FOR SEQ ID NO: 152: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 802 base pairs 

(B) - TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 
CGTGCCTCTA GTAAGCTCAT CCCTGCCTTT GAGATGGTGA TGCGTGCCAA GGACAATGTT 60 
5 TACCACCTGG ACTGCTTTGC ATGTCMCTT TGTAATCAGA GATTNTGTCT TGGAGACAAA 120 
TTTTTCCTAA AGAATAACWT GAYCCTTTGC CARACGGACT ACGAGGAAGG TTTAATGAAA 180 
GAAGGTTATG CACCCCMC3GT TCGCTGATCT ATCAACATCA CCCCATTAAG AATACAAAGC 240 

10 

ACTACATTCT TTTATCTTTT TTGCTCCACA TQTACATAAG AATTGACACA GGAACCTACT 300 
GAATAGCGTA GATATAGGAA GGCAGGATGG TTATATGGAA TAAAAGGCGG ACTGCATCTG 360 
15 TATGTAGTGA AATTGCCCCA GTTCAGAGTT GAATGTTTAT TATTAAAGAA AAAAGTAATG 420 
TACATATGGC TQGATTTTTT TGCTTGCTAT TCGTTTTTGT GTCACTTGGC ATGAGATGTT 480 
TATTTTGGAC TATTGTATAT AATGTATTGT AATATTTGAA GCACAAATGT AATACAGTTT 540 

20 

TATTGTGTTA CCATTTGTGT TCCATTTGCT YCTTTGTATT GTTGCATTTA GTACAATCAG 600 
TGTTTAAACT TACTGTATAT TTATGCTTTC TGTATTTACC AGCTATTTTA AATGAGCTGT 660 
25 AACTTTCTAG TAAAGAATTG AAAAGCAAAT CCTCACTAAA GGATACACAG GATAGGATAA 720 
AGCCAAGTCN CATCAACATT AAAAAATACT AAAANANAAA ACACAAAAAA AAAAAANCCC 780 
GGGGGGGGCC CQGAACCCAT TC 802 

30 

(2) INFORMATION FOR SEQ ID NO: 153: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:-461 base pairs 

(B) TYPE: nucleic acid 

" (C) STRANDEENE5S: double 

40 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 153: 

CTAGGAGCAC CGAGCAGCTT GGCTAAAAGT AAGGGTGTCG TGCTGATGGC CCTGTGCGCA 60 

45 

CTGACCCGCG CTCTGCNCTC TCTGAACCTG GCGCCCOCGA CCGTCGCCGC CCCTGCCCCG 120 

AGTCTGTTCC CCGCCGCCCA GATGATGAAC AATGGCCTCC TCCAACAGCC CTCTGCCTTG 180 

50 ATGTTGCTCC CCTQCCGCCC AGTTCTTACT TCTGTGGCCC TTAATGCCAA CTTTGTGTCC 240 

TQGAAGftGTC GTACCAAGTA CACCATTACA CCAGTGAAGA TGAGGAAGTC TQGGGGCCGA 300 

GAOCACACAG GTOQGAACAA GGACAGGGGG ATTTAAGCAG TCAAAAGGAA AAACATGTTA 360 

AGACCCTAGA CTIOTATATT GACACACTTG TACCTTGTAA GGCAGAGGAA TGTAATTAAA 420 

AAGCACTTAT TTGGCWNAAA AAAAAAAAAA AAAAAAAAAA C 461 



55 



60 
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(2) INFORMATION FOR SBQ ID NO: 154: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2388 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doiJble 

(D) TOPOLOGY: linear 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

GCCCACGCGT CCGAAAQCGG AGAACGCTGG TGGGCCTGTT GTGGAGTACG CTTTGGACTG 60 

15 AGAAGCATCG AGGCTATAGG ACGCAGCTGT TGCCATGACG GCCCAGGGGG GCTGGTGGCT 120 

AACCGAGGCC GGCGCTTCAA GTGGGCCATT GAGCTAAGCG GGCCTGGAGG AGGCAGCAGG 180 

GGTCGAAGTG ACCQGGGCAG TGGCCAGGGA GACTCGCTCT ACCCAGTCGG TTACTTGGAC 240 

20 

AAGCAAGTGC CTGATACCAG CGTGCAAGAG ACAGACCGGA TCCTGGTGGA GAAGCGCTGC 300 

TGGGACATCG CCTTGGGTCC CCTCAAACAG ATTCCCATGA ATCTCTTCAT CATGTACATG 360 

25 GCAGGCAATA CTATCTCCAT CTTCCCTACT ATGATGGTGT GTATGATGGC CTGGCGACCC 420 

ATTCAGGCAC TTATGGCCAT TTCAGCCACT TTCAAGATGT TAGAAAGTTC AAGCCAGAAG 480 * 

TTTCTTCAQG GTTTGGTCTA TCTCATTGGG AACCTGATGG GTTTGGCATT GGCTGTTTAC 540 

30 

AAGTCCCAGT CCATGGGACT GTTACCTACA CATGCATCGG ATTGGTTAGC CTTCATTGAG 600 

CCCCCTGAGA GAATGGAGTT CAGTGGTGGA GGACTGCTTT TGTGAACATG AGAAAGCAGC 660 

35 . GCCTGGTCCC TATGTATTTG GGTCTTATTT ACATCCTTCT TTAAGCCCAG TGGCTCCTCA 720 

GCATACTCTT AAACTAATCA CTTJCTGTTAA AAAGAACCAA AAGACTCTTT TCTCCATGGT 780 

QGGGTGACAG GTCCTAGAAG GACAATGTGC ATATTACGAC AAACACAAAG AAACTATACC 840 

40 

ATAACCCAAG GCTGAAAATA ATGTAGAAAA CTTTATTTTT GTTTCCAGTA CAGAGCAAAA 900 

CAACAACAAA AAAACATAAC TATGTAAACA AGAGAATAAC TGCTGCTAAA TCAAGAACTG 960 

45 TTGCAGCATC TCCTTTCAAT AAATTAAATG GTTGAGAACA ATGCATAAAA AAAGTTGCAC 1020 

AAGnCCTTA TTTTCCTTAA TATTTCACTT CTATTTAATA CAAGCTGGGA CATAAAAATT 1080 

CTGTTGGGGA TACCTGGGGG AAGATGTGAG AAACTAATGC TGAATTCAQC TTATACATGA 1140 

50 

TGAAAAGAAA AACCAGACAA AAGGAGCACA TAAATATGCA TACAGTGTAA CTGTTATTAT 1200 

TTTAATACCC ACGATAAGGG ATTTTTGTTA GCATGTTTAG GGGGAACGAG GATTGGTGGG 1260 

55 ATCCTTGGGG CCACAGGAAT CTGAGGCAAC GGAAGATATA TAGAGTGATC GTCCCCCTGC 1320 

CGAAGGAACC TQGCAYCTGT CAAGCAGATG CTGCAGTTCA AACTTCAGCT TTTAAGATAG 1380 

ATAGCTATTG AAGGCAGAGG GTCAGCAGGA GGATGTGTAT TTCTAATCTA CCCTQGTAAA 1440 

60 
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GTCATAGGTA AGACTCAAAA GCGGGATCTT ATTCAAAAGG CAGGTATTTC CTTTGTTTTC 
TOTCTTCAAA TAGCCCCTTC CCCTAAGGTG CATTCTCTCA AGrTTTTCAGT ATTGCTTTAT 
TTCCAGTCAT TAAAAGAGAT GAGAGACTTT GGAGACAGAC AACCTAAGCA ACACATACAC 
ACATGAAATA CTCTAGACAG AGATGAATAT AAATCTGGCC TAATAACCAG TTTTCCATGT 
AACAGTGATT TTGTGTITCG GGCTGAAGCA GTGGTTATAT TAAAAGCCAC TAATTCCCTT 
ATCCCTTTAA AAGATTTTTA CAATTCTCCA ACCACAAACA GCACTTCTAA AACTAACTTT 
ACTTTCTXSCC CATAATTTGT TCTACATGGA AAAAAAAAAT ATTACTTTGG CCAQGGGTGT 
GTX3TAAATGT GGCAGAATTC CTAGGCAGGC TGACCTTTAC AGTATGGGCC TTTAAGATAC 
TCGATCCTQG TTGGGCAACA AGTGTCACGC CTGAAGTTTC TGAAAACAAA TTAGAAGACT 
CTTCGCTTCG CTAATCTCGT AGTTCAGGGC CAAGTTTCTG TAGTCAGAAT GAAGAATAAA 
ATTGAAAGAA AAAGGGGGAA ATGCTTATAC TTGQCATTAA GTTGAATGCC TCAAGTCTTA 
ACTATCGCTT TGTAGATGAG GCAAAAGATT TCTTAGTGGT AAAATTTCTT CAACAGGTCA 
ATGCCAATCT GTATGCCATT TTAGTAAAGT AGGTAAGGAG AGTAGCCGCT CAGTAACTTT 
GGCACTAAAG AAAGAGTOTG GCTCTAGAAC TTCCAATCCC ATTGCTAGAT GTGCCCTTTA 
AAAGASraGTC CAGTGCTTTC AGGGAAGGAT GTTTAGCCAG TTTTCCTAGT ATTTGTTCCT 
TAAGATTTTT TGACCTGTGC TTAATAAGAC GGACXXGTGG GTCGACCC 

(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEI»IESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 
AAAACAGACC ATTTAAAAAC TCAGACAAGA TTATATTTAA TATATTAATT ACTAAAAAGG 
CACAAGATTA CACTGAACAT ATTAGCTACT AAAAAGQCAC TGCTAAGACA TTCAAGCAAA 
TAGCTATTAC ACACTACTGC AGATTTTACA GGTTTCTAAT TCTAACATAT GTTTGAAAAA 
TCCGTCftGTA TTCCAAAATA TATTTAATAA TGGAATATCT • GCATTAATAT ACCATCCATG 
TOTTTTTACC ATTTCCCTTA ATATTGAATA TACTGTTTAC CTCACACTAA AAAGAAAACC 
AGAAGCCTTA TTTGTGATTT TGGGAGTGGA AGCTTCCATT TTTGTGTCAA AAATGAATCC 
TGATTCTTAT GGAAATCTCT GTTATTAAGA TATTTCAAGA TGAGACAACA CTGAAGATCA 
AATTOIX3TTT AGTATCACTA TCTTCTCTCC TCGTTTCTCT CTTACTCCTC ATCCTCCCAG 
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AATCTACCAG TTTATGGTAG AAAGATGGGA ACCTTATTTG AATGTGTTTT TTTTTTTCCA 540 

TGATGTCCAA TTTTGTTGTG GGAAAGGATT TGGATAAAAT TTTTGTTTAA ATTTTGGTAG 600 

ATTTTTATCT ATACAAATTT AAATAAAATT ATGTTTTGTA AG 642 

(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1251 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEI»IESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 

GCCGCTGCCC CTCCACGGAG TTGCTGATCA TCTGGGCTGT GATCCACAAA CCCQGTTCTT 60 

TGTCOCTCCT AATATCAAAC AGTGGATTGC CTTGCTGCAG AGGGGAAACT GCACGTTTAA 120 

AGAGAAAATA TCACGGGCCG CTTTCCACAA TGCAGTTGCT GTAGTCATCT ACAATAATAA 180 

ATCCAAAGAG GAGCCAGTTA CCATGACTCA TCCAGGCACT GAGCATATTA TTGCTGTCAT 240 

GATAACAGAA TTGAGGGGTA AGGATATTTT GAGTTATCTG GAGAAAAACA TCTCTGTACA 300 

AATGACAATA GCTGTTQGAA CTCGAATGCC ACCGAAGAAC TTCAGCCGTG GCTCTCTAGT 360 

CTTCGTGrrCA ATATCCTTTA TTGTTTTGAT GATTATTTCT TCAQCATGGC TCATATTCTA 420 

CTTCATTCAG AAGATCAGGT ACACAAATGC ACGCGACAGG AACCAQCGTC GTCTCGGAGA 480 

TGCAGCCAAG AAAGCCATCA GTAAATTGAC AACCAQGACA GTAAAGAAGG GTGACAAGGA 540 

AACTGACCCA GACTTTGATC ATTGTGCAGT CTGCATAGAG AGCTATAAGC AGAATGATGT 600 

CGTCCGAATT CTCCCCTCCA AGCATGTTTT CCACAAATCC TGCGTGGATC CCTGGCTTAG 660 

TGAACATTGT ACCTCTCCTA TGTGCAAACT TAATATATTG AAGGCCCTGG GAATTGTGCC 720 

GAATTTGCCA TGTACTGATA ACGTAGCATT CGATATGGAA AGGCTCACCA GAACCCAAGC 780 

TGTTAACCGA AGATCAGCCC TCGGCGACCT CGCCGGCGAC AACTCCCTTG GCCTTGAGCC 840 

ACTTCGAACT TCGGGGATCT CAOCTCTTCC TCAQGATGGG GAGCTCACTC CGAGAACAGG 900 

AGAAATCAAC ATTGCAGTAA CAAAAGAATG GTTTATTATT GCCAGTTTTG GCCTCCTCAG 960 

TGCCCTCACA CTCTGCTACA TGATCATCAG AGCCACAGCT AGCTTGAATG CTAATGAGGT 1020 

AGAATGGTTT TGAAGAAGAA AAAACCTGCT TTCTGACTGA TTTTGCCTTG AAGGAAAAAA 1080 

GAACCTATTT TTGTGCATCA TTTACCAATC ATGCCACACA AGCATTTATT TTTAGTACAT 1140 

TTTATTTTTT CATAAAATTG CTAATGCCAA AGCTTTGTAT TAAAAGAAAT AAATAATAAA 1200 
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ATAAAAAAAA AAAAACCCCG GGGGGGGCCC GGTCCCCAAT TGGCCCTATG G 1251 



(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2127 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOIjOGY: linear 



15 



25 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

CCGGCGGGAG AGGGAAGCTC CAGCGAGAGG CGCGGATCTC AGCGCGGGAG CAGTGCTTCT 60 

GCGGCAGGCC CCTGAGGGAG GGAGCTGTCA GCCAGGGAAA ACCGAGAACA CCATCACCAT 120 

20 GACAACCAGT CACCAGCCTC AGGACAGATA CAAAGCTGTC TGGCTTATCT TCTTCATGCT 180 

GGGTCTOGGA ACGCTGCTCC CGTGGAATTT TTTCATGACG GCCACTCAGT ATTTCACAAA 240 

CCGCCTGGAC ATCTCCCAGA ATGTCTOCTT GGTCACTGCT GAACTGAGCA AGGACGCCCA 300 

QGCGTCAGCG CNCCCTGCAG CACCCTTGCC TGAQCGGAAC TCTCTCAGTG CCATCTTCAA 360 

CAATGTCATG ACCCTATGTG CCATGCTGCC CCTGCTGTTA TTCACCTACC TCAACTCCTT 420 

30 CCTGCATCAG AGGATCCCCC AGTCCGTACG GATCCTGGGC AGCCTQGTGG CCATCCTGCT 480 

GGTCTTTCTG ATCACTGCCA TCCIGGTGAA GGTQCAGCTG GATGCTCTGC CCTTCTTTGT 540 

CATCACCATG ATCAAGATCG TGCTCATTAA TTCATTTGGT GCCATCCTGC AGGGCAGCCT 600 

GOTrOGTCTG GCTGGCCTTC TGCCTGCCAG CTRACACGGC CCCCATCATG AGTGGCCAGG 660 

GCCTAGCAQG CTTCTTTOCC TCCGTGGCCA TGATCTGCGC TATTGCCAGT GGCTCGGAGC 720 

40 TATCAGAAAG TGCCTTOGGC TACTTTATCA CAGCCTGTGC TGTKATCATT TTGACCATCA 780 

TCTGTTACCT GGGCCTOCCC CGCCTGGAAT TCTACCGCTA CTACCAGCAG CTCAAGCTT6 840 

AAG6ACCCGG GGAGCAGQAG ACCAAGTTGG ACCTCATTAG CAAAGGAGAG GAGCCAAGAG 900 

CAGGCAAAGA GGAATCTGGA GTTTCAGTCT CCAACTCTCA GCCCACCAAT GAAAGCCACT 960 

CTATCAAAGC CATCCTCAAA AATATCTCAG TCCTGGCTTT CTCTGTCTGC TTCATCTTCA 1020 

50 CTATCACCAT TOGGATOTTT CCAGCCGTGA CTGTTGAGGT CAAGTCCAGC ATCGCAQGCA 1080 

GCAGCACdC QGAACCTPAC TTCATTCCTG TGTCCTGTTT CTTQACTTTC AATATCTTTG 1140 

ACTGGKPTCGG CmSAGCCTC ACAGCTGTAT TCATOTGGCC TGGGAAGGAC AGCCGCTGQC 1200 

TGCCAAGCTG aiTGCrGGCC CGGCTGGTGT TTGTGCCACT GCTGCTGCTG TGCAACATTA 1260 

AGCCCCGCCG CTACCTGACT GTGGTCTTCG AGCACGATGC CTGGTTCATC TTCTTCATCG 1320 

60 CTCCCTTTCC CTTCTCCAAC GGCTACCTCG CCAGCCTCTG CATGTGCTTC GGGCCCAAGA 1380 



45 



55 
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AAGTGAAGCC AGCTGAGGCA GAGACCGCAG AGCCATCATG GCCTTCTTCC TGTGTCTGGG 1440 

TCTGGCACTG GGGGCTGTTT TCTCCTTCCT GTTCCGGGCA ATTCTOTGAC AAAQGATGGA 1500 

5 

CAGAAGGACT GCCTGCCTCC CTCCCTGTCT GCCTCCTGCC CCTTCCTTCT GCCAGGGGTG 1560 

ATCCTGAGTG GTCTGGCGGT TTTTTCTTCT AACTGACTTC TGCTTTCCAC GGCGTGTGCT . 1620 

10 GGGCCCGGAT CTCCAGGCCC TQGQGAGQGA GCCTCTGGAC GGACAGTGGG GACATTGTGG 1680 

GTTTGGGGCT CAGAGTCGAG GGACGGGGTG TAGCOTCGGC ATTTGCTTGA GTTTCTCCAC 1740 

TCTTGGCTCT GACTGATCCC TGCTTGTOCA GGCCAGTGGA GGCTCTTGGG CTTGGAGAAC 1800 

15 

ACGTGTGTCT CTGTGTATGT GTCTGTGTGT CTGCGTCCX3T GTCTGTCAGA CTGTCTGCCT 1860 

GTCCTGGGGT GGCTAGGAGC TGGGTCTGAC CGTTGTATGG TTTGACCTGA TATACTCCAT 1920 

20 TCTCCrCTGC GCCTCCTCCT CTGTGTTCTC TCCATGTCCC CCTCCCAACT CCCCATGCCC 1980 

AGTTCTTACC CATCATGCAC CCTGTACAGT TGCCACGTTA CTGCCTTTTT TAAAAATATA 2040 

TTTGACAGAA ACTAGGTGCX; TTCAGAGGCT CTCTGATTTA AATAAACCTT TCTTGTTTTT . 2100 

TTCTCCATGG AAAAAAAAAA AAAAAAA 2127 



25 



30 



40 



(2) INFOSMATION FOR SEQ ID NO: 158: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1625 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRAN0ECNESS: double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

CAAAAGATCT ATAATCAGGA CATTGTTTAT GTAAGTTGGA CAANAAAAAT TCTTCCCCTT 60 

TATGTCCACC CTTOCTATCA TTGCAAGACA AAATTTCCCT CCTTTACCTC ATCCCTATAA 120 

45 CATGGGAGGC TGAGAAAAAT GAGGGGAGAT GGAACCAGAT ACAAGGAGAT CCAATAAGAG 180 

AAGCTTATTT AAATATTGTG AAATAAAGGA AGAMCCAAAG CATTTTTTTA AGTGGGGAAT 240 

CCTTTTGAAC AGTTATTATT TATCCATATT ATTAAYAACA TCTTTTCTGA CAAAATCCAT 300 

50 

CAGATGAAGT GTAAATGGAT AATCTTTTAA TGGATCTAAA CCTAGAAAGT TTCACTTACT 360 

GTTCATGTCC GTGTTCCAGA ATTGTGAAAT GGTGTGTGGT TTTGCTTTCC AAGTTCTTCT 420 

55 CTGCCTCCTC TTAATTCTCT AATTCCATGT CTTACAGAAG AATGAGAAAT TTCTTTCTTA 480 

CTTGAGTATC ATGCTCTAAA AAACTTGGCT TCAGTCACAG AAACX3CTQGC TCTCCTGTGC 540 

TTATATTGAA GCCAACTGCC TTTAATTCTT QGGCCCTCTT ATATTTPTAA . GGTGCAAAAT 600 

60 
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TTGAAGTCTC AGTCACCAGA CACAGGTTCT ATACAATTAA TGATGAGCTG GAGAAGTAAT 660 

ATGTAGCTAA TTTTTCAAAA GCATTGAATA TACTTTCCGG AAAGAAAACA GAAATTAAAT 720 

5 ATTGCCACAT CTTQCCAGAA TCCCATCTGA CACCTTAACT TTGTCAQGTT TCCTACAACT 780 

TGCTAATCAA GTTTTATACA TTCTAAATCT CCCCAGTTTC TTTQGGGCTG GAAGATQCAA 840 
CTTCCMTTA ATAGAAACTT TGAAATCTTG GGGTAAGGGA GCAGTGGGQG GACTAGGGAG * 900 

10 

AAGGATAAGA AATAGAATTA TTGAAAAGCC CCCACCAGQG ACCTTCCTGG CCAGAATATG 960 

CAGAGTAATT CCTGCTGGCT TCACXnTTGA AAGTCCCTCG AAACTATGCA GATGAAACTG* 1020 

15 AGTCTGTTTT TGATATTGTC AGATGTATTC TACCTTGGAA GTCCCNACAC CTAAACTGGA 1080 

ATTCTTGTAT TTACATCTCC TCCACTGTCC CCCACACCAC CCX:TCAATTC CTGCTGCCCC 1140 

TGCTAATGTT AAGCATTTTT CTCTTGTTAT CATCAGGTTC ACATTAAAAM CAGRTACTTA 1200 

20 

CAAACTGACT TGAAGCACAG ATACTTTTAC GAATGTGATA AAATATTTTC TTAAGAAAAG 1260 

GAAAGAGGAT GTGGGTCAAA TAAAACACCX3 CATGGATGTT GATTGGTCAA TACTGGTGTA 1320 

25 AGAAAAGGGA GCTCAGGAAT TTTTATTACT GTATTTGTAA ATGAGTTTGA AGGAATTTGT 1380 

AAATGCCACT GGTACATTTT TAAGGTGACA CATT3X3CTCC TTATAAAGTT ATTAAAAATT 1440 

ACAGGGTAAG CTTAAATGAC GTTTGCCAGT AGTTTTACTT TATATAATCA ATATTGATAT 1500 

30 

TGTTGCTGAA CTATGTAACT TTATGATGCA TTTTTCAGTC CCTTTTCAGA GCAAATGCTT 1560 

TTGCAATGGT AGTAATGTTT AGTTTAAATT GACTTAATAA ATTWrTACCT GAGCAAAAAA 1620 

35 AAAAA 1625 

40 (2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1687 base pairs 

(B) TYPE: nucleic acid 
45 . (C) STRANCSEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

50 CGGGGTCACC AGTTATTAGA GGAAGTAACA CAAGGGGATA TGAGTGCAGC AGACACATTT 60 

CTGTCCGATC TGCCAAGGGA TGATATCTAT GTGTCAGATG TTGAGGACGA CGGTGATGAC 120 

ACATCTCTGG ATAGTGACCT GGATCCAGAG GAGCTGGCAG GAGTCAGGGG ACATCAGQGT 180 

55 

CTAAGGGACC AAAAGCGTAT GCGACTTACT GAAGTGCAAG ATGATAAAGA C3GAGGAGGAG 240 

GAG6AGAATC CACTGCTGGT ACCACTGGAG GAAAAGGCAG TACTGCAGGA AGAACAAGCC 300 

60 AACCTGTGGT TCTCAAAGGG CAGCTTTGCT GGGNATCGAG GACGATGCCG ATGAAGGCCC 360 
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TCGAGATCAG tcaqgcxx::ag CTGTTATTTG AGAACCGGYG GAAGGGACQG CAGCAGCAGC 420 

AGAAGCAGCA GCTOCCACAG ACACCCCCTT CXrTGTTTGAA GACTGAGATA ATOTCTCCCC 480 

5 

TGTACCAAGA TGAAGCCCCT AAGGNAACAG AGGCTTCTTC GGGGACAGAA GCTGCCACTG 540 



GCCTTGAAGG GGAAGAAWVG GATGGCATCT CAGACAGTGA TAGCAGTACT AGCAKTGAGG 600 



10 AAGAAGAGAG CTGGGAACXC TCCGTGGTAA GAAGCGAASC GTGGGCCTAA AGTCAGATGA 550 



TGACGGGTTT GAGATAGTGC CTATTGAGGA CCCAGCGAAA CATCGGATAC TGGACCCCGA 720 

AGGCCTTGCT CTAGGTGCTG TTATTGCCTC TTCCAAAAAG GCCAAGAGAG ACCICATAGA 780 

15 

TAACTCCTTC AACCGGTACA CATTTAATGA GGATGAGGGG GAGCTTCCGG AGTGGTTTGT 340 

GCAAGAGGAA AAGCAGCACC GGATACGACA GTTGCCTGTT GGTAAGAAQG AGGTGGAGCA 900 

20 TTACCGGAAA CGCTGGCGGG AAATCAATGC ACGTCCCATC AAGAAGGTQG CTGAGGCTAA 960 



GGCTAGAAAG AAAAGGAGGA TGCTGAAGAG GCTGGAGCAG ACCAGGAAGA AGGCAGAAGC 1020 



. CGTGGTGAAC ACAGTGGACA TCTNCAGAAC GAGAGAAAGT GGCACAGCTG CX3AAGTCTCT 1080 

25 

ACAAGAAGGC TGGGCTTGGC AAGGAGAAAC GCCATGTCAC CTACX?rTGTA GCCAAAAAAG 1140 



GTGTQGGCCG CAAAGTGCGC CGGCCAGCTG GAGTCAGAGG TCATTTCAAG GTCGTGGACT 1200 
30 CAAGGATGAA GAAGGACCAA AGAGCACAGC AACGTAAGGA ACAAAAGAAA AAACACAAAC 1260 



GGAAGTAAGC AGAGCTGCCA GQCTCCXrAGG AGAGCATGGG GACTAGGAGG AAGGGTGTQG 1320 



CATGGCTCAG TCTGGCCCCC TTGATTACCG GCCTAGCCCC TGCTCACATC ACAGCTGTCT 1380 

35 

GAAGAACAGT GAGGTGGAGT GCCTAGAACT CCCGTGGTGG TCCTGAGCAG AGAGGAGGAT 1440 

GTCCTCCTGC CTGCCTGAAG GTCTCCCATG AAAACACTGC TGAACTGTGT TGACACTCAT 1500 

40 GACCCTTTTT TTAAACCGTT AAAQGGAAGT TCGGTGTTGG AGCGATACTC AATGTAGTCA 1560 



GTCTACACCT GGACGTGTCG GCCACTTAAG CCCTCCCCAC CCCCATCCTA TTCCTRAATA 1520 

AAACCAGGAT AATGGAARAA AAAAAAAAAA AAAAAAAAAG GGGGGGCCCM TAAAGGCaK:C 1680 

45 

CANNTTT 1687 



50 

(2) INFOBMATIOM FOR SfEQ ID MO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

. (A) LENGTH: 1842 base pairs 
55 (B) TYPE: nucleic acid - 

(C) STRANDEI»3ESS: double 

(D) TOPOLOGY: linear 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 
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GGATGACAGA TTGCGACANA GATTTGTGAC 
ANCAGCGTAT GATCAAAGAC AAAGGCAGGG 
5 CX3CATCTGTG CCCCGAGAAT CCTTTACTTC 
GAAACAAGAA GAAAAAAACC ATTGGTTCAC 
AGCTGCTTAA CAGTCCTGCA AAAACTCTGC 

10 

TTGATGGGTT TCTAAAACAT GAAGGACCTC 
CTTCTACTTC AGGTC3TGCCA GGCCTTTCTA 
15 GACCTCCAGC ACCCAATCTA GCTGGAGCTG 
GAGAATGGAT AACTACAATT TCAGATCCAA 
ACTGTACTGA TCTAATAGAA GAAAAAGATT 

20 

TGAAAAGGCT GATGCAGCAA TCQGTGGAAT 
TTGACAATGT CCAGGTGGTT TTACAACAAA 
25 TATTACCAGA GAGCCTGATG CTCTCTGATA 
CAAAGTGCAT GATAGTAATG CTCGGACSTTT 
GTTTTGTACA TTTCTTTTCA AAAAGTGCCA 

30 

GTTAATTATT TTACTGTAGC ATAGATTCTA 
GGATTTTTAC AGTGAAGTGT TTACAGTTGT 
35 RQGCTCCTTT TKGTQAAYCC TTAAAAACTC 
CTAAARGGCT GAAAAMCCTC CAGGQCAGAC 
AGACCGGGAT TCTACTTGTT CCAAGAAAGG 

40 

TCCAAGCAT6 AACACAOGAG CATGrVAAGA 
AATCTACATA TTTTGAATTA GAAACACCCT 
45 ACATTATGTC CCGTAGATCA GAGGTGGTGT 
ACTTTGATGA TAAAAAAGAA CGGTATAGAT 
TATATGTTAT GCCATAACTT TAAAATAAAA 

50 

TGGAACTTTT TCCTCAAACA AACACCXTAC 
ACAGATTACT ACTACGAATG AATCATYAAG 
55 AGCCCAAATA TCAGGAAATG TGTGTATGAT 
TAAAACAGGA TCAAGGATTA ATGGTATAAA 
AAACAGGATC AAGGATTAAT GGTATAAAAA 

60 



412 

CCTTCCTGCT GAACTTCAGA GGGAGCTGAA 60 

CGAGAACAGC ACTCACCAGC AGTCAGCCAG 120 

ATCTAAAGGC AGCAGTGAAA GAAAAGAAAA 180 

CAAAAAGGAT TCAGAGTCCT TTGAATAACA 240 
CAGGGGCXnx; TGGCAGTCCC CAGAAGTTAA " 300 

CTGCAGAGAA ACCOCTGGAA GAACTCTCTG 360 

GTTTGCAGTC TGACCCAGCT GGCTGTGTGA 420 

TTGAATTCAA TGATGTGAAG ACCTTGCTCA 480 

TGGAAGAAGA CATTCTCCAA GTTGTGAAAT 540 

TGGAAAAACT GGATCTAGTT ATAAAATACA 600 

CGGTTTGGAA TATGGCATTT GACTTTATTC 660 

CTTATGGAAG CACATTAAAA GTTACATAAA 720 

GCTGTGCCAT AAGTGCTTGT GAGGTATTTG 780 

TTATAATTTT AAATTTCTTT TAAAGCAAGT 840 

AATTTGTCAG TATTGCATGT AAATAATTGT 900 

TTTACAAAAT GTTTGTTTAT AAAGTTTTAT 960 

TTAATAAAGA ACTGTATGTA TATTTGGTAC 1020 

AACTCTAGGA RGCAACTACT GTTTATTATA 1080 

TGCTAAGCTC TGAAATYCCT GAGAGGTCTC 1140 

GTAAAGCTTC TAAACCATCT TATTCTTGTC 1200 

AAATCTTTAC TACTTTCTYC CATGCGGAGA 1260 

CACACCCACT TGAAGATTTT TTTCCTGGGA 1320 

TGTCTTTTTG CTTCTACTGG CCATTGAGAA 1380 

TTTTCAAACG TATATAAAAT ATTTTTATGT 1440 

ATAGTTTAAA ATTCTATGCT AGTGGATATT 1500 

ACTGACTTCA QCAAAACCCT AAAACTAGCT 1560 

TTTTGTGTCT QCAACAATTT AGAAGCACTA 1620 

GGAATTTTCT AGGACAAAAC AGATCAAGAT 1680 

AATGGTCTAC TAAAACAGGA TCAAGGATTA 1740 

TCTCTACTGG TTACCGGGTG GCNGGGCCAT 1800 
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ACAGQGTAGT GGTGGATGGA TAGTTTAGTT TGGNAAGQGT AA 1842 



(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 770 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



15 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 161: 

GGCACGAGCC CTATGCTGTT CTTGTGATAA TGAGTGAGTC TCACAAGATC TGGTGGTGTT 60 

ATAGGCATCT GGCATTTCCC CTGCTGACQC TCATTCTCTA TCCTGCCACC CTGGGAAGAA 120 

20 GTGTCTTCTC TCATGATTGT AAGTTTCCTG AGGCCTCCCC AGCTATGTAG AACTGTGAGC 180 

CAATTAAACC TCTTT T CTCT ATAAATTATC CAGTCTTATA TATTTCTTCA TAGCAGTGTG 240 

AGAACAGATA ATACCGTAAA TTGGTATCAC AGAGAGTGGG GTGTTGCTAT AAACACATCT 300 

25 

GAAAATGTTA AAGCAAATTT GGAACTGGGT AACAGGCAAA GGCTGGAACA GTTKGAAGAA 360 

CAGTTAAGAA GAAGACAGGA AAATATGAGA AATCTTGAAA CTTCCTAGAG TCTTAAAOGT 420 

30 CTCAGAAGAC ATGAAGATGT GGGAAGCTTT GGAACTTCCT AGAGACTTGT TTGAATGGCT 480 

TTGACCAAAA TGCTGATAGT GATATGGACA ATGAAGTCCA GGCTGAGCTT ATCCAGACAG 540 

ACATAAGAAG CTCGCTGGGA ACTTGAGTAA AGATCACTCT TGCTAGGCAA AGAGACTQGT 600 

35 

GGCCTTTTTT CCTCTGCCCT AGAGATCTGT GGAAATCTGA ACCTGAGAGA GATGATTTAG 660 

GGTATCTGGC AGAAGAAATA TCTAAGCGGC AAAACCTTCM AGAGGAAGCA GAGCATAAAC 720 

40 GTTTGAAAAA TTTGCAGCCT GACNATGGGA GACCAAAGTT AAACCCAATT 770 



45 (2) INPORMATIC»I FOR SEQ ID NO: 162: , * 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS: doxoble 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»I: SBQ ID NO: 162: 
55 GAATTCGGCA CGAGCTGAGA GGCACAGGAG CAACAGCCAG TGCCCCCTGC AGAGGACCAC 60 
TGGGGTCACA GACTTCARAC CTGATGACCT GGGCTCAGAT CCCAGCTCTG CACCTACCAG 120 



60 



CCGTGTGACA AQGTGTOCTC TCTGAQCCTC AGTCACACAC TQCCTTAACG GTTQQGCCTC 



180 



wo 98/54963 



414 



ATQGAGCTGT TTGTCAAGGT TAAATQGGAA GACATAAAGC ACTTAGCCCA GAGCCAAGGA 
CATGCTGAAT AGGATAATGG TQGCCTCCTT TGGCGCTGTG CTGC5TGCAGG TGTGCCGAGG 
AAYTGGGCAG GGGTGACAGA TACCTCTTCT AACCTAGTTC CTTTCCAAGA ACCTAATTGG 
TGTCTCTCCC TCCCCCAGGC AATTGGAAGG AGGAGGCTGG GCCCCAGCCC CAGAATACGG 
GAGGTTTCTC ACCGTGGTAG GGAAATTGCT GGGTTGGGGG TGTGGGCAAC CACAGTGATC 
GTCTCTCTGC AGGftCGGATG AGGCTTTQCT GACAGAGGC 

(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHT^CTERISTICS : 

(A) LENGTH: 753 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 
GGCACGAGCG GCACGAGCAG CCAGTTGCTG ACTGGCACAT GGCCTCCAGC GTCCCGGCTG 
GTGGGCACAC TAGAGCCGGA GGGATCTTCT TAATTGGTAA ATTGGATCTT GAAGCTTCAC 
TGTTTAAATC TTTTCAJGTQG CTPCCCTTTG TACTTAGAAA AAAATGCAAC TTCTTCTGCT 
GGGACTCATC CGCTCACAGC CTTCCCCTCC ACCCTCTCTC TGCCTCATGC TCTGCCCCTG 
CCTGCCATGC CTCCGATACT CACCTTTTGT ACCCCAGCAC CCGTGCCCTC TGCCCCTCGA 
TCTTTGCCTG GCTGGTTGCT CCTCACTCAG TGTTCAGGAC AAATGCTCCT GGCCCTACCC 
CATCTAGCCA GTCTAGCCCG GTCTTCCCTG TCTTCCCTGT TTCATTCATG GCTCTTATTG ' 
TTTGTTWACT TGTGTGCTGT TGACTTTTAA CTCTCTCAGT CCCCACTGGA ATGCAAGCGA 
TCTCCCAAGC TCCTAGAATT GTTCCTGCCT CTTCACAOGC CCTTACQCTG TGTGTGCTOG 
TGCCGAATTC QGCACGAGGG TATGTGCACT TGCTGGTATG TATGTAGGTG TTTGCTAACA 
CATACGTGCA CACGCAGAAT GCTTCCftGGG GACTGCACAG CCTCTAGTTC GCAGCCCCCA 
CCCCTCCCTT TGSCOCTGCA CTCTCCCCTC TCTGAGCTGC ATTCGCATGA AAGQGTGCftN 
GGTTCCTGAN CCCGCNAGCG NCACCTCCTG GGA 

(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1400 base pairs 

(B) TYPE: nucleic acid 

(C) 5TRANDEDNESS: double 
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(D) TOPOLOG*/: linear 



(xi) SEQUENCE DESCRZ?TrCN: SEQ ID NO: 164: 
5 GGCACAGTTT ATTAATACCT ATTATGGGAA AGTCACTTTG GTTGGCATTG AAAATTACAT 60 
CATCTTTAAA GCAGTATTTG TCCCCAGATG GACTCATCAC TAGCAAAGAC TAGGTTCATT * 120 



GGAAGGCATA GQGTGAGAGA ATGGGMGAT GRAGTGG3VGG CGC3GTTGTTA AAGTGCTGTC 180 

10 

AGTGAGTGAT TTTGTCTACT TGAATAATGG TCCATGTTTG GGGGCATATT GTGTTTCATA 240 



AGAAGTGAAA GGTATTTGCA AAGTAAGCTA CAAATGACCC ATAAATCTGT TAACAACAGT 300 
15 CCTTAATATG CAAiSkGATGAA AAACAAGCAT TACTGCTACC CAAAGGGAAC TGGTGCTTGG 360 



TGATGTGCAG ATGGGGCTGT TGGTTAAGAG AGCTATTACA GGTTTTCTCT CTTAGGTTTC 420 



ATAGGAGGTA GTTACTGAGA TGAGATTGTT TTATCTTTTT GAATACAGAT CTCTTGTCTT 480 

20 

GAGTTAGrrC TGAGGATGGG AGTAATAAAG GAGTTTTTTG TTTTTTTGTT TGTTTGTTTG 540 

TTTTGGCTCC TTAGTAATAC TCCTCTGACA TTTATTTCTA TTATTCTTCA AAGAAAGGAA 600 



25 ACCAACTGAA ATGTTTGCTT TAACAAACAT TTTAATAAGT TCTCTGQGTT TTTTTTTCCC 660 



CTTTTAAAAA AATTAGCATA TACCATAGCA ATAAAAGAAC TAATGTTAAC TATTGTATGC 720 



TACAACTTAA GTGATTTTTC TAAAGAAGCA CAATGTCATT GRAAGTATTA TTGAAAAC3GA 780 

30 

TCATAGTCAC ATTGAATTTG TGAAGGCCAA AGAAATTGAA GGGAGTGATA TTTTCATTTT 840 



ATGATATTCA CATATTTAGT AAATTTTGTG TACAAGAATA CCAGGCAGAG TGTTTTACCC 900 



35 ATGGAAACAG GTTTCAGATT ACTTTGTTTT TACTGTTAGA GTCTCAAGTT TAGAAATGCT 960 



AACACTTAAA TCAGTTTTTT TCTCACTATA CTTGAAGATT GTTAATATTT TGATATCTTC 1020 

CTAGCTTGAT GGAATTTAAA CATATCTTCA GATCTGTGAC AGTGACAGCC AATAGGACTG 1080 

40 

ATAATATTAG CTTCAAACCA ATAATATCCA GGGTTAAAAT AAAAATCATA GTGAAAGTAC 1140 



GATTGTAAAA TTATGCTATA TTAACTTTrA AGTCTGTAAT AACTTGACAT CAAAATGTTA 1200 



45 TGTAATTACC ATAAATAATG GCTAGCGAGA ACATCnTGG AAATTCTCAA ATTACCTTTC 1260 



TTACTACACT GTTTGCAGAA TGAATGTAGA AATGATCCTG TTAGCTTTCT GAATGTTCTG 1320 

TGGTTGAATG TGTTTTTGCT TAAATAAAGC TTTTQGTATT TGTTTAAATW ACAAAAAAAA 1380 

50 

AAAAAAAAAA AAAAACTCGA 1400 



55 

(2) INFORMATICai FOR SEQ ID NO: 165: 

(i) SEQUENCE CHAHACTERISTICS : 

(A) LENGTH: 2153 base pairs 
60 (B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

Ui) SEQUENCE DESCRIPTION: 5EQ ID NO: 165: 

5 

CAGGCCTCAG GGCCTCTGGT QGCTCTQGCC CAGACAGTAT TTGCAGTTCT TGTGCTATGG 60 

GTGGGAGTCT TCTTCCTCAA GTTTCGGCAG CTGTGCTGTG NCTGGATGGG CTGCTCCTCC 120 

10 CAGGGCTCAA GGCCTCTGGT CCGCTCAGGG TCTCATTTCC CCAGGCCAAG TTCAAGGCAG 180 

CAGCCCTTTC TGAGGCGCTC TTGGCCCTQG GCTGGAGGGA GAACTTTAAG CTTTTTTGCT 240 

CACAGGGACG TGGTATGGGC CCTGGGTGCA GGTGCCCACA TTCTGCTAAT GAGAGCTTTC 300 

15 

TCTGATCAGT CCTGGGTGCA TCAGTTTCTC CATGTGTCCG GCTGCCAGCC CGTCCCTTGG 360 

GATCCTTCCC CTGGGGTGTA GCCTTGTTCA TTAGTATATA CTCATTCCTT CATGCTTTCC 420 

20 TCAGCAGAAC ACTTCCACTT CTGAGGTGAG CTTTTGCCCC RTGCCCTTCC TCCACAGGTG 480 

TTGCCTTTTT ATAAAGACCT GATAGCAGAA TAAATTGGTG TTTCCCTGTT GACCCAGCAC 540 

CATTTCTGTC GGCCTAGAAT ATGGCCCTCA ACCCTTAGAG TQGGGCAGTC AGGGCTTGAG 600 

25 

GAGTGACCCT TCCTTTCTCA TGGTTTTAGT CATTTTQGCT GCCAGCCCTT AATGGCACAG 660 

ATCTGCTGCT TCTAACAGAT GGCCAGGAGG TGACACCGAT TTCAGCCATT QCCAAGGTTA 720 

30 GCACCCTCTC CTTTGAGCCT AGGGCCACAC TGTTCATTGT CACTTTAGGC AAGTGCCTGT 780 

TTQGCTTTAA AQGTAAQCCT GCCAGCTGTC AGAAGCCTTG GTAACTGATG GACTCATTTC 840 

CTGGTCCTTA AAGATGCAGC CTCTTAAGGG CTCCTTGATC GATGCCATCT CTCCTAGCCC 900 

35 

CCAGCCCTGG TGCCACTGGT GGGCAGGTTC CCATTCTTTG GGGCTGGGAG GGACAGCTTG 960 

CCTGTTTCTC GTCACAAATT ACAGTCTTCT CTCCTGTACC ATTCTGTGGC TTCAGCATGG 1020 

40 GQGCAGTAGC CTTTCATTAG TGTAGATAGT CATTCCCTGG TAGQGTQGAG GGTAAGACAT 1080 

AQQGTCTQGA ACTGTTTQGG ACCTTTTGGG GATGTCCTGT GCCTCCCAGA TTCCTMGATT 1140 

CTQGGAGGAG AGGCTGCCGC ATTCTGCTGC TCCTCACAGC GAGCAAAGCT GCACCCACTT 1200 

45 

ACATTCAGTA TTTTCCTGGC ACTACAAAGA GTGGGAAGGC CTGGGATTTG CTGCTGCTCC 1260 

CTTAGAGCAG GGCCCCTnT TTCAGCACTT TGGACACCTG GAGACCCAGC CCTGTTATTT 1320 

50 AATGGTAGTG GGCAAGTGTG TGTGCATACT GTCTGCCACT GCTTTCTCCC TGCCCCATGC 1380 

CAGAGAGCCC TQTCCCTGCC AGGCCCAGCC TTCTTAOCCC CAACTTGGGA ACAAAGTGCA 1440 

ACATGGGATC ATGGGTTGGG GTGCTCAGGT GAGCCCTCTC TATAGTGCTT CCCTGGGCCA 1500 

55 

AGCTGACACC AGCCCCTGAG GGTGGGGTGG GACGGGTGGT GCTTAAAAGA GGAAGGGGAC 1560 

CAGTGTAGCA ACTTGCCAGG GACCCCACCC CTCCCTCTCT QGGCCTGTGC AGTGAGCATC 1620 

60 GGGATTCCCA TCAAGGGGCC TGGCACCTGT GCTAGTTACG TAGCCGCTGN TCAOGCGCTC 1680 
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ACTCCTGACC ACATGCACGT TCCCTAGATG CAGACTQCTT TCAACTTTAA AGCTGTACAA 1740 

TTTGC3TTATG TTTX5TGCTGA CTTAAAATAT ATTTTAATGA GGAAAAAATA ATGGAGAACC 1800 

5 

CTGGGAAGGA CCTO3TTCTT TTGCTTCTCG GQGAACTGTA AGCOTrCGCG TTCTGGGAAT 1860 

CGCTCTCTGC TGCTCTTTCC TGGAAGCTAA C3CCTCTCTCC ACCX^CCCGAG GCCTGCGCCG 1920 

10 GTGCTCCCGC CGCAGTTGCG TTTGCTTTGG ACCTTGCGTG CGGGGGAGGG GGTGCTCGGT 1980 

CCGAGCCCGC TCCTTTCTGT ACACCTAGCG CTGCCCGCCC CGCTTGTGTC TGAGGTCGTG 2040 

TATGTCAAAA ATAAAGCCGC TAGAAACGGA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA . 2100 

AAACTCGAGG GQGGGCCC6T ACCCAATTAA CCCNNTATGA TCTATAAAC3C GTC 2153 



15 



20 



30 



(2) INFORMATIGN FOR SEQ ID NO: 166: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1251 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY': linear 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 166: 

GCCCACGCGT CCGCCCACGC GTCCGGCGGT GCGGAGTATG GGGCGCTGAT GGCCATGGAG 60 

GGCTACTGGC GCTTCCTGGC GCTGCTGGGG TCGGCACTGC TCGTCGGCTT CCTGTCGGTG 120 

35 ATCTTCGCCC TCGTCTGGGT CCTCCACTAC CGAGAGQGGC TTGGCTGGGA TGGGAGCGCA 180 

CTAGAGTTTA ACTQGCACCC AGTCCTCATG GTCACCQGCT TCGTCTTCAT CCAGGGCATC 240 

GCCATCATCG TCTACAGACT GCCGTGGACC TQGAAATGCA GCAAGCTCCT GATGAAATCC 300 

40 

ATCCATGCAG GGTTAAATGC AGTTGCTGCC ATTCTTGCAA TTATCTCTGT GGTGGCCGTG 360 

TTTGAGAACC ACAATGTTAA CAATATAGCC AATATGTACA GTCTGCACAG CTGGGTTGGA 420 

45 CTGATAGCTG TCATATQCTA TTTGTTACAG CTTCTTTCAG GTTTTTCAGT CTTTCTGCTT 480 

CCATGGGCTC CGCTTTCICT CCGftGCATTT CTCATGCCCA TACATGTTTA TTCTQGAATT 540 

GTCAIPCTTTG GAACAGTGAT TGCAACAGCA CTTATGQGAT TGACAGAGAA ACTGATTTTT 600 

50 

TCCCTGAGAG ATCCTGCATA CAGTACATTC CCGCCAGftAG GTGTTTTCGT AAATACGCTT 660 

GGCCTTCTGA TCCTGGTCTT CGQGGCCCTC ATTTTTTGGA TAGTCACCAG ACCGCAATCG 720 

55 AAACGTCCTA AGGAGCCAAA TTCTACCATT CTTCATCCAA ATGGAGGCRC TGAACAGQGA 780 

GCAAGAGGTT CCATGCCAGC CTACTCTGGC AACAACATGG ACAAATCAGA TTCAGAGTTA 840 

AACAGTGAAG TAGCAGCAAG GAAAAGAAAC TTAGCTCTGG ATGAGGCTGG GCAGAGATCT 900 
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ACCATGTAAA ATGTTGTAGA GATAGAGCCA 
AGTTTTGCTT CTCCTATTAG CCATATGATA 
AATCACAAAG GATGGTTTCT TGAAATAATT 
AATTGGAAAG GATGTGATTA ATATAAATAA 
TTATCTTGTT GAQGACCACA ACATTAGCAC 
GTGAATATGT GTCTACTAGT AGTTAATTGG 



418 

TATAACGTCA CGTTTCAAAA CTAGCTCTAC 
ATTGGGCTAT GTAGTATCAA TATTTACTTT 
TGTATTGATT GAGGCCTATG AACTGACCTG 
TAGCAGATAT AAATTGTGGT TATGTTACCT 
GGTGCXrTTGT GCAKAATAGA TACTCAATAT 
ATAAACTGGC AGCATCOCTG A 



(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 882 base pairs 

(B) TYPE: nucleic acid 
<C} STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

GACSMTCTAG AACTATGGTC CCCCGGGACT GCAQGAATTC GGCACAGCGG CTGCGGGCGC 

GAGGTGAGGG GCGGGAGGTT CCCAGCAGGA TGCCCCGGCT CTGCAGGAAG CTGAAGTGAG 

AGGCCCGGAG AGGGCCCAGC CCGCCCGGGG CAGGATGACC AAGGCCCGGC TGTTCCGGCT 

GTGGCTGGTG CTGGGGTCGG TGTTCATGAT CCTGCTGATC ATCGTGTACT GGGACAGCGC 

AGGCGCOGCG CACTTCTACT TGCACACGTC CTTCTCTAGG CCGCACACGG GGCCGCCGCT 

GCCCACGCCC GGGCCGGACA GGGACAGGGA GCTCACGGCC GAYTCCGATG TCGACGAKTT 

TCTGGACAAK TTTCTCAGTG CTGGCGTGAA GCAGAGTGAC YTTCCCAGAA AGGAGACGGA 

GCAGCCGCCT GCGCCGGGGA GCATGGAGGA GAGCGTGAGA RGCTACGACT GGTCCCCGCG 

CGAMGCCCGG CGCACCCAGA CCAGGGCCGG CAGCARGCGG ANCGGAGGAR CGTQCTGCQG 

GGCTTCTGCG CCAAYTCCAG CCTGGCCTTC CCCACCAAGG AGCGCGCATT GRAOSACATC 

CCCAACTCGG AGCTGAGCCA CXTTGATCOTG GACGftCCGGC ACGQGGCCAT CTACTGCTAC 

GTGCCCAAGG TQGCCTGCAC CAACTGGAAG CGCGTRATGA TCGTGCTGAG CGGAAGCTGT 

GCACCGCGTG CGCCTACCGC GACCCGYTGC GNTCCCGCGC GAGCACGTGC ACAACGCCAG 

CGCGCACTGA CTTCAACAAT TCTGGCGCCG CTACGGGAAG TCTCCCCCAC CTCATGAAGT 

CAAGCTCAAG AATACACCAA TTCTTTCTQC GCGACCCTTC TG 



(2) INFORMATION FOR SEQ ID NO: 168: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENC?rH: 1208 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
5 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 168: 

GGGAAACTCA AAAGGAT6AT GGAATGGTT6 ATGGAGCCAG AGCCTAGAAG TRAAGGGATA ' 60 

10 

CAGAGTGAAG ATAGAGGTAT TTACGTATAT TTWAATATTA GCTTTGGAAT TACGTAGGGA 120 

TTCTTAAGAA AAGATCATGA CAGGACAGCC ACATTTQGTA AAATGTCAGG GCAGCCAGTG 180 

15 CATGGTCCTC CTGGGGCTCC TCAGTTGACG GGTTTAAATC ATTTCCTGAT CCCCCTGCCC 240 

TGGTTTGAGG AATGCATACA GTACGTGAAA TGCCTGTGGT ATGAGTTGCA ATGGGCAATC 300 

AACCTGGGTA AATCCAAGAT TAATGATTAG TTCTAAAGAT CCAGTTGAAG TTCTAGAGTG 360 

20 

GGAATTTTCC GTCAAGCARC TCAGCACAGC TTTATGCCTG TTCCTCTAAT. AACGATAGGT 420 

AACAAATAGC TGTGTKTWCA CAGCTAGGAR GATAACCAAA TCTAGAGTTC TTGARTCTCA 480 

25 TTTAATAAAT AAKTATTATG AGTACCAACT GCATATTTCA QGCACTGCAT TTGACTCTGT 540 

TAAATACTGA TYCCTTAKGA CMSCCACOTC AGAWAAOfTT AATCTGTCTG ATCAATAAAC 600 

AGCTTGACTT AGAGRGGTAA AATAGCTTGC - CACAGGTWAC CCAATTAGTA GGTAACAGCG 660 

30 

ACAGAATAAC AGTGCAGTTA AAATCTTAGA CTGGAGACTA ATTGCATAAG TTTGAATTTC 720 

AGTTCTGCTA TGTAAATTTG GGTGW3TACC TTAATTYACC TGAGTCTCGG TCTTTATATC 780 

35 TGTAGAATGG AGCTAATGAT ATTACTTAAT TTGCTTTATG TGAGATTAAA TGTACTAATA 840 

TATGTAAATC ACTTACAACA GCATTTGACA TATTTGACAT ACTTAATATA TTTGCTACTA 900 

ATACTATTAG CAACAGCATT CT6ATTTTCC AAGTTGAAAT TCAGTGTTTT CTTTTTTACT 960 

40 

TTGCCATAAT TTACAATC3TT GTGCTCTGTA AACCATAAAT TTCCCTGAGG TGTTGTCAGG 1020 

TTAAAAAAAA ATCACTATGG CCCCCARNMA CTTGGAAAAT AGAAATGAGA CCAGCTTCAT 1080 

45 CTATATTCTT TACTGCAAAT AACITAGAAT TGTAATAGGC TAATATGTAC TGGGACTTCC 1140 

AATTTGGGAA TATGACAAAA ATAATACTAT TTAGCTAAAA CATATACAGA ACTTATTTTT 1200 

CCrCTGAA 1208 

50 



55 



(2) INFORMATK^ FOR SEQ ID NO: 169: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1307 base pairs 

(B) TYPE: nuclieic acid 

(C) STRANDEDNESS: double 
60 (D) TOPOIjOGY: linear 



wo 98/54963 



PCT/US98/11422 



420 



(xi) SEQUENCE. DESCRIPTION: SEQ ID NO: 169: 

^ GGCACGAGAG AAAAGAGC3TT GAGAATGTTT TCTAGCAGQC AGAATGTGCA TACATOmT 60 

CATGARTGTC CTTTGGGTGC TGTTrCTTTT AAATCCTCTG TGCACAGGGC TCTGGCCTTT 120 

ARTAAACTGT TTOTCTGTCT TACGTCATGC TGACTGC3GTG CTAGGGGCTG ATTACAAAGG 180 

10 GGAAGAGTTG AACAGACATC AGGGGCCGAT GAAACCAAAG GACTAGGAGT CAGGAGAACA 2^0 

AGTCAGGGAT TAGGAGACAG CGGITTGGTT TATTGTTATC CAGCTGGAGG ACTCCTAGGG 300 

GCAGCAGCAG GAGGAATACC AGGGCCACGG AGGGGCAGGA GTCTCACAGT GGAGGGCAGA 360 

CTCTAACAGA TGCCAGCTGA ACGCTCGCTG GCCCTGGATG TCATACGAGT TGGGGACCAG 420 

AAATCTGGGC TCAGAGAAOC CGTCCAGGGA GATTTGAAGC CATGGGTTAT CTTCTAGAGT 480 

20 TGATACTGAT AATATATTTT AATTTTTATT GATGTTTAAT ACCTTCTGAA ACAGGAGGOT 540 

AAGATCAGAT GGGAAGCCCY TCTGTTGAAG GATCTTGGGA ACCTTGGTGG T rri TlTl T r 600 

TTGGTTTTTT TTTTTTTGAT CGAGCTGTGG ACATCCTTCT TAATTCGATT OTGAGGATTT 660 

23 

GTTTAACTAA AAAGTTCCCA AACACAGAAA GGGCCTCCCC ACCTGCTTTC GGGAGCTOTC 720 

TGTSCTGGGA GTGCCAGGCA TCCSATQGGA CCCATCACTG CCAGTGTCTG TGCCTCCCAG 780 

30 AQGTCAGCCC TGTGTCTGCC CTGGCTCTQT CTCCTCTGTG ACAGGGCAGA GCATTTCTCG 840 

TCAGTTTCTC CATGGTGCCT CCCACCCCTT TGTAAAGTGG ATGGACATGA TGQAATTCAG 900 

TTGTCTCACC CTGATAGCCT GQGTGTTGAT ATTCACTTTA CCCGCACTCA GACACAGGCG 960 

ACCTTGAAGC AGTTCTCGGT GTGTAGAGTC CACGTGACAG TCCCCACAGC CTCCCCAGAT 1020 

AGCTGTGTGC CTGTGCGCTA CTGCTGTGCC ATTTTCCCAA CTTNGGCGTT TCACTAAATG 1080 

40 CAGCTGATCT CTCTCTCTGT GCACTCGTGA TCCATGrXGA ACAATACATG TAGGTTCTTT 1140 

TTCCACGCAA TGTAAGAACA TGATATACTG TACGTPGGAA AGCATTTACC TTATTTATAT 1200 

ACCTQAATGT TCCTACTACA CAAATAAACA TATATTAAAT WCTAAAAAAA AAAAAAAAAA 1260 

CTQGAQGQGG GGCCCGGTAC CCAAATOGCC GGATAGTGAT CGTAAAC 1307 



45 



50 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 170: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1624 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 
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GGCACGAGGT CGCCGCCXXX3 GCCGCCTGGA ATTGTGGGAG TTGTOTCTQC CACTCGGCTG 60 

CCGGAGGCGA AGGTCCCTGA CTATGGCTCC CCAGAGCCTG CCTTCATCTA GGATGGCTCC 120 

5 TCTGGGCATG CTGCTTGGGC TGCTGATGGC CGCCTGCTTC ACCTTCTGCC TCAGTCATCA 180 

G2VACCTGAAG GAGTTTGCCC TGACCAACCC AGAGAAGAGC AGCACCAAAG AAACPGAGAG 240 

AAAAGAAACC AAAGCCGAGG AGGAGCTGGA TGCXX3AAGTC CTGGAGGTGT TCCACCCGAC 300 

10 

GCATGAGTGG CAGGCCCTTC AGCCAGGGCA GGCTGTCCCT GCAQGATCCC ACGTACGGCT 360 

GAATCTTCA6 ACTGOGGAAA GAGAGGCAAA ACTCCAATAT GAGGACAAGT TCCGAAATAA 420 

15 TTTGAAAGGC AAAAGGCTGG ATATCAACAC CAACACCTAC ACATCTCAGG ATCTCAAGAG 480 

TGCACTGGCA AAATTCAAGG AGGGGGCAGA GATGGAGAGT TCAAAGGAAG ACAAGGCAAG 540 

GCAGGCTGAG GTAAAGCGGC TCTTCCGCCC CATTGAQGAA CTGAAGAAAG ACTTTGATGA 600 

20 

GCTGAATGTT GTCATTGAGA CTGACATGCA GATCATGGTA OGGCTGATCA ACAAGTTCAA 660 

TAGTTCCAGC TCCAGTTTGG AAGAGAAGAT TGCTGCGCTC TTTGATCTTG AATATTATGT 720 

25 CCATCAGATG GACAATGCQC AGGACCTGCT TTCCTTTGGT GGTCTTCAAG TGGTGATCAA 780 

TGQGCTGAAC AGCACAGAGC CCCTCGTGAA GGAGTATGCT GCGTTTGTQC TGGGCGCTGC 840 

CTTTTCCAGC AACCCCAAGG TCCAGGTGGA GGCCATCGAA QGGGGAGCCC TGCAGAAGCT 900 

30 

GCTGGTCATC CTGGCCACGG AGCAGCCGCT CACTGCAAAG AAGAAGGTCC TGTTTGCACT 960 

GTGCTCCXrrG CTGCGCCACT TCCXTTATGC CCAGCGQCAG TTCCTGAAGC TCGQGGQQCT 1020 

35 GCAGGTCXTTG AGGACCCTGG TGCAOGAGAA GGGCACGGAG GTGCTCGCX:G TGCGCOTGGT 1080 

CACACTGCTC TACGACCTGG TCAOGGAGAA GATGTTCGCC GAGGAGGAGG CTGAGCTGAC 1140 

CCAGGAGATG TCCCCAGAGA AGCTGCAGCA GTATCGCCAG GTACACCTCC TGCCAGGCXTT 1200 

40 

GTQGGAACAG GGCTGGTGCG AGATCACXSGC CCACCTCCTG GCGCTGCCCG AGCATGATGC 1260 

CCGTGAGAAG GTGCTGCAGA CACTGGGCGT CCTCCTGACC ACCTGCCGGG ACCGCTACCX5 1320 

45 TCAGGACCCC CAGCTCGGCA GGACACTGGC CAGCCTGCAG GCTGAGTACC AGGTGCTGGC 1380 

CAGCCTGGAG CTGCAOGATG GTGAGGACGA GGGCTACTTC CAGGAGCTGC TGGGCTCTGT 1440 

CAACAGCTT6 CTGAAGGAGC TGAGATGAGG CCCCACACCA GGACTGGACT GGGATGCCGC 1500 

50 

TAGTGAGGCT GAGGGGTGCC AGCXTTGGGTG GGCTTCTCAG GCAGGAOGAC ATCTTGGCAG 1560 

TGCTGGCTTG GCXrATTAAAT GGAAACTTGA AGGCXIAAAAA AAAAAAAAAA AAAAAAAAAA 1620 

55 AAAA 1624 

60 (2) INFORMATION FOR SEQ ID NO: 171: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2003 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: double 

(D) TOPOLOGV: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

10 GGCACGAGCC AGCTTGCAGG AGGAATCGGT GAGGTCCTGT CCTGAGGCTG CTGTCCGC3GG 60 

CCGGTGGCTG CCCTCAAGGT CCCTTCCCTA GCTGCTGCGG TTGCCATTGC TTCTTGCCTG 120 

TTCTQGCATC AGGCACCTGG ATTGAGTTGC ACAGCTTTGC TTTATCCGGG CTTC3TGTGCA 180 

15 

GGGCCCGGCT GGGCTCCCCA TCTGCACATC CTGAQGACAG AAAAAGCTQG GTCTTGCTGT 240 

GCCCTCCCAG GCTTAGTGTT CCCTCCCTCA AAGACTGACA GCCATOGTTC TGCACQGGGC 300 

20 TTTCTGCATG TGACGCCAGC TAAGCATAGT AAGAAGTCCA GCCTAGGAAG GGAAGGATTT 360 

TGGAGGTAGG TGGCTTTGGT GACACACTCA CTTCTTTCTC AGCCTCCAGG ACACTATGGC 420 

CTGTTTTAAG AGACATCTTA TTTTTCTAAA GGTGAATTCT CAGATGATAG GTGAACCTGA 480 

25 

GTTGCAGATA TACCAACTTC TGCTTCTATT TCTTAAATGA CAAAGATTAC CTAGCTAAGA 540 

AACTTCCTAG GGAACTAGGG AACCTATGTC TTCCCTCAGT GTGGTTTCCT GAAGCCAGTG 600 

30 ATATQGGGGT TAGGATAGGA AGAACTTTCT CGGTAATGAT AAGGAGAATC TCTTGTTTCC 660 

TCCCACCTGT GTTGTAAAGA TAAACTGAOS ATATACAGGC ACATTATGTA AACATACACA 720 

CGCAATGAAA CCGAAGCTTG GCGGCCTGGG CGTGGTCTTG CAAAATGCTT CCAAAGCCAC 780 

35 

CTTAGCCTGT TCTATTCAGC GGCAACCCCA AAGCACCTGT TAAGACTCCT GACCCCCAAG 840 

TGGCATGCAG COCCCATGCC CACCQGGACC TGGTCAGCAC AGATCTTGAT GACTTCCCTT 900 

40 TCTAGGGCAG ACTGGGAGGG TATCCAGGAA TCGGCCCCTG CCCCACGGGC GTTTTCATGC 960 

TGTACAGTGA CCTAAAGTTG GTAAGATGTC ATAATGGACC AGTCCATGTG ATTTCAGTAT 1020 

ATACAACTCC ACCAGACCCC TCCAACCCAT ATAACACCCC ACCCCTGTTC GCTTCCTGTA 1080 

45 

TGGTGATATC ATATGTAACA TTTACTCCTG 'TrTCTGCTGA TTGTTTTTTT AATGTTTTGG 1140 

TTTGTTTTTG ACATCAGCTG TAATCATTCC TGTGCTGTGT TTTTTATTAC CCTTGGTAGG 1200 

50 TATTAGACTT GCACTTTTTT AAAAAAAGGT TTCTGCATCG TGGAAGCATT TGACCCAGAG 1260 

TGGAACGCGT GGCCTATGCA GGTGGATTCC TTCAGGTCTT TqCTTTGGTT CTTTGAGCAT 1320 

CTTTGCTTTC ATTCGTCTCC CX3TCTTTGGT TCTCCAGTTC AAATTATTQC AAAGTAAAGG 13B0 

55 

ATCTTTGAGT AGGTTCGGTC TGAAAQCSTCT GGCCTTTATA TTTGATCCAC ACACGTTQGT 1440 

CTTTTAACCG TGCTGAGCAG AAAACAAAAC AGGTTAAGAA GAGCCGGGTG GCAGCTGACA 1500 

60 GAGGAAGCCG CTCAAATACC TTCACAATAA ATAGTGGCAA TATATATATA GTTTAAGAAG 1560 
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GCTCTCCATT TCGCATCGTT TAATTTATAT 
AITTTCATCC TGCAAGCAAC TCAAAATATT 
AAATCTTTGC TTGATAAGTA TTAAGAAATA 
TTGATTTTGT TTCCGTTTGG ATTTTTGGGG 
GAAGTCTGAG ACCTTCCGGT QCTGGGAACA 
CTGCGCATGT CTCTGATGCT TTGTATCATT 
AACAGTATTA TCAAAGAGAA AAAAAAAAAA 
TTCGCCCTAT AGTGAGCCNA TTC 



423 

GTTATGTTCT AAGCACAGCT CTCTTCTOCT 
TAAAATAAAG TTTACATTGT AGTTATTTTC 
TTGGACTTGC TCCCGTAATT TAAAGCTCTG 
GAGGGGftGCA CTGTGTTTAT GCTGGAATAT 
CACAAGAGTT GTTGAAAGTT GACAAGCAGA 
CTTGAGCAAT CGCTCGGTCC GTGGACAATA 
AAAAAACTCG MGGGGGGGCC CGGTACCCAA 



(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 786 base pairs ^ 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 
GGCACAGCGG CACGAGAAGA CTTTGGTGTT TAAGAGATTA ATGTGTTAGC CAGAACAACT 
CATTTCTCTA CCMGTGTGTA GTCCATTTAT CTTTAAAGAT TTTCTATTGG AATAATTTTG 
AAATTACTTT CTTAGTTTTC TTCATTAAAA ACTAAGAAAA TGCTTTGTTT ATTATGAATT 
GCTATTTCTC TTGATTATTA TTCTT>3GAGA AAGTCTATCA GACGTAATTC TTCTGATTTG 
CTTCTAGGCT AGAGGAAAAT GTGAAAGATG ACAAATGAAA ATTTCAAAGG TTGTCAGTAG 
TATGACTTCT TTTATCGTTT GTCATTATCA CAAATATATC AACATAGGAC TTTTAAAAGA 
TATTTTGTAC ATATTGGGCC TTAGTAGGAT TTTGCATGAA TTr mTITl ' CTTTTATGCC 
CAGAGAGAAA GAGCAAAGAA ATAACCAAGG GTGATGTACT CGTATTGAAG GTTTACCAAA 
TAAGGACTGC TTTTATTATG AACTATAGTC TATATTCTAA GTAAATCAAT TTTTCTATTA 
TGTGTTTTTT GTrCXTTGCAG GCAAGATCTC TGAACTTTAT QCAGAGQGTT CTTTTAAAAA 
AACAAAGTTG AATTTTTTTA TTTCTTGGAA TATTTTTTTT CATTGATTTC TCCCAAGTAG 
AGCAGATTCA AATCTCCTTT GTADCCTATG TCTTTTTTGT TTTGCTATTA GCTCAGTATT 
CCGTTTCTAC ATTTTCCTTT CCTAGAACCA GTCAATAAAT GACAAAAAAA AAAAAAAAAA 
ACTCGA 



wo 98/54963 



PCTAJS98/11422 



424 



(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 1758 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEX2NESS : double 

(D) TOPOLOGY: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 



GGGACGAGCC CTGCCCACCT CCTGCAGCCT CCTGCGCCCC GCCGAGCTGG CGGATGGAGC 60 



TQCGCACGGG GAGCGTGGGC AGCCAGGCGG TGGCGCGGAG GATGGATGGG GACAGCCGAG 120 
ATGGCGGCGG CGGCAAGGAC GCCACCGGCT CGGAGGACTA CGAGAACCTG CCGACTAGCG 180 



CCTCCGTGTC CACCCACATG ACAGCAQGAG CGATGGCCGG GATCCTGGAG CACTCGGTCA 240 



20 TGTACCCGGT GGACTCGGTG AAGACACGAA TGCAGAGTTT GAGTCCAGAT CCCAAAGCCC 300 



AGTACACAAG TATCTACGGA GCCCTCAAGA AAATCATGCG GACCGAAGCT TCTGGAGGCC 360 



CTTGCGAGGC GTCAACGTCA TGATCATGGG TGCAGGGCCR GCCCATGCCA TGTATTTTGC 420 

25 

CTGCTATGAA AACATGAAAA GGACTTTAAA TGACGTTTTC CACCACCAAG GAAACAQCCA 480 



CCTAGCCAAC GGTATTTTGA AAGCGTTTGT CTGGAGTTAG AAAGTTCTCT TCTTCAACAC 540 
30 GTCCCTCCCC AGGGTGTTCC TCCCTGTGAC CCAGCCGCCT CX3ACTTCGGC CCGCTTGCTC 600 



ACGAATAAAG AACTCAGAGT TGTGTGTGCA ATGCACACCC AGACACACGC ACGCACACAC 660 



ACGCGCGCGC ACACACATGC TTTTTTCTGT TCCCCTCCGC TTTCTGAAGC CTGGGGAGAA 720 

35 

ATCAGTGACA GAGGTGTTTT GGTTTTATTG TTATGTGGGT TTTCTTTTGT ATTTTTTTTG 780 



TTTGTTTTGT TTTTAAACAT TCAAAAGCAA TTAATGATCA GACATAGGAG AAACCCTGAA 840 



40 TAGAAACAAA ACTTTTGAAT GCTGGATTCA AAAAAAAAAA AAAGTTATCT GGACAGCTTC 900 



TTTGAGACTA TTTAAAAACT QGTACAACAG GTCTCTACAA CGCCAAGATC TAACTAAGCT 960 

TTAAAAQGTC AAGAAGTTTT ATGGCTGACA AAGGACTCGC GCAACGCAGA AGGCCTTTCC 1020 

45 

CACdTAAGC TTCCGGQGAT CTGQGAATTT TACCCCCATT CTCTTCTGTT TGTCTGAGTC 1080 



TCATCTCTCT GCAAGCAAGG GCTGAAATCA TTTTGTTTGG TTGTTTTGAG GGAGAGAGGC 1140 



50 GGGGTGGGGG GGTGCAAATC TGCCAGCAGC TCTTACGTAA GGCATGTTTT ATTGGGGAGG 1200 



GCTGAGCTTT TATTTTCTCC TCTOCAGTQG QGTTGGCTTT TATTGTTTCT TGTTTGQGTT 1260 



TGGAATGGAA ATATGGATAG CAGCATAAAG TACTTTTATT TTGACAAAAT TCATTTTTTT 1320 

55 

CAACAATGGA GACATAGATT TGACCCACAA TAACTTCTCC CCCTCTCTTT TTACTCTGCT 1380 



CAAAAAGCAT CTCTCCTCCC ATTACCCAAC CTTGGTCATA AGTGTGCCTG GCTGGTTEGC 1440 



60 AGATATTTGT TCTGCTTTGT AAAAATTGGC CATEAGTGCA TTTATTGAGA TGATCTCTAA 1500 
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AGAGCTATC3C CCTGACCTAC CCCTGATTCT ATGACATTGG GGCCCTTCTT TTGCTGAAAC X560 

TCCCTTACGT AATGGTTTTA CTCCTTGAAA GAGATTTGAC GGAATCCATT TTATGCCAAG 1620 

5 

TGCTGCCCTG CACTGTTTCT GCAATATGTG GTGTATQCTG TOGTC3ATCTT GCTGGGAATG 1680 

ATTATAACTTG TGTGTGTQGT QGGGGAGTGG GTATTACATG CATTGCTGAA GAGTCAAAAA 1740 

10 AAAAAAAAAA AAACTCX^ 1758 

15 {2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 

25 CTGTTAGAAT GCCCAGTTTA CCTGGATGGC AACCCAACAG TGCTCCTGCC CACCTGCCCC 60 

TCAATCCTCC TAGAATTCAG CCCCCAATTG CCCAGTTACC AATAAAAACT TGTACACCAG 120 

CCCCAGGGAC AGTCTCAAAT GCAAATCCAC AGAGTGASMC ACCACCTCGG GTAGAATTTG 180 

30 

ATGACAACAA TCCCTTTAGT GAAAGTTTTC AAGAACGGGA ACGTAAGGAA CGTTTACGAG 240 

AACAGCAAGA GAGACAACGG ATCCAACTCA TGCAQGAGGT AGATAGACAA AGAGCTTTGC 300 

35 AGCAGAGGAT GGAAATGGAG CAGCATGGTA TGGTGGGCTC TGAGATAAGT AGTAGTAGGA 360 

CATCTGTGTC CCAGATTCCC TTCTACAGTT COGACTTACC TTGTGATTTT ATGCAACCTC 420 

TAGGACCCCT TCAGCAGrtCT CCACAACACC AACAGCAAAT GGGGCAGGTT TTACAGCAGC 480 

40 

AGAATATACA ACAAGGATCA ATTAATTCAC CCTCCACCCA AACTTTCATG CAGACTAATG 540 

AGCGAGGCAG GTAGGCCCTC CTTCATTTGT TCCTGATTCA CCATCAATCC CTGTTGGAAG 600 

45 CCCAAATTTT TCTTCTGTGA AGCAGGGACA TGGAAATCTT TCTGGGACCA GCTTCCAGCA 660 

GTCCCCAGTG AGGCCTTCTT TTACACCTGC TTTACCAGCA GCACCTCCAG TAGCTAATAG 720 

CAGTCTCCCA TGTGGCCAAG ATTCTACTAT AACCCATGGA CACAGTTATC CGGGATCAAC 780 

CCAATCGCTC ATTCAGTTGT ATTCTGATAT AATCCCAGAG GAAAAAGOGN AAAAAAAARA 840 

AMAARAAARA ARAAAGGAGA TGATGATGCA GAATTCCACC AAGGCTCC 888 



50 



55 



60 



(2) INFORMATIC»J FOR SEQ ID NO: 175: 
(i) SEQUENCE CHARACTERISTICS: 
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<A) LENGTH: 2379 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: cJouble 

(D) TOPOLjOGY: linear 

5 

(xi) SEQUENCE DESCBXVTIW: SEQ ID NO: 175: 

GC3CAGAGCTA GTGTCSGACTC CATCCCCCTG GAGTC5GGATC ACGNCTATGA CCTCAGTCCX; 60 

10 GACCTGGACT CTGCAATGTC CAGAGCTCTG CCCTCTGAGG ATGAAGAAGG TCAGGAOXSAC 120 

AAAGATTTCT ACCTCCGGGG AGCTGTTGSC TTATCAGGGG ACCACAGTGC CCTAGACTCA 180 

CAGATCCGAC AACTGGGCAA AGCCTGGATG ATAGCCGCTT TCAGATACAG CAAACCGAAA 240 

1 3 

ATATCATTCG CftGCAAAACT CCCACGGGGC CX3GAGCTAGA CACCAGCTAC AAAGGCTACA 300 

TGAAACTGCT QGGCGAATGC AGTAGCflGTA TAGACTCCGT GAAGAGACTG GAGCACAAAC 360 

20 TGAAGGAGGA AGAGGAGAGC CPTCCTCGCT TTGTTAACCT GCATAGTACC GAAACCCAAA 420 

CGGCTGGTGT GATTGACXX3A TGGGAGCTTC TCCAGGCCCA GQCATTGAGC AAGGAGTTGA 480 

GGATGAAGCA GAACCTCCAG AAGTGGCAGC AGTTTAACTC AGACTTGAAC AGCATCK3GG 540 

CCTGGCTGGG GGACACGGAG GAGGAGTTGG AACAGCTCCA GCGTCTGGAA CTCAGCACTC 600 

ACATCCAGAC CATCGAQCTC CAGATCAAAA AGCTCAAGGA GCTCCAGAAA GCTCTGGACC 660 

30 ACCGCAAAGC CATCATCTTC TCCATCAATC TCTGCAGCCC TGAGTTCACC CAGGCTCACA 720 

GCAAGGAGAG CCGGQACCTG CAGGATCGCT TGTSGCAGAT GAATGGGCGC TGGGACCGAG 780 

TGTQCTCTCT GCTGGAQGAG TGGCGGGGCC TGCTGCAGGA TGCCCTGATG CAGTGCCAGG 840 

GTTTCCATGA AATGAGCCAT GGTTTGCTTC TTATGCTGGA GAACATTGAC AGAAGGAAAA 900 

ATGAAATTGT CCCTATTGAT TCTAACCTTG ATGCAGAGAT ACTTCAGGAC CATCACAAAC 960 

40 AGCTTATGCA AATAAAQCAT GAGCTGTTGG AATCCCAACT CAGAGTAGCC TCTTTGCAAB 1020 

ACATGTCTTG CCAACTACTG GTQAATGCTG AACSGAACAGA CTGriTAGAA GCCAAAGAAA 1080 

AAGTCCATGT TATTGGAAAT CX3GCTCAAAC TTCTCTTGAA GGAQGTCAGT CGTCATATCA 1140 

45 

AGGAACTGGA GAAGTTATTA GACCJrGTCAA GTAGTCAGCA GGATTTGTCT TCCTGGTCTT 1200 

CTGCTGATGA ACTGGACACX: TCAGGGTCTG TCAGTCCXrAY ATCAGGAAGG AGCACCCCAA 1260 

50 ACAGACAGAA AACGCCACGA GGCAAGTGTA GTCTCTCACA GCCTGGACCC TCTGTCAGCA 1320 

GTCCACATAG CAGGTCCACA AAAGGTGGCT CCGATTCCTC CCTTTCTGAG CCARGGCCAG 1380 

GTCGGTCCGG CCGCGGCTTC CTGTTCAGAG TCCTCCX3AGC AGCTCTTCOC CTTCAGCTTC 1440 

TCCTGCTCCT CCTCATCGQG CTTGCCTQCC TTGTACCAAT GTCAGAQGAA GACTACAGCT 1500 

GTGCCCTCTC CAACAACTTT GCCOSGTCAT TCCACCXX»T GCTCAGATAC ACGAATGG(X 1560 

60 CTCCTCCACT CTGAACTAAG CAGATGCCAT CTGCAGAAGT GCTGGTAGCA TAAGGAGGAT 1620 
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CGQGTCATAA GCAATCCCAA ACTACCAACA AGAGGACCTT GATCTTGGCG AAAGCCMTCG. 1680 

GTOTGGCAGC TTTAGCCTCC TCCAGATCAC ATGTGTGCAA ATTATGGCTT CAGAGGTQGA 1740 

5 

AGATAAACAG TGAC3GGGGGA ACAAACAGAC AACAAGAAGG TTTGGAAGAA ATCTGGTTTG 1800 

AGACTCTGAA CCTTAGCACT AAGGAGATTG AGTAAGGACC TCCAAAGTTC CCCGGACTCA 1860 

10 TGAATTCTGG GCCCTTGGCC NATTCTGTGC ACAGCCAAGG ACTTCAGTAG ACCATCTGGG 1920 

CAGCTTTCCC ATGGTGCTGC TCCAACCATC AGATAAATGA CX:CTCCCAAG CACCATGTCA 1980 

GTGTCGTACA ATCTACCAAC CAACCAGTGC TGAAGAGATT TTAGAACXTTT GTAACATACA 2040 

15 

ATTTTTAAGA GCTTATATGG CAGCTTCCTT TTTACCTTGT TTTCCTTTGG GGCATGATGT 2100 

TTTAiCCTTT GCTTTAGAAG CACAAGCTGT AAATCTAAAA QGCACTTTTT TTTAGAC3GTA 2160 

20 TAAAGAAAAA CTAGATGTAA TAAATAAGAT CATGGAAGGC TTTATGTGAA AAAAGTTGAA 2220 

TGTTATAGTA AAAAAAAAAG ATATTTATGT ATGTACAGTT TGCTAAAGCC AAGTTTTGTT 2280 

TGTATTGATT TCTTTGCATT TATTATAGAT ATTATAAAAT AAAAAAAAAA AAAAAAAAAG 2340 

25 

TCGAGGGGGG GCCCX3GTACC CAATTCGCCC TATAGTGAG 2379 



30 

(2) INFORMATION FOR SEQ ID NO: 176: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1348 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY:, linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 

40 

GCGCCTTCAC GATGCCGGCG GTCAGTGGTC CAGGTCCCTT ATTCTGCCTT CTCCTCCTGC 60 

TCCTGGACCC CCACAGCCCT GAGACGQGGT GTCCTCCTCT ACGCAGGTTT . GAGTACAAGC 120 

45 TCAGCTTCAA AGGCCCAAGG CTGGCATTGC CTGGGGCTGG AATACCCTTC TGGAGCCATC 180 

ATGGAGGTOA GGGGCAGGGG TGGGGACCGC TATGCCCAGG GTCCCTCAAA GTGCTGGAGG 240 

QGCTGTRACT TGGTGGGGAG TQGGTCTGTC ACAGCCATCC TCTGTCCAGG GTCX3GGCAAG 300 

50 

GCCTQGGACA GTGCCAGGCA CCCCAGGACC CCTTCCAQQC TTGTCTCCTG CTCCACCGCC 360 

TCAACACCCC CCAOCCCTGC CCAAGCTGTT TCTCCTCTGC CTCTCTNNTT CCCTGCOCCA 420 

55 GGACTTCTCT CTTCTCCTCT GCCTCTCCTT GGACCCCTGC CCTTCCTCTA CCTCTGACCT 480 

GTGAACACAC AGACACATGC TCACACACTA AGTCCCARGC ACACMSAAAG GCAATGTGGA 540 

CCAGCACAAA CCTCCACTCT CCCGGCTCCA TCCCARCGGG CCTGTGQCTG GCCATGAAAA 600 

60 
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CTGGGGGCTA CCTGGAGQGA AGCATCCTCA TCCCAGGTGA GTGCSGCACCA GCCCTTCCCT 660 

GTATGTGTGT TGTQGGTGGA AGCAGGCATG AGAGCATCTT AGCCCATAGG TTTGTATTCA 720 

GGGACTTCCA AACCCAGACC TACAAAGAGT GTGTCTTCTA CCAGATCTTG TTCAAAAAAG 780 

GGTTTGTGAT GATGGAACTA CACGATAGAG QGAGTGAGCA AGAACAATGA GGATTAGAGT 840 

GGAGCGTGAA ATAGTCTAGG AGCATGGCTT CCAAAACATA TGCTGTGAGG TCTGTCCACC 900 

TGAGAGTTGG GCXMGGATT TAATTCTGAG CCTCTTAGCA QGCAAAGCAA AGACAGAAAG 960 

CAGATCGGCT GTCGATTTCT GTCTATAAAA TGTCAGTTCT TGGCCGGGTG CQGTGGCTCA 1020 

15 CGCCTGTAAT CCCGGCGCTT TGGGAGGCCA GGGCGGATGG GTCGCGAGGT CAGGAGGTTG 1080 

GAAACCATCC TGGCCGGAAT GGTGAAGCCC TGACTCTACT AGAAGTGCAA AGATTGGCTG 1140 

GGTGTGGTGG CGTGCGCCTG TGGTCCCAGC TTCTCGGGAG GCTGAGGCGG GAGAGTTGCT 1200 

20 

TQGGCCTGGG AGGCCGAGGT TGCGGTGAGC TGAGATCCTG CCATTGCACT TCAGCCTGGG 1260 

CACAGAGCCA GACTCTGGCT CAAAAAAAAA AAAAAAAAAA ACTCGAGGGG GGCCCGTACC 1320 

25 CAATTCGCCG NATATGATC6 TAAACAAT 1348 

30 (2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1502 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEIX^S: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 

40 CTCAAAATAA ATAAATAAAT AAAAATTTGT ATTCCATTGA TTTGGGTAGA CACCAGGAAT 60 

GTGCATTTCT AACAAGCTTT CCAQGCGATC CTATAGTAAG TCATCTGTGG ACTACTTTAA 120 

GAAACTCTTC TATAGAGAAT GGAGTTGGAT TAATAATAGG TGATTTTTTA CACTGGACTC 180 

45 

ATTCACAAGA ACCTAAACAG TAGTCCATGA AGCTGCTCAT CTGTGGTAAC TATTTGGCCC 240 

CGTCTCACTC TGAAAGCAGC AGGAGATGTT GTTTACTTTG TTTCTATCCC CTTTGTCTGG 300 

50 AGATTAATTT TQGAATGAAA GTTTTTCTCT CTATGCCATT CCTGGTTCTT TTCCAAAGCC 360 

TCATACAAGA GGATTAGGTC ACAATGCATG CATTACCTTT TAAAAGAATG CGATATTGAT 420 

ACCGATGCTT ACTTTTTTTT TTTTTNACTA CTTGTTTTAT TCCTTCCAGN AAAGTATAGC 480 

CCGCCTTTCT ATAGCATAGT TCTCTTTAGG TQGAATGATT CCTATAAGAT TTCTCATTAT 540 

TAAATCATGC ATTTTTCAAG ATGGAATCAA TMTTTGATTT AATCTAAGCT GATATTCICA 600 

60 TTTGTTAGAA GAACAACCTA CATGCTAGAG AGAGAGGAGG AAATATACCC ACGACCACAC 660 
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AGCCAGTTAG TATCCAGTTG GTGCTGGACT CCAGCCAGGT GTCCTGCXTTC ATGGTAGTTA 720 

AATX5ATATAT AGAAAAGGTA AATTTTTAAA GAAATATTTA TTAATATATT CXTATAAAAC 780 

5 

ATTTTAAAGG TAACCACATA AAAATGGTTA ATTTTTCCAT TCCAAAGTAA ATGCTAAGCA 840 

TOTTTATTAA TGAAGCAGTA CTTCTGATTA CTATATGACA TTCTGAAGTT AATTAAACTC 900 

10 ATTGCACTAA ATGTGTCTTC CrTGGTATAG TGGAGGATTT GAGGATTGGA ATATAGAGTA 960 

GAGTGCTTGC TTAAGCCTGG GAGCCCATCT TTATAGCTAT TTGATGTAAG AAAAGAGACA 1020 

TCGNCCATTT CTAAACTATA TAAGGTGAGT GTGTCTATTC CCAGCAGATA TAAAGGAAAA 1080 

15 

AGGAAACTTT TTTGATTCCC ACCTTCCCAG CCTCACCTAG CCATCTTCCA GCCTCAAATA 1140 

TAGAGATCTT AGTGCAAGGT CCTGGGCTCT AGGTGATCAT TTCATAAGTC CTTTACAGAT 1200 

20 AAAGAAAAAG TAGTGTTTGT ATGTTTGTTT TTAAGTAACC CCAAAACAAA TTTATATTGT 1260 

ATTCAGCAAA ATTGGAATTC AGGTGTTTAA TTTTAGAACA TGAAGTGCCT GCTGfTTTTAA 1320 

GCATTCACTT GTATAAAAAG AATTGCATGT CTCCAGTAAG CTTATGGGTT TTCTCATTTT 1380 

25 

TAGGTATATG GCTTTTAATC ATGTAAAGTG AAACATTAGT TTTCTTGCAT TTTATTACAG 1440 

GTTCTTTGTT GCAATAAAGA TGCTGCTGAA ATTAATTGAA AAAAAAAAAA AAAAAAACTC 1500 

30 GA 1502 

35 (2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1637 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEI»3ESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 

45 ATTTTCTAGC CCACAAGGAC TGAAGTTCAG ATCCAAAAGT TCACTTGCTA ATTATCTTCA 60 

CAAAAATGGA GAGACTTCTC TTAAGCCAGA AGATTTTGAT TTTACTGTAC TTTCTAAAAG 120 

GGGTATCAAG TCAAGATATA AAGACIGCAG CATGGCAGCC CTGACATCCC ATCTACAAAA 180 

50 

CCAAAGTAAC AATTCAAACT GGAACCTCAG GACCCGAAGC AAGTQCAAAA AGGATGTGTT 240 

TATGOCGCCA AGTAGTAGTT CAGAGTTGCA GGAGAGCAGA GGACTCTCTA ACTTTACTTC 300 

55 CACTCATTTG CTTTTGAAAG AAGATGAGGG TGTTGATGAT GTTAACTTCA GAAAGGTTAG 360 

AAAGCCCAAA GGAAAGGTGA CTATTTTGAA AGGAATCCCA ATTAAGAAAA CTAAAAAAGG 420 

ATGTAGGAAG AGCTGTTCAG GTTTTGTTCM AAGTGATAGC AAAAGAGAAT CTGTGTGTAA 480 

60 
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TAAAGCAGAT GCTGAAAGTG AACCTGTTGC ACAAAAAAGT CAGCTTGATA GAACTGTCTG 540 

CATTTCTGAT GCTGGAGCAT GTGGTGAGAC CCTCAGTGTG ACCAGTGAAG AAAACAGCCT 600 

TGTAAAAAAA AAAGAAAGAT CATTGAGTTC AGGATCAAAT TTTTGTTCTG AACAAAAAAC 660 

TTCTCGCATC ATAAACAAAT TTrGTTCAGC CAAAGACTCA GAACACAACG AGAAGTATGA 720 

GGATACCTTT TTAGAATCTG AAGAAATCGG AACAAAAGTA GAAGTTGTGG AAAGGAAAGA 780 

ACATTTCCAT ACTGACATTT TAAAACGTGG CTCTGAAATG GACAACAACT GCTCACCAAC 840 

CAGGAAAGAC TTCACTGAAG ATACXIATCCC ACGGAACACA GATAGAAAGA AGGAAAACAA 900 

GCCTGTATTT TTGCAGCAAA TATAACAAAG AAGCTCTTAG CCCCCCACGA CGTAAAGCOT 960 

TTAAGAAATC GACACCTCCT CGGTCACCTT TTAATCTCGT TCAAGAAACA CTTTTTCATG 1020 

ATCCATGGAA GCTTCTCATC GCTACTATAT TTCTCAATCG GACCTCAGGC AAAATGGCAA 1080 

TACCTGTGCT TTQGAAGTTT CTGGAGAAGT ATCCTTCAGC TGAGGTAGCA AGAACCGCAG 1140 

ACTGGAGAGA TGTGTCAGAA CTTCTTAAAC CTCTTGGTCT CTACGATCTT CGGGCAAAAA 1200 

CCATTGTCAA GTTCTCAGAT GAATACCTGA CAAAGCAGTG GAAGTATCCA ATTGAGCTTC 1260 

ATCGGATTGG TGCACCCTGA AGACCAGAAA TTAAATAAAT ATCATGACTG GCnTQGGAA 1320 

AATCATGAAA AATTAAGTCT ATCTTAAACT CTGCAGCTTT CAAGCTCATC axnTATGCAT 1380 

AGCTTTGCAC TTCAAAAAAG CTTAATTAAG TACAACCAAC CACCTTTCCA GCCATAGAGA 1440 

TTTTAATTAG CCCAACTAGA AGCCTAGTGT GTGTGCTTTC TTAATGTGTG TGCCAATGGT 1500 

GGATCnrcC TACTCAATGT GTTTGAACAT GTTTTGAGAT TTTTTTAAAA TAAATTATTA 1560 

TTTGACAACA ATCCAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1620 

AAAAAAAAAA AAAAAAA ^637 



45 



50 



55 
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(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2911 base pairs 

(B) TYPE: nucleic acid . 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 
GGTQGTTTTT GTTCTGCAAT AOGCGGCITA GAGGGAGGQG CTTTTTCGCC TATACCTACT 
CTAGCTTCTC. CACGTATGGA CCCTAAftGGC TACTGCTGCT ACTACGQGGC TAGACAGTTA 
CT G TCTCAGC TCTAGGATGT GCGTTCTTCC ACTAGAAGCT CTTCTGAGGG AGGTAATTAA 
AAAACAGTGG AATCGAAAAA CAGTGCTGTA GTCATCCTGT AATATGCTCC TTGTCAACAA 



60 
120 
180 
240 
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TCTATACATT CCTGCTAGGT GCCATATTCA TTGCTTTAAG CTCAAGTCGC ATCTTACTAG 300 

TCAACTATTC TQCCAATGAA GAAAACAAGT ATGATTATCT TCCAACTACT GTGAATGTGT 360 

5 

C5CTCAGAACT GGTGAAGCTA GTTTTCTGTG TGCTTGTGTC ATTCTGTGTT ATAARGAAAG 420 

ATCATCAAAG TAGAAATTTG AAATATGCTT CCTGGAAGGA ATTCTCTGAT TTCATGAAGT 480 

10 C3GTCCATTCC TCCCTTTCTT TATTTCCTGG ATAACTTGAT TGTCTTCTAT GTCCTGTCCT 540 

ATCTTCAACC AQCXATQGCT GTTATCTTCT CAAATTTTAG CATTATAACA ACAGCTCTTC 600 

TATTCAGGAT AGTOCTGAAG ANGCGTCTAA ACTGGATCCA GrGGGCTTCC CTCCTGACTT 660 

TATTTTTGTC TATTCTGGCC TTGACTGCCX3 GGACTAAAAC TTTACAGCAC AACTTGGCAG 720 

GACGTOGATT TCATCACGAT GCCTTTTTCA GCCCTTCCAA TTCCTGCCTT CiTITCAGAA 780 

20 ATGAGTCTCC CAGAAAAGAC AATTGTACAG CAAAGGAATG GACTTTTCCT GAAGCTAAAT 840 

GGAACACCAC AGCCAGAGTT TTCAGTCACA TCCGTCTTGG CATGGGGCAT GTTCTTATTA 900 
TAGTCCAGTG TTTTATTTCT TCAATGGCTA ATATCTATAA TGAAAAGATA CTGAAGGAAG . 960 

GGAACCAGCT CACTCAARGC ATCTTCATAC AGAACAGCAA ACTCTATTTC TTTGGCATTC 1020 

TCTTTAATGG GCTCACTCTG GQCCTTCAGA . GGAGTAACCG TGATCASATT AAGAACTGTC 1080 

30 GATTTTTTTA TGGCCACAGT GCATTTTCAG TAGCCCTTAT TTTTGTAACT GCATTCCAGG 1140 

GCTTTTCAGT GGCTTTCATT CT6AAGTTCC TGGATAACAT GTTCCATGTC TTGATGGCCC 1200 

AGGTTACCAC TGTCATTATC ACAACAGTGT CTGTCCTGGT CTTTGACTTC AGGCCCTCCC 1260 

TGGAATTTTT CTTXSGAAGCC CCATCAGTCC TTCTCTCTAT ATTTATTTAT AATGCCAGCA 1320 

AGCCTCAAGT TCCGGAATAC GCACCTAGGC AAGAAAGGAT CCGAGATCTA AGTGGCAATC 1380 

40 TTTCGGAGCG TTCCAGTCGG GATGGAGAAG AACTAGAAAG ACTTACCAAA CCCAAGAGTG 1440 

ATCAGTCAGA TGAAGATACT TTCTAACTGG -rACCCACATA GTTTGCAGCT CTCTTGAACC 1500 

TTATTTTCAC ATTTTCAGTG TTTGTAATAT TTATCTTTTC ACTTTGATAA ACCAGAAATG 1560 

TTTCTAAATC CTAATATTCT TTGCATATAT CTAGCTACTC CCTAAATGGT TCCATCCAAG 1620 

GCTTAGAGTA CCCAAAGGCT AAGAAATTCT AAAGAACTGA TACAGGAGTA ACAATATGAA 1680 

50 GAATTCATTA ATATCTCAGT ACTTGATAAA TCAGAAAGTT ATATGTGCAG ATTATTTTCC 1740 

TTOGCCTTCA AGCTTCCAAA AAACTIGTAA TAATCATGTT AGCTATAGCT TGTATATACA 1800 

CATAGAGATC AATTTGCCAA ATATTCACAA TCATCTAGTT CTAGTTTACA TGCCAAAGTC 1860 

TTCCCTTTTT AACAJTTATAA AAGCTAGGTT GTCTCTTGAA TTTTGAGGCC CTAGAGATAG 1920 

TCATTTTCCA ACTAAAGAGC AACGGGACCC TTTCTAAAAA GGTTGGTTGA AGGACCtAAA 1980 

60 TACCTGGCCA TACCATAGAT TTGGGATGAT GTAGTCTGTG CTAAATATTT TGCTGAAGAA 2040 
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GCAGTTTCTC AGACACAACA TCTCAGAATT TTAATTTTTA GAAATTCATG GGAAATTGGA 2100 

TTTTTGTAAT AATCTTTTGA TCTTTTAAAC ATTGGTTCCC TAGTCACX:AT AGTTACCACT 2160 

TGTATTTTAA GTCATTTAAA CAAGCCACGG TQGGGCTTTT TTCTCCTCAG TTTGAGGAGA 2220 

AAAATCTTGA TGTCATTACT CCTGAATTAT TACATTTTC3G AGAATAAGAG GQCATTTTAT 2280 

TTTATTAGTT ACTAATTCAA GCTGTGACTA TTGTATATCT TTCCAAGAGT TGAAATGCTG 2340 . 

GCTTCAGAAT CATACCAGAT TGTCAGTGAA GCTGATGCCT AGGAACTTTT AAAGGGATCC 2400 

TTTCAAAAQG ATCACTTAGC AAACACATGT TGACTTTTAA CTGATGTATG AATATTAATA 2460 

CTCTAAAAAT AGAAAGACCA GTAATATATA AGTCACTTTA CAGTGCTACT TCACACTTAA 2520 

AAGTGCATCG TATTTTTCAT GCTATTTTGC ATGCAGCCAG TTAACTCTCG TAGATAGAGA 2580 

ACTCAGGTGA TAGATGATAT TAAAAATTAG CAAACAAAAG TGACTTGCTC AGGGTCATGC 2640 

AGCTGGGTGA TGATAGAAGA GTGGGCTTTA ACTGGCAGGC CTGTATGTTT ACAGACTACC 2700 

ATACTGTAAA TATGAGCTTT ATGGTGTCAT TCTCAGAAAC TTATACATTT CTGCTCTCCT 2760 

TICTCCTAAG TTTCATGCAG ATGAATATAA GGTAATATAC TATTATATAA TTCATTTGTG 2820 

ATATCCACAA TAATATCACT GGCAAGAATT QGTGGAAATT TGTAATTAAA ATAATTATTA 2880 

AACCTAAAAA AAAAAAAAAA AAAAACTCGA G 2911 



35 (2) INFORMATION FOR SEQ ID NO: 180: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 

GGCACGAGCC CCAGGCCAGC CAGGGCCAGG CCTACTTTGG CCACCCTTAA ATTAGAATGT 60 

GQGGTCAGGG GTCACAGAAA AGCCATTTCT CTGACCTAGT GTTTGGCGTC CGGGAACTCT 120 

GTCCCCAACC TTCAGACCCT GGCAGTCCTC ACTGAGGCCA TTGGCCCAGA GCCCGCXMX: 180 

CCCCGARACC CCCQQGAGCC GCCTGTTQCC ACGTCCACAC CTGCCACACC CTCTGCCGQG 240 

CCCCAGCCCC TCCCAACOGG GACCGTGCTG GTCCCTGGGG GTCClGCCXr ACCTTGCCTT 300 

GGGGAGGCAT GGGCCCTCCT CCTCCCACCC TGCCGGCCGT CACTCACCTC TTGCTTCTGG 360 

TCCCCCAGGC CTAGCCCTTG GAAGGAGACA GGAGTCTAGG GAGGCTGAAG CCCACTCCCG 420 

GGGAGGCCCG TGCICCTCCA GCCCCAGQGA CAGCAAGGAA AAGAGAAGAG AGCAGAGCAT 480 
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TTCATGGCTC TAATAAAAAA AAAAAAAAAA AAAACTCGA 519 



(2) INFORMATION FOR SEQ ID NO: 181: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 968 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : doioble 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 

TCCCCTTCGG GCCGGAAAAA GCGQGGTTGG CCTGNCCATT GGTTNTCCAT GCCGCCCGCC 60 

CATGCCCCAG TACTAGCCTG CAGTCCCAAT GTAGCCCCTC CCTCYTCCMA GAGCCCYTCM 120 

20 AACCGCCCCG STCANTTCTG ATTTCAGGAG GATTTGATGA AGATGTTAAA GCGAAAGTGG 180 

AGAACCTTCT CGGGATTTCC AGCCTGGAAA AAACGGACCC TGTTAGGCAA GCACCCTGCA 240 

GCCCTCCCTG TCCCCTTCTT CCCCTCCCCT TCYCCCGCCC GTGGAGACAG CTGTTYTCAG 300 

CAGGGdCTC CGCAGGGAGG GGGCCGGCTC CTTCCCTGGC AGCAACATCC TTGCCCTTGT 360 

CACACAAGTC AGCCTCCATC TGCGCAGCTC TGTQGA1GCG CTGCTGGAGG GCAACAGGTA 420 

30 TOTCACTGGC TGGTTCAGCC CCTACCACCG CCAGCGGAAG CTCATCCACC CGGTCATGGT 480 

TCAGCACATC CAGCCCGCAG CGCTCAGCCT CCTGGCACAG TQGAGCACCC TCGTGCAGGA 540 

GCTGGAGGCT GCCCTCCAGC TGGCTTTCTA CCCGGATGCC GTGGAGGAGT GGCTGGAGGA 600 



50 



(2) INFORMATION FOR SEQ ID NO: 182: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1128 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



660 



AAACCyrCCAC CCCAGCCTGC AGCGGCTGCA ARCTCTGCTG CAGGACCTCA GCGAGGTGTC 

TGCCCCCCCG CTCCCACCCA CCAGCCCTGG CAGGGACGTT GCTCAGGACC CCTGAGGGGA 720 

40 GAGCTCATGC CAGGGGGCTC CTGCTGGAGG CTGGGGGGGC TCTGCWYTKY CWWWTGGCCT 780 

GQGCAATACG GCCCACGTOG GCGTCGTGCC CTCTGGCCCA GCAGTGTCTT GCCCACACTC 840 

ACTTCCTCAG GGCCCTGGGC AQCCCCTGGG GGAGAGACTA GAAAACACAG AAGGAAGCAG 900 

CACAGQGAGA CCCGCTTTGT GATCTGCATG TGTGACACTG ATTCTTTGGA AATAAAGAGT 960 
GGAAGCTG 



968 



60 
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(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 182: 

TCTAAAAGTT ATCAGTAATC CTAATTCTTT TCCTGGGTTT TCCTTTTGTC ACTTATTAAT 60 

aM?rTTITCA AAGGACGAAT GAATTTAGAG ATGTACTCTG GAGCAGTATC ATGTTAAACC 120 

AGGQGTATAT TAGAAAAATC ATCCTCATAA TCATTCTGGG AAGTTTTTCC TCCCCAAAAA 180 

AAGCCATCCT GATGGGTTTT CAAAACCAGA AAAAAGCTCT TAATCAGGAA CAGACCACTG 240 

GAC3TACCCAT GAGCATCTCA GGAAAACTGA GACCCTCGAG AAGCCTTGAT TTCGTGCAAC ' 300 

CCCCAAGGTT TCAGAGCCAG CAGCCCAGTG CTGTGGTTGA CAGACGTGGT TTTKTGGRGA 360 

15 AAGCAGCCAG AGGCCAGGAA TTTTCAGAGT CGTGAGTCAC GRTYTCCCAC CCAAGATTAG 420 

AGCAMAGATT AGCCATACTG AGATTTGGTA AAATCAOTCT GTCTAAGCAA TGGAGGTGTG 480 

TX3CAMACGTG CAGTGCCTGT TCACAGGGGA TGCAGGCAGA TCSYGGGTTT AGGATQGGGR 540 

AGGCCACCGC ACCCCCYTTC AYTGCTCTGC ACCTGCTCCC TCACGTGGAC ACTGTCCACA 600 

ACTGTCGCTC TCACAGGACA GTTGCCCAAG GAGCTCATAT CTTATTGGAG ATAGGGGGTC 660 

25 CTACAGGTCA CATTCATGAG CAGTGTGAGC CGGGTCACAT GGGGGTIGTCA ACCCAGCATC 720 

TGTCCAGGAG CTCCTCCTGC AGCGGCTCTG GCAGGTQGCC TGAGGCTCCT TTTTGAGAGA 780 

GAACTCTTTG GCCTTCCTGT CTCCTCTCCT CTGATCTGTT CTTTCTTGGA ACACCACCCA 840 

AGAACGTCAC CTCCTCCATC AGATTGTGAG CTCCTQGAGG GCAGGAGCTG TGTCCTTCTA 900 

TTCATCTTCC TATCCCCAGA ACCTTGCACA GATCCTGGAA TGTGGTAGGT GCTCAGTAAA 960 

35 TGTCTOTTGA ATAAATGAAT GAATGAATGA ACAAATGAAT GAATTTGCTT ACTTCAAGGC 1020 

AAAAGAACCA TGAAACTGTA TTTTGAGTTT CTATGTTATA GCAGTCAGCA AATCCTATTA 1080 

AATACTTTGT GTTTCCAAGC AAAAAAAAAA AAAAAAAAAA AAACTCGA 1128 

40 



20 



30 



45 



55 



(2) INFORMATION FOR SEQ ID NO: 183: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2276 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
50 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 

CCGCQGOGTC TCACCTCATG GCGTAGAGCC TAGCAACAGC GCAGGCTCCC AGCCGAGTCC 60 

GTTATGGCCG CTOCCGTCCC GAAGAGGATG AGGQGQCCAG CACAAGCGAA ACTGCTGCCC 120 

GGGTCQGCCA TCCAAGCCCT TGTQGGGTTG GCGCGGCOGC TOGTCTTGGC GCTCCTGCTT 180 

60 GTGTCCGCCG CTCTATCCAG TGTTGTATCA CGGACTGATT CACCGAGCCC AACCGTACTC 240 
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AACTCACATA TTTCTACCCC AAATGTGAAT GCTTTAACAC ATGAAAACCA AACCAAACCT 300 

TCTATTTCCC AAATCAGCAC CACCCTCCCT CCCACGACGA GTACCAAGAA AAGTGGAGGA 360 

5 

GCATCTOTGG TCCCTCATCC CTCGCCTACT CCTCTGTCTC AAGAGGAAGC TGATAACAAT 420 

GAAGATCCTA CTATAGAGGA GGAGGATCTT CTCATGCTGA ACAGTTCTCC ATCCACAGCC 480 

10 AAAGACACTC TAGACAATGG CGATTATGGA GAACCAGACT ATGACTGGAC CACGGGCCCX: 540 

AGGGACGACG ACGAGTCTGA TGACACCPTG GAAGAAAACA QGGGTTACAT GGAAATTGAA 600 

CAGTCACJTGA AATCTTTTAA GATGCCATCC TCAAATATAG AAGAGGAAGA CAGCCATTTC 660 

15 

TTTTTTCATC TTATTATTTT TGCTTTTTGC ATTGCTGTTG TTTACATTAC ATATCACAAC 720 

AAAAGGAAGA TTTTTCTTCT GGTTCAAAGC AGGAAATGGC GTGATCGCCT TTGTTCCAAA 780 

20 ACAGTCGAAT ACCATCGCCT AGATCAGAAT GTTAATGAGG CAATGCCTTC TTTGAAGATT 840 

ACCAATCATT ATATTTTTTA AAGCACTGTG ATTTGAATTT GCTTATTCTAA TITTATTTGC 900 

TTCACTTTTT ATATCATATT GTGCAAATGT TTGCCATAGG CAATTGCTAC TTAAATGAGA 960 

25 

GC?rGAjMCrC TCTTTTCCCT TQGTGCTTTG GAAATTAAAT GTCACAAACG AGTATATAAT 1020 

TTTTTATCTC TACTTTTAGA GCTGAOrrTA ATCAGGTGTC CAAAATGTGA GTTAAACATT 1080 

30 AOCTTATATT TACACTGTTA GrPTTTTATTG TTrTAGATTT ATTATGCTTC TTCTC3GAAGT 1140 

ATTAGTGATG CTACTTTTAA AAGATCCCAA ACTTGTAACT AAATTCTGAC ATATCTGTTA 1200 

CTCCTCACTC ACATTCATTC TCCX3CCATTC AAATACTATT TTTTATCCAC ATTTTTTTTT 1260 

35 

GTTCCCAAAC TCTAATOTAC AAGGATATGT GTGATAATC5C TTTGGATTTG AGTAATATTT . 1320 

TTTTTTCTTC CAAGAAAACT GCTTTGGATA TTTTTAGATA ATTTAAACAT AATTTAGGAT 1380 

40 AATGATATTG CTCAATCTGA CCACAATTTT AGGTAAAACA TTAAATGTGT CAGAAATCTT 1440 

GQCAACAGAG ACTCTCCAGC TTQCAGTGGA CATAGATAAA ATGTTACAGA GATACTATTT 1500 

TTTTCGTTGG AATTACTATA TTAAATTTAG AAGCAGAAAC TGGTAAAATG TTAAATACAT 1560 

45 

GTACAATTGC TTrTAGTTAG CAATTGATTG TAGCATGGGT TCCTCCAAGG TTTCAAGCAA 1620 

TGGGCAGAGT TTAAAATTAT ATCAGATTCG TTTACTTCGT TTATTATTTT ACAGTAAATT 1680 

50 TGAATAAATC TTAGGGGTCA TTAICACTTA AATAATACTG TACCTAQGTC TTTCAAATTA 1740 

AAATTATACC TCAATOAACT TGTTTGTATA CATAAAGGAT ATTTGTGTAC AATTACCTTT 1800 

TTTCCCCTAC ACTTCTTTTC Ti ' m ^ nTlTC TTTTTTATGG CAACTGGAAA GTATTTACTA 1860 

55 

TCGGATTCAT TTATC?rCrGT CPTTCTATCA TAAAGAATTG ATCAATATGT AAATATGTGA 1920 

ITTGAACCAT GGTTCACTTA CAAGTGTCAC TACAGCTTTT TAGAAAACAT AGCCCTAATA 1980 

60 TATCTTAAGC AQGACCCGGG TGAGCCAGTG QGCTTGCGCT TTATGTAGAG CTGGAAGAAG 2040 
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GCCGICCATC CTOTCTCrrG GGCGGSCAGT GTACmCCT AATAGGGAAG GGAAGCACAA 
tGGAAATACC CCTOAACCCyr TTTATIGCAG TAATTrTTTT CATATCTGAA ACTATTXTTT 



2100 
2160 



^ AATArmGAATAAGATm AAAA«^TAAATC«3^ 2220 



AAAAAAAAAA 



2276 



10 



(2) INFORMA.TI0N FOR SEQ ID NO: 184: 



J 5 (i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 2500 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

20 



(xi) SEQUENCE DESCRIPTICW: SEQ ID NO: 184: 
TCa«GCrAC GCCACTOCXSG CIGGGGCCTT GGGAGCGGGA 
25 GGCGGCGCn«AGAAGAGCGAGGCGKAGGAGGGGGa^TQGCCGGGCAGa«^^^ 
ftCGATGACAG TOSGAACACC TTCITCTACr TCCTCACCTC CTTOGnOGGG CTCATCCTGA 
TCCCOGCGAC ATACTACCIC TOXXXCGAG ATCAGAATGC CGAGCAAATT CGATTAAAGA 
ATA-TCAGAAA AGTATAIGGA AGC?rOTATCT aSTACGTTTA OGGTTATTAA AACCCC«3CC 
AAATA-rrKTr CCTACAGTAA AGAAAATAGT TCTGCPTGCA GGATGGGCAT TGTICTTATT 
35 CCTOXXaTAT AAAOmCCA AAACACa«X» AGAATACCAA GAATAC^^ 

ATTAAATTTG GATCCTGGAG CCACAOTAGC AGAAATTAAA AAACAATATC GTnGCTGTC 
ACITAAATAT CATCCAGATA AAQGAGOTGA TCAGGTTATG TTCATOAGGA TACCAAAAGC 
TrATGCTGCTTTAACGGATGAAC»GrCCCG(»AAAATIGGG^ 
TCGGCCTCAA GCCACAAGCT riGGAATT^: CCTGCCfiGCT TGGATAGTTC ACCAGAAAAA 
45 ^TICAAO^ CnrrrACTTC TATATGGArr GGCATITATG OrTATO^ 

GQGCTCriGG TGOTATCGCr CAATACGCTA TACnOGAGAC CAGATTCTAA TACGSACAAC 
ACAGATTTAT ACATAdTIG TTTATAAAAC CCGAAATATG GATATGAAAC OICITATCAT 
GSmTGGSr GGAGCircrG AATITOKICC TCAGT^^^ 
AACGGATAAT ATTCPAATAC CAO^SCTAAT CAGAGAAATT 
55 GAATCAGCCT CCACPTACCT GCCCATATAG OCTGAAGGCC AGAGTrcm TACIGICTCA 
TCTTGCTAGA ATGAAAATIC CIGAGACCCT TGAAGAAGAT CAGCAATTCA TGCTAAAAAA 
OlorCCTOCC CTACrrCAAG AAATGCTTAA TGTAATCI«C CAACTAATAG TAAOOXCCG 

60 
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GAACCX3TCAA GAAAGGGAGT TTCGTGCTCC AACTTTGGCA TCCCTiVGAAA ACTGCATGAA 1200 

GCTTTCTCAG ATGGCCGTTC AQGGACTTCA GCAATTTAAG TCTCCCCTTC TGCftGCTCCC 1260 

5 TCATATTTCAA GAGGACAATC TTAGACGGGT TTCTAATCAT AAGAAGTATA AAATTAAAAC 1320 

TATCCAGGAT TTGGTGAGTT TAAAAGAATC AGATCCTCAC ACTCTACTGC ACTTCCTTGA 1380 
AGATGAAAAA TATGAAGAGG TTATGGCTGT CCTTGGGAGT TTTCCATATG TGACCATGGA . 1440 

10 

TATAAAATCA CAGGTGTTAG ATGATGAAGA TAGCAACAAC ATCACAGTAG GATCCTTAGT 1500 

TACAGTGTTG GTTAAGTTGA CAAQGCAAAC AATGGCTGAA GTATTTGAAA AGGAGCAGTC 1560 

15 CATCTGTCCT GCAGAGGAAC AGCCAGCAGA AGATGGGCAG GGTGAAACTA ACAAGAACAG 1620 

GACAAAAGGA GGATGGCAAC AGAAGAGTAA AGGACCCAAG AAAACTGCTA AATCAAAAAA 1680 

AAAGAAACCT TTAAAAAAAA AACCTACACC TGTGCTATTA CCACAGTCAA AGCAACAGAA 1740 

20 

ACAAAAGCAG GCAAATGGAG TCGTTGGGAA TGAAGCTGCA GTAAAGGAAG ATGAAGAAGA 1800 

AGTTTCAGAT AAGGGCAGTG ATTCTGAAGA AGAAGAAACC AATAGAGATT CCCAAAGTGA 1860 

25 GAAAGATGAT GGTAGTOACA GAGACTCTGA TAGAGAGCAA GATGAAAAAC AAAACAAAGA 1920 

OGATCAAGCA GAGTGGCAAG AATTACAACA AAGCATACAG CX3AAAAGAGA GAGCTCTATT 1980 

GGAAACCAAA TCAAAAATAA CACATCCTGT GTATAGCCTT TACTTTCCTG AGGAAAAACA 2040 

30 

AGAATCGTGG TGGCTTTACA TTGCAGATAG GAAGGAGCAG ACATTAATAT CCATGCCATA 2100 

TCATGTGTGT ACGCTGAAAG ATi^ZAGAGGA GGTAGAGCTG AAGTTTCCTG CACCAGGC/^ 2160 

35 GCCTOGAAAT TATCAGTATA CTGTGTTTCT GAGATCAGAC TCCTATATQG GTTTGGATCA 2220 

GATTAAACCA TTCGAAGrTTK GGAAGTTCAT GAGGCTGAAG CCTGTGCCAG AAAATCACCC 2280 
ACAGTGGGAT ACAGCAATAG AGGGGGATGA AGACCAGGAG GACAGTGAGG GCTTTGAAGA 2340 

40 

TAGCTTTCAG GGAQGAAGAG GGAGGGAGGA AGGAAGGTGG TGGACTTAAG GCAGTTACTC 2400 
TQGAATCGGA CCCACAC3TGT TTTGCACCAT ATTTTGGCAA TTTTTTTTGC CCGTTTmiG 2460 
45 GAAGTGTTTT CCNTNAANCC CAGGAACCAT TACAGAADCG 2500 



50 (2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1337 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRAMDEDNESS: double 

(D) TOPOLOGV: linear 

(xi) SEQUENCE DESCRIPTI<»I : SEQ ID NO: 185: 
60 CTTCCGGTTC TCCQGGCAGC TGCCACTGCT GTAGCTTCTG CCACCTQCCA CGACCGGGCC 60 



wo 98/54963 



PCTAJS98/11422 



438 



TCTCCCTGGC GTTTGGTCAC CTCTGCTTCA 
GCCAGCGTGG CGGGCCTQGC GGCTCCCGGG 

5 

GCCTCGCAGT GCTGCTGCTG TCTCAGCCAC 
CTGCCTGAAC TAAGCGGGYC CCTGCSMAGTC 
10 CTTGGGCCTC CTGACCCTAG ACCACGGACA 

GCCCAGCAGC CGGGCCGTGG TCTCGCTGAA 
AATGGCAGCA ACCCTGTGGC CGGGCTTGAG 

15 

GGCrcOGTGG GTOGCGGCXT TGCTGTGAGC 
CGGGCCCTGA CCGTC3TTGAT GC3TGGTGAGC 
20 ACX3GTCAGGA TGAGAAGAAG AAACXXSAAAG 
ATAGAAAATA TGGAATTGAC ACCTTTAGAA 
TTTGATGCCA ATCATCCTCG AAGATAAGAA 

25 

CTACAATGAA GAGTGGAATT TCTAJTGTTTA 
GGGGGTATTT AAGTTACATA TATTTTAACA 
30 TATCCTTTTA TTATATCTTT ATATGTATAG 
GGGGATAAAG TATACTOTAA TAATTTATCT 
TGATCQGTTT TTCTTTCCTG CTTACCATAT 

35 

TAATGCTAAT TATTTTTGCT GATGTCATAT 

TGGTGTGTAA AAATAATTTA AAArTTCCTT 

40 AAAAGAAGCC AAATTTATTA CTTTGTGTTG 

TTATTGTTTG CAAAACAATA AATATGATTT 

CCGGGGGGG6 GCCCX3GN 
45 



TTCTCCACCXJ CGCCTATGGT CCCTCTTGGA 120 

TOGTCAGAGA GCQGTCCGGG AACGATGAAG 180 

CrCTTGGCTT CCGTCCTCCT CCTGCTGTTG 240 

CTCCTGCAGG CAGCOGAGGC CX3CGCCAGGT 300 

TTACCGCCGC TGCCACCGGG CCCTACCCCT 360 

GCTGCGQGGC CGCGGGGCTC CX3AGGGAGGC 420 

ACGGACGATC ACGGAGGGAA GGCCGGC5GAA 480 

CCCAACCCTG QCGACAAGCC CATGACCCAG 540 

GGCGCQGTGC TQGTGTACTT CGTGGTCAGG 600 

ACTAGGAGAT ATGGAGTTTT GGACACTAAC 660 

CAGGATGATG AGGATGATGA CAACACGTTG 720 

TGTGCCTTTT GATGAAAGAA CTTTATCTTT 780 

AGGAATAAGA AGCCACTATA TCAATGTTGG 840 

ACCTTTAATT TGCTGTTGCA ATAAATACCX5 900 

AAGTACTCTR TTAATGGGCT CAGAGATGTT 960 

GTTTGAAAAT TACTATAAAA CGGTGTTTTC 1020 

GATTGTAAAT TGTTTTATGT ATTAATCAGT 1080 

GTTAAAGAGC TATAAATTCC AACAACXrAAC 1140 

TACTCAAAGG TATTTCCCAT TTTTGTGGGG 1200 

GGGTTTITAA AATATTAAGA AATGTCTAAG 1260 

TAAATTCTCT TAAAAAAAAA AAAAAAAACC 1320 

1337 



(2) INFORMATION FOR SEQ ID NO: 186: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 941 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS : doxible 
55 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 

GGCACGAGCC TGGACGCAGC AGCCACCGCC GCGTCCCTCT CTCCACGAGG CTGCCGGCTT 
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AGGACCCCCA GCTCCGACAT GTCGCCCTCT GGTCGCCTGT C3TCTTCTCAC CATCGTTCGC 120 

CTCATTCTCC CCACX^GAGG ACAGACGTTG AAAGATACCA CGTCCAGTTC TTCAGCAGAC 180 

TCAACTATCA TGGACATTCA GGTCCCGACA CGAGCCCCAG ATGCAGTCTA CACAGAACTC 240 

CAGCCCACCT CTCCAACCCC AACCTGGCCT GCTGATGAAA CACCACAACC CCAGACCCAG 300 

ACCCAGCAAC TGGAAGGAAC QGATGGGCCT CTAGTGACAG ATCCAGAGAC ACACAAGAGC 360 

ACCAAAGCAG CTCATCCCAC TGATGACACC ACGACGCTCT CTGAGAGACC ATCCCCAAGC 420 

ACAGACCTCC AGACAGACCC CCAGACCCTC AAGCCATCTG GTTTTCATGA GGATGACCCC 480 

15 TTCTTCTATC ATOAACACAC CTTCCGGAAA aSGGQGCTGT TGGTCGCAGC TGTGCTGTTC 540 

ATCACAGGCA TCATCATCCT CACCAGTGGC AAGTGCAGGC AGCTGTCCCG GTTATGCCGG 600 

AATCATIGCA GGTCAGTCCA TCAGAAACAG GAGCTGACAA CCYGCTGGGC ACCCGAAGAC 660 

CAAGCCCCCT GCCAGCTCAC CGTGCCCAGC CTCCTGCATC CCCTCGAAGA GCCTGGCCAG 720 

AGAGGGAAGA CACAGATGAT GAAGCTGGAG CXyWSQGCTGC CGGTCCGAGT CTCCTACCTC 780 

25 CCCCAACCCT GCXXGCCCCT GAAGGCTACC TGGCGCCTTG QGGGCTGTCC CTCAAGTTAT 840 

CTCCTCTCYT AAGACAAAAA GTAAAGCACT GTGGTCTTTG CAAAAAAAAA AAAAAAAAAA 900 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAACTCG A 941 

30 



20 



35 



45 
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(2) INFORMATION FOR SEQ ID NO: 187: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 654 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
40 (D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 

GAATTCQQCA CGAGGCAGCT TGTGCTTTAA AGGAGGTGTT CAAAGCATGT CTGAGCAGAG 60 

ACTTTTQGGC TCTGTTTTAA TTAATACTTT AAAATAATTC ATATTTAAAA TATCARATGT 120 

TTCCATAAAG AGGAGGATOT TTAAATGCCT CCAGACTACA TTCCTTTTTA TrSCTTGftTT 180 

50 TTACCTOGGA GTCCAAAGTT CAATTCCCAT AAAGCAAGCG TTTTATTTGT CACTTTCAAT 240 

ATACATCCGA TTGCCATGCT TAAGATGCAA TATGGGCTGC GGAAATAGGT TAACCCACAG 300 

GCTCCCAGGG CCCAGTOTAG AAGGTGA6AG ATTCGTC3TAA AATGATTCAA ATAAAAGGAA 360 

GACCCTOGCC GGGTCCCGTA RCTCACGCCT GTAATCCCAG CACTTTGGGA GGCCGAAGCG 420 

A£?IGGATGAC GAGCTTAGGA GTIGGAGACC AGCCTQGCCA ACATCGTGAA ACCCCGTCTC 480 

60 TACTAAAAAT ACAAAAATTA GOCGGGCATG GTGGCAGGCA CCTGTAATCC TAGCTAGTTG 540 
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QGAGGCTCAG GCAGGAGAAT CGTTTGAATC TGGGAGTTGG AGGTTGTCAG TGAGCTGAGA 600 
TCGCGCCACA GCACTCCAGC CTGGGTGACA GGGTGAGACT CTGTCTCAAA NAGA . 654 

(2) INFORMATION FOR SEQ ID NO: 188: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1848 base pairs 

(B) TYPE: nucleic acid 

' (C) STRANDEDNESS : double 
15 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

GAAACTGGAC CGGAGAACCG GAGCGAAGCG AAGCGGAAGC CCGGAATGAG GCCGGACTGG 60 

AAAGCCGGAG CQGGGCCAGG CGGGCCTCCC CAAAAGCCTG CCCCTTCATC CCAGCGGAAA 120 

CCGCCQGCCC GGCCGAGCGC GGCGGCCGCT GCGATTGCAG TCGCGGCGGC GGAGGAAGAG 180 

25 AGACGGCTCC GGCAGCGGAA CCGCCTGAGG CTGGAGGAGG ACAAACCGGC CGTGGAGCGG 240 

TGCTK3GAGG AGCTCGTCTT CGGCGACGTC GAGAACGACG AGGACGCGTT GCTGCGGCGT 300 

CrcCGAGGCC CGAGGGTTCA AGAACATGAA GACTCGGGTG ACTCAGAAGT GGAGAATGAA 360 

GCAAAAQGTA ATTTTCCACC TCAAAAGAAG CCAGTTTGGG TGGATGAAGA AGATGAAGAT 420 

GAGGAAATGG TTCACATCAT GAACAATCGG TTTCGGAAGG ATATGATGAA AAATGCTAGT 480 

35 GAAACTAAAC TTTCGAAAGA CAACCTTAAA AAGAGACTTA AAGAAGAATT CCAACATGCC 540 

ATQGGAGGAG TACCTGCCTG GGCAGAGACT ACTAAGCGGA AAACATCTTC AGATGATGAA 600 
AGTGAAGAGG ATCAAGATCA TTTGTTGCAA AGGACTQGGA ATTTCATATC CACATCAACT " 660 

TCTCTTCCAA GAGGCATCTT GAAGATGAAG AACTGCCAGC ATGCGAATGC TGAACGTCCT 720 

ACTGTTGCTC GGATCTCCAT CTGTGCAGTT CCATCCCGGT GCACAGATTG TGATGGITGC 780 

45 TGGGATTAGA TAATCCIGTA TCACTATTTC AGGTTGATGG GAAAACAAAT CCTAAAATTC 840 

AGAGCATCTA TITCGAAAGG TTTCCAATCT TTAAGGCTTG TTTTAGTGCT AATGGGGAAG 900 

AAGTTTTAGC CACGAGTACC CACAGCAAGG TTCTTTATGT CTATGACATG CTGGCTQGAA 960 

AGTTAATTCC TGTGCATCAA GTCAGAGGTT TGAAAGAGAA GATAGTGAGG AGCTTTGAAG 1020 

TCTCCCCAGA TGGGTCCTTC TTO^rCATAA ATGGCATTGC TGGATATTTG CATTTGCTAG 1080 

55 CAATGAAGAC CAAAGAACTC ATTQGAAGCA TGAAAATTAA TGGAAGGGTT GCAGCATCCA X140 

CATTCTCTTC AGATAGTAAG AAAGTATACG CCTCTTCGGG GGATCGAGAA GTTTATGTTT 1200 

GQGATCTCAA CTCAAGGAAG TGCCTTAACA GATTTGTXXSA TGAAGGCAGT TTATATGGAT 1260 
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TAftGCATTGC CACATCTAQG AATCGACAGT ATGTTGCTTG TOGTTCTAAT TGTC3GW3IGG 1320 

TAAATATATA CAATCAAGAT TCTTGTCrCC AAGAAACAAA CCCAAAGCCA ATRAAAGCTA 1380 

5 TAATGRACTT GGTTACAGGT GTTACTTCTC TGACCTTCAA TCCTACTACA GAAATCTTGG 1440 

CAATTCCTTC AGAAAAAATG AAAGAAGCAG TCAGATTGGT TCATCTTCCT TCCTGTACAG 1500 
TATTTTCAAA CTTCCCftGTC ATTAAAAATA AGAATATTTC TCATCTTCAT ACCATGGATT . 1560 

TrrCTCCGAG AftffraGATAC TTTOCXTTTOG GGAATGAAAA QQGCAAGGCC CTGATGTATA 1620 

QGITCCACCA TTACTCAGAC TTCTAAAGAG ACTATTTGAA GTCCAGTTGA GTCACAAGAG 1680 

15 AMCCTCTCT TGATATATCA TCTCftGAAAC TTTCCTGAAT ATGTGATAAT ATATGGAAAA 1740 

TGATTTATAG ATCCAGCTCT GCTTAAGAGC CAC3TAATGTC TTAATAAACA TGTGGCMCT 1800 

TTKrtTTGAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAACTCGA 1848 

20 



25 



35 



45 



55 



(2) INFORMATION FOR SEQ ID NO: 189: 



(i) SEQDENCE CHARACTERISTICS: 

(A) LQIGTH: 1146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
30 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTIOJ: SEQ ID NO: 189: 

AAAAAAAACC CAGQGGAACN TTOQGGGCCG CTTTNNNTTC CCCCTCCAGG CCATTGGGGA 60 

ATTCTTCAAG TTAATCCTGC TTTGCTCTTG GCCAACflGGG CTTGTAGGG6 GGAGAGACCC 120 

AGGATCATCA AGGGGTICGA GTCCAAGCCT CaCTCCCAGC CCTCGCAGGC AGCCCTCTTC 180 

40 GAGAAGACGC GGCTACTCTC TCGQGCGACG CTCATCGCCC CCAGATCGCT CCTGACAGCA 240 

QCCCACTGCC TCAftSCCCCG CTACATAGTT CACCTGGGGC AGCACAACCT CCAGAAGGAG 300 

GftGGGCTOIG AGCftGAdCCG GACRGCCACT GAGTCCTTCC CCCACCCCGG CTTCAACAAC 360 

AGCCTCCCCA ACAAAGACCA CCQCAATGRC ATCATGCTGG TGAAGATGGC ATCGCCAGTC 420 

TXXaVTCACCr QGGCICTVXX3- ACCCCTOWX: CTCTCCTCAC GCK^^ «0 

50 AGCTCYCTCA TTTCCGGCnG GGGCAGMACG TCCAGCCCCC AGTTACGCCT GCCTCACACC 540 

rrCSGATQCG CCAACATCAC CATCATTGAG CACCAGAAGT GTGAGAACGC CTACCCCGGC 600 

AACATXaCAG ACACCATCOT GTOIGCCAGC GTCCAGGAAG GGGGCAAGGA CTCCTGCCAG 660 

GCTGACICCG GGGQCCCICr GGTCTGTAAC GAGTCTCnC AAGGCATTAT CTCCTGGGGC 720 

CftGGATCCCT ©roCGATCftC COGAAAGCCT GGTGTCTACA OGAAflGTCTG CAAATATOTG 780 

60 GACK3GATCC AQGftGACGAT GftJtfSAACAAT TAGACTCGAC CCAOCCACCA CAGCCCATCA 840 
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CCCTCCATTT CCACTTGGTG TTTC3GTTCCT GTTCACTCTG TTAATAAGAA ACCCTAAGCC 900 
AAGACCCTCT ACGAACATTC TTTQC3GCCTC CTGGACTACA GGAGATGCTG TCACTTAATA 960 
ATCAACCTCG GGTTCGAAAT CAGTGAGACC TGGATTCAAA TTCTGCCTTG AAATATTGTG 1020 
ACTCTC5GGAA TGACAACACC TGGTTTGTTC TCTGTTGTAT CCCCAGCOCC AAAGACAGCT 1080 
10 CCTGGCCATA TATCAAGGTT TCAATAAATA TTTQCTAAAT GAAAAARAAA AAAAAAAAAA 1140 

1146 



15 



30 



40 



50 



ACTCGA 



(2) INFORMATION FOR SEQ ID NO: 190: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 906 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 

ACTCCCTCAC CCAGC3TCCCA GCCCTGGGAA CCACCTACCG TGAC3CCCTTT TGCAGATATA 60 

GACTCATTTC ATCCTCAGAT GGTCCTTCAA GGTAGGTACT TTAGTCCCAT TTTAGAGATG 120 

AGACGATTCA GGCCAGAGGG GTGNNGTAAC TTGCCTGGGG C3CTCACGAGC ACAAAAGGAG 180 

CCGAGGCAGG ATCTCACCCT TCTTCTCTGG CCTCACTGCC CTCACTTTGC CATGACCCGA 240 

35 AOTTATCTrCC CTACAAAGCA ATCCATGGTC CAAQGYTCTT TTTATTGTAT TTTTATTTTT 300 
t 

AAGGGTCCTG TTCAAAACTG GTGTGAGCTC TGAGGAGTCC TGAACCCTGG GTGCAGCATC 360 

CTAGCATCCT GGGAGTCCTT TTCTGCCCAC ACTGAGCTGG GCTCCTCGAG GGGTGGGGCT 420 

GCTCTCCCTC GAAGCCTCGC AGCAGCACTG TATCGGGTTG GCTGAAGCTG ARCGCCGTGG 480 

GC?rGCAGGGC TCCMGGAATC CCCGTTTGGC TGAAGGGGTT CCCTGTAGCC MGQGATGTTT 540 

45 ATGAGGTCTC TCTCATGCCC CAGGCGCAGG ACATGTC3TGC GGGTGGAGAA AAGCAGGCCC 600 

TTTCAGTCCC AGCTCCACTC AATTTCTATG TGGACCAAGA ACGATAAACT TAAAAAATTT 660 

TTTTTCCTAA GC?rATCTTCA GAATATGGTG TATTTTTATG TGGAAAAGAA AAGTTATGAA 720 

GGCAGCTOTT ACTTTAAGAG AAAATTCATT AAAftGTCCTC GAGGTATGAA GATGACGGCG 780 

TCCTTCTCAA TCATTTTOGC ATAACTTGAT TGTGGCTGTA ATTTTTTTTT TTTTTTTTGr 840 

55 CAAGCATOIC AGACAATAAA GTCTTTGTAA AAAGRGAAAA AAAAAAAAAA AAAAAAAAAA 900 
ACTCGA 



60 
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(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 1941 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

CrrCAGCTGA AGCCCAGGGA CCCCTTTTCC ACCCTGGGCC CCAATQCCGT CCTTTCCCCG 60 

CAGAGACTGG TCTTCGAAAC CCTCAGCAAA CTCAGCATCC AGGACAACAA TGTGGACCTG 120 

ATTCTCGCCA CACCCCCCTT CAGCCGCCTG GAGAAGTTGT ATAC3CACTAT GGTGCGCTTC 180 

CTCAGTGACC GAAAGAACCC GGTGTGCCGG AGATGGCTGT GGTACTGCTG GCCAACCTGG 240 
20 crcAOSQGGA CAGCCTCGCA GCTCGTGCCA TTGCAGTGCA GAAGC3GCAGT ATCGC5CAACC 
TCCTGGGCTT CCTAGAGGAC AGCCTTGCCG CCACACAGTT CCAGCAGAGC CAGGCCAGCC 

TCCTCCACAT GCAGAACCCA CCCTTTGAGC CAAYTAGTGT GGACATGATG CGGCGGGCTG 420 

CCCGCGCGCT GCTTGCCTTG GCCAAGGTGG ACGAGAACCA CTCAGAGTTT ACTCTGTACG 480 

AATCACGGCT GTTGGACATC TCGGTATCAC CGTTC5ATGAA CTCAKTGGTT TCACAAGTCA 540 

30 TTKJTOATGT ACTGTTTTTG NATTQGCCAG TCATGACAGC CGTC3GGACAC CTCCCCCCCC 600 

CGTCnCTGTO TGCCTGTCTG GAGAACTTAG AAACTGACTG TTGCCCTTTA TTTATGCAAA 660 

ACCACCTCAG AATCCAC?ITT ACCCTGTGCT GTCCAGCTTC TCCCTTGGGA AAAAGTCTCT 720 

CCTC?rrTCTC TCTCCTCCTT CCACCTCCCC TCCCTCCATC ACCTCACGOC TTTCrGTTCC 780 
TTGrCCTCAC CTTACTCCCC TCAGGACCCT ACCCCACCCT CTTTGAAAAG acaaagctct 

40 gcctacatag aagacttttt ttattttaac caaagttact gttgtttaca gtgagtttgg 

ggaaaaaaaa taaaataaaa atogctttcc cagtccttgc atcaacggga tgccacattt 960 

CATAACTCTT TTTAATaSTA AAAAAAAAAA AAAAAAATAC AAAAAAAAAT TCTGAAGGAC 1020 

AAAAAAQGTC ACTGCTGAAC TGTGTGTGGT TTATTGTTGT ACATTCACAA TCTTGCAGGA 1080 

GCCAAGAAGT TCGCAGTTGT GAACAGACCC TGTTCACTGG AGAGGCCTGT GCAGTAGAGT 1140 

50 GTAGACCCTT TCATOTACTG TACIGTACAC CTGATACTGT AAACATACTG TAATAATAAT 1200 

GTCTCACATG GAAACAGAAA ACGCTGGGTC AGCAGCAAGC TGTAGTTTTT AAAAATGTTT 1260 

TTAGTTAAAC GTTGAGGAGA AAAAAAAAAA AGGCTTTTCC CCCAAAGTAT CATGTGTGAA 1320 

CCTACAACAC CCTGACCTCT TTCTCTCCTC CTTGATTGTA TGAATAACCC TGAGATCACC 1380 

TCTTAGAACT GGTTTTAACC TTTAGCTGCA GCGNCTACGT CNAWCGMTGT GTATATATAT 1440 

60 GACGTKGTAC ATOGCACATA COCTTGGATC CCCACAGTTK GGTCCTCCTC CCAGCTACCC 1500 
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CTTTATAGTA TGACGACTTA ACAAGTTGGT GACCTGCACA AAGCXSAGACA CAGCTATTTA 1560 

ATCTCTTGCC CAGATATCGC CCCTCTTGGT C3CGATGCTGT ACAGGTCTCT GTAAAAAGTC 1620 

CTTGCTC?rCT CAGCAGCCAA TCAACTTATA GTTTATTTTT TTCTGGGTTT TTGTTTTGTT 1680 

TTGTTrTCTT TCTAATCGAG GTCTCAAAAA GTrCTAGGTT CAGTTGAAGT TCTGATGAAG 1740 

10 AAACACAATT GAGATTTTTT CAGTGATAAA ATCTGCATAT TrGTATTTCA ACAATGTAGC 1800 

TAAAACTTGA TCTAAATTCC TCCTTTTTTT CCTTTTTTGG CTTAATGAAT ATCATTTATT 1860 

CAGTATCAAA. TCTTTATACT ATATGTTCCA CGTGTTAAGA ATAAATGTAC ATTAAATCTT 1920 
GGrrAAGACTT TAAAAAAAAA A 
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20 



30 



40 



50 



(2) INFORMATION FOR SEQ ID NO: 192: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2118 base pairs 
25 <B) TYPE: nucleic acid 

(C) STRANDEr»3ESS : double 
<D) TOPOLOGY: linear 



1941 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

AAATAATAAT AANAATAAAT AAAAATWAAG TGCTTAKTGT AACTCAGCGG ACAGGGCTCC 60 

CAGCTGCTCT GGCACCTGGG ACACCYTCCA CCCTGCACAC AACAGGCATG CAAAGAQGAC 120 

35 TGGATATCCT GGGGTAGAGT GCTTCTGGTG TGTTCACTTT AAGAAAACAT CTGCCAAGAG 180 

AGAAGAGTOC CCAGGAAAGA CCAGGAAAAT ACAAGTACAT GGCTGCTTCA TACCATATAC 240 

CCCAATTCTT TAAA£3CAGCA AAAGGCACTT TTTTTTTCAG GCCAGAGTGA ATCTAAAACA 300 

AACCTCGCTT TGCTTACAGG GAAGCTOTCC CAGAAGGACT GAGTGATGCC TCTTGTTCCC 360 

TAAGGTCTGG AGAGTCTTTC CAAGTTTCCA ACGACATTTC CAACCAGGTG GGAGAGACCA 420 

45 GCAGTTCACG AGACAAGTCA GACCCAAAAA ACGACGCCAA GGTAGTGAGT GGGTGCCTAT 480 

TTCGGAGTAG GATGATTTCA GGAAAACAGG AAGAAAAACC GGTCAGAAAG TGGCACTTTG 540 

GAAGIX3GAAA GCTOTTTCCA AATAGCAACT CTGGCTAAAG CGAAAATGTT AATCAAGTAG 600 

AAACTAAAAT TCAGGA-TCTT AGAAGCTCAT CCTTCTGATG AGAACTATTT TTTTTTCCGT 660 

GAAGGAACTA TTATTACrTT AAAAGTGAGG GTAATTTACA TATQGGGTGT ATATATTCTA 720 

55 AAAATACTAA TAAAAGTACC TTTTATAAGC AATGTTGTGT GGCrTGTAGA AGAAAGCAGG 780 

GAGGAAAAAA AGGCAGGCAA AACTAGTCTA GGTCTAGGCC CTAAAAATGA GCTTCCTTCC 840 
CACTTCACTG GAAACGCCCA TGTGATTTCT AGGCTGAAAA TAGGTAGGAT TTAACGAGTA 

60 
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ACCTAGTTCC CTTCTGTCTC TCATOTCTGA TCAGCTGATG GAGCTGCTAG TAAGAGGC3GC .960 
CGATCATCCT CCCAGACGAG TCCTTTCGCC TCTTGCTCTC CATCCCAAGC CTGACTCCTT 1020 



CAGCAGCAGC CCCCTCCITC TGTGTCCATC TGATGCACGC AAGCAGGAGC AGTAAGAGGG 



30 



40 



45 



(2) INFORMRTION FOR SEQ ID NO: 193: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENC3TH: 1538 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 
(D> TOPOIOGY: linear 



55 



1080 



CATCCCATCT TCCAGTTCAC CTTCTATGGG GTCACTARGA C3GTTCCCC3GT AACTAGGGCA 1140 
GCCCARGCCC AGCAGGTTCC AAAAC3CAGCT GCAAGCTTCA C5AAACCCACT TCCTCCAAGA . 1200 

CCAGC5GAGCT GC3CAGAGAGC CCATCCAAAA GCCCACTGGG AGAGGCATAA GATTCTGTGC 1260 

CAGGCCCCCA GCTCCCCrTCT GTOTCAGCSTA GGCTCTGCTA CTGGCCTCTG AAGTAAAGQC 1320 

15 AAANACAAAC GC3GCAGGC3CA GGGTCGCAGG AATAAAAAAC TCTCSGACAGA AACCCTTTTA 1380 

ATAAAGGAAA TTCCACCCCT CCCAATCCTT CCATGGAA£3G GTGAGACCTT AATGTGATC5T 1440 

AAGAQGAAGG TCTrCTCTGG CTTICAGGGA AACAGCTGCA GCTGAAACTT AGGGGCCCAT 1500 

20 

TCCAGGGCAC TTTTCACCAC AGCCAGTOCA GCCGCTCCAA GTGCCACTGT CAGCCCCATC 1560 

ACTGCCAATT TCACAAAGCG GTTGGTCCTT QGCTTGGTCA QGACATCTTT TGTTCGATCT 1620 

25 TCAGQCCQCA GAACTCCCCG AANACCGCTG CCGCAGCACC ATATCAQC3CC TCTGCTGGQC 1680 

.TGATGCCAGC TCAAAGTCTT TGAAAGTAGA GGCTGCCGTC CTCTCAGCTr QCTGTTGGGC 1740 

AGCQGCCTCC CGAGCAACTT CGGATGQGQG AAACTGAACA AAAAGGTCTC CTSTCTGCTG 1800 

ATCAGTCTCT CATAGGGCAA GTCCTGAGGG ATCTGGGACA ACAGGTGGTG GACCGAGGCC 1860 

ATCTCACAGT CACAGTCCAG GACTTCCTGC TCGCGATACA ACACAATCAC GGCTGCAAAG 1920 

35 TAAATCGGCA TCACTGGGTG GCAGGCCAGG AAGAAGTCAT ATAACCGCAC GACGTGCCTG 1980 

AAGTCAGACA GGACATCCCC AAAOCAGGTG ATGAGCCAGC TGAQGGCAAA GATGGTCCCT 2040 

ACCTCAGCAC TCTQCATCAA GTCATCGAGC TCTGGATTCA CCTGGTCAAT GATCGGCATC 2100 

2118 

AGATAGTTTA ATATATGC 



{Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: . 
CCGGGTTCQG CIXTEOTOTCA GCAQCOGGGC QGCGCTOGGG CGGGACATGG CAGCCTGTAC 60 
AGCCCGGCGG CCTGGCOC?3X3 GGCAGCCGCT GGTGGTCCCG GTCGCTGACT QIGGCCCGGT 120 
60 GGCCAAGGCC GCTCK?IGCG CGGCCGNAGC TGGAGCCTTC TCGCCAGCGT CGACCACGAC 



180 
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GACC3CGGAGG CACCTCTCGT CCXXSAAACCG ACCAGAGGGC AAAGTGTTGG AGACAGTTGG 240 

TOTCTTTCAG GIXXTAAAAC AGAATGGAAA ATATGAGACC GGGCAGCTTT TCCTTCATAG 300 

5 

CATTTTTGGC TACCGAGGTO TCGTCCTGTT TCCCTGGCAG GCCAGACTGT RTGACCGGGA 360 

TOrGGCTTCT GCAGCTCCAG AAAAAGCAGA CSAACCCTGCT GGCCATC3QCT CCAAGGAGC3T 420 

10 GAAAGGCAAA ACTCACACTT ACTATCAGGT GCTGATTGAT GCTCGTGACT GCCCACATAT 480 

ATCTCAGAGA TCTCAGACAG AAGCTGTGAC CTTCTTGGCT AACCATGATG ACAGTCGGGC 540 
CCTCTATGCC ATCCCAQGCT TOGACTATGT CAGCCATGAA GACATCCTCC CCTACACCTC 



35 



45 



50 



(2) INFORMATION FOR SEQ ID JK): 194: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 base pairs 
55 <B) TYPE: nucleic acid 

(C) STRANraTOESS: double 

(D) TOPOLOGY: linear 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 



600 



CACTGATCAG CTTCCCATCC AACATGAACT CTTTGAAAGA TTTCTTCTGT ATGACCAGAC 660 

AAAAGCACCT CCTTTTOIGG CTCGGGAGAC GCTAAGQGCC TGGCAAGAGA AGAATCACCC 720 

20 CTCGCTGGAG CTCTCCGATG TTCATCQGGA AACAACTGAG AACATACGTG TCACTGTCAT 780 

CCCCTTCTAC AIGGGCATGA GGGAAGCCCA GAATTCCCAC GTGTACTGGT GGCGCTACTG 840 

TATCCGTTIG GAGAACCTTG ACAGIGATGT GGTACAGCTC CGGGAGCGGC ACTGGAGGAT 900 

25 

ATTCAGTCTC TCTGGCACCT TGGAGACAGT GCGAGGCCGA GGGGTAGTGG GCAGGGAACC 960 

AGK3TTATCC AAGGAGCAGC CTCCGTTCCA GTATAGCAGC CACGTCTCGC TGCAGGCTTC 1020 

30 CAGTGGGCAC ATCTQC3GGCA CGrTTCCGCTT TGAAAGACCT GATGGCTCCC ACTTTGATGT 1080 

TCGGATTCCT CCCTTCICCC TOGAAAGCAA TAAAGATGAG AAGACACCAC CCTCAGGCCT 1140 

TCACIGOTAG GCCAGCTCAG GCCCCAAGTG CCCAGGCTTG GTCACCGGGA AGAACAACTC 1200 

TCATCCCACA ATTGCTGCAG AACTCTTCTC TCCCCATCAT GGGCCACAGT GGGTCTCTTA 1260 

ATrTGATTGT GGGGTTCTTT TTGTGGC3GAG GGGTGGTATA ACTTTTCTTC AGAAGACCCA 1320 

40 TOIGGQACAC CTCCAAGGCT GGCCTCCTCA TAAGCCCTGC CTACACCATG TTCCAGTAAA 1380 

CCTCTCCACC AAGGAACIX3T GTrCAGCTGC CACAGGCCTG GAGGAGTTTC CTGGCCTGTC 1440 

ACGTGAQ6TT TGATCAGTAA ACCAGTC3CAS GYTTCGCCAA AAAAAAAAAA AAAAAAAAAA 1500 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAACTCGA 1538 
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(2) INFORMATION FOR SEQ ID NO: 195: 



AGACCCTOTC TCAAATAATA ATAATAATAA TAATCTTATT TTQGAGAATA AAGAGACCTS 60 

TGGATTTGAG GTCCCATTTG GGTAGAAAGA AAAGACGTTT ACACCGAGAA ATAGTCTGTG 120 

TTCCCCIGAA GGAGO^GAGG GATGCATCGC TGGAGGTGAC CTACAGTTGA AGAAGACTCA 180 

TTATGACAGA CCTTGTCCTT CTTCCTTCTG GAAAGTGTTT CCTCTGCTGC TACTGCTCAT 240 
GAGACTCTTC CCCCTCCCTG TCCCAGGGAA CCAAAGC3GCT TTNCTACXAC ACCCTTTCTT • 300 

NGCCCCCCGC CTCCrATGTC TGCTGTGCCT ITCTACTCAG CAATTCTTNG TTTGCTCCCA 360 

TTATCTTCCA GCCQGATACA GAGTOAATAG TTAACCACAC TTAGGTCAAA TAGGATCTAA 420 

15 ATTTTTCTTC CTGCTCCNGT GTAAAGAGGC CAGTGTTTGT GTGTTGCAAG CAGCCTTGGA 480 

ATAGTAACTC TTCTCATTTG TTTGGGATCT GGCCAMCAAG TTCCAGAATG ATACACGGAT 540 

CAGTGCAGAA GTTCATCAGG CTCTCGGACC TTAGGGCTGT TGGAGAAGGC TTCAGCAGCA 600 

GAACTCATGG TKAWKGYTCG TGTTCTCCAT CCTCAACTTT CTTTGCrTCG ATCATACACA 660 

AGAATACATT TGGAAGGGCA AAAAATGAAC ACTGTTGTTC ATTGCAGCCG TGTTTTGTGA 720 

25 CACAGATGCA CAGTCTGCTC TGAAGACCTT CTCTCAAGTG GSATVTQGGA GTCCATGCCA 780 

GATCATGGTG CTTCATCAGA GACTGACAGC TATCAGGGGT TGTGGCACTT AGTGAGGACT 840 

CTCCTCCCCC AGTGTGTCCT GATGACACAT ACACACCTGA CAATAGCTTG AGTCTTCTCT 900 

GOTCCTTTTA CTCTGTAGCC AACATACACA TGATTTAAAA CCCTTTCTAA ATATCTATCA 960 

TOGTTCATCC TTCTCCAAAT GCAGAGTCAG AGCTATTTGT ACTTCATTAT TATTTCCAAG 1020 

35 GCGAATAGTT GGCTTTCTTT TTGCAAAAAT AATTAAAGTT TTTGTATGTT GCAAAAAAAA 1080 
AAAAAAAAAA CTACGTAG 



1098 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 1001 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doiible 

(D) TOPOLOGY: linear 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 

GAATTCGGCA CGAGATAGCT TGCATCTCAT CCCAGTAAAA CCACTTATTT ATAACATATC 60 

AACGTATTCA CAAGGTTGAA GAGCAAGATT GTTCTGAGGT GAGATGCAAA TTTCAAAGGG 120 

GTGAGCACTA ATTGTTCCAG TGATTGTrTA TTTATTGGCT AGGACATAAT TACTCTCTTT 180 

GAGGTTACAC ATCTOCCTCC AGGTTCCTGT GTGCTTGTGC CCTTGGGATC AGGCCAGGGC 240 
60 AGACroiGAT CACTGAGATT CAAACTCCCA GARTAATCAG CAAGAGCTTT CTAGAGACCA 
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AGGCCAGGCC TGATCCCTCA GGGATGCATG AGAAGGCTTG GAATCTCATT CTGCTATC3GT 360 

GGCTCTCTCT TGATCTTCTT GGAGTAQCAA AAACAGCAAT GTGGGCCCAA TGGTGTGGCC 420 

5 

TAAATGATCA CAAAGGTAAA TGAGTAAAGG GCTCAGCAGA TC3AGTAAGGA GCCTTGTCCT 480 

GAGAAATTAG CACTGGGCTC TGCATTCAGA AACATGTGAT AAGCATTGCC CATTGCACAT 540 

10 TGCCTTTATT GTCTAAGGAC ATGAAATTCC AGTTTTGCAT AGCTAGTGAT GAATACCTGA 600 

AGGGAATTGC AGACATATTT TATTTTATTT TTAATTGACA GATGGAATTG TATATATTTA 660 

TCATCTACAT AATCATGCTT TAAAATATGT ACATTATGGA ATGGCTAAAT CAAACTAACr 720 

TAGGCATTAT CTCATATAAT TCTCMTTTT GTGGCGAGAA GACTAAAAAT CTACCXriTTC 780 

AGCATTTTTA AAGAATACAA, TGTCJrTTTAT TAACAACAGT CACCATTTGG TACACTAGAT 840 

20 CTCTTGAACT TCTTCCTCTT ATCTAACTGA GATCTTGTAA CXTTTGATAA CAGCTCCCAA 900 

GCCCTTCCCC AACCACTCCT CCACCCGTGG TAACCACCAT TCTATTCTCA ACTTCCTGGT 960 

AATCACCATT CTAGACACAG GGAAGACTCT CTACCCTCTG A 1001 

25 
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(2) INFORMATION FOR SEQ ID NO: 196: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1443 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : .do\jble 
35 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 

ATAAACTGAA ATAGGTCATG CAAATATAAA ATATTATTTT TAAATTATTT GTCATAftGAA 60 

ACGATGCTGG CCATATTTTG CTTTAATAAT GGAAAAAATG TGGTTAGCAT TCTRTGGAAG 120 

GTCGTCATCA GATAGTAGAC ATTTTCTAGG ATTTATTTCT ACCTGCATAT GTGGAAATGrT 180 

.45 GTACTACTTT AGATTTATWr AATGGCAGCT AACTCAGAGG CATCAAAATG TGCTAATGGT 240 

GTAATATCXSC CTTTGTCTTG CTGTYCTGTT TTGTARGCCT TCAATCAAGC ARGGGCAGGG 300 

CCCTACAGTC AACTTOICCT TTGSCAGACG CCAGCGTCTG CCCCTGACCC CGTCTCCACT 360 

CTCTGTGTCC TGGftGGflGGA GCCCCTTGAT GCYTACCCTG ATTCACCTTC TGCGTGCCTT 420 

GTACIGAACT GGGAAGAGCC GTX3CAATAAC GGATCTGAAA TCCTTGCTTA CACCATTGAT 480 

55 CTAGGAGACA CTAGCATTAC CGTGGGCAAC ACCACCATGC ATGTTATGAA AGATCTCCTT 540 

CCAGAAACCA CCTACCGGTG AGTGCAAGGG AGTAGAAATC TGCATCAGCA CATCAGCACT 600 

TGGGGATCTA AGTAAACCTC TCGGGGAAAA TGACCAAGTG GATGTCATCT CCCAGCTGTT 660 

60 
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840 



TCTAAfiAGCC CAGATGTCCA GAGTATTGTC TCACCTTTGAT CCCTCAG:^rC i!-Al---i3^iCC?G 
TGAAAAAGCC ACACTGGTTC AGGGACTCAC T3GACGCjrTT TGTGCCC^^ YrA-iCTrGCA. 
CCGTCTCTAC CCCAGAGTGG ACTCARATCC TCAPxTTC-.TC CTCTGA.-J3AT T'l^.-C-'rCAa-. 



AATTATAAAA GGGCTTTGGC AATATGTTAG CCCA-.3AATT TGGCTTCTTC C=i3AAACTGT- 900 



55 



GCCGACNTTA ACAGTGGCTT AAATGATGGT AAAAC-V-TA ^JSATTTCT.-A P^AS^PvTGGC^. 



(2) INFORMATION FOR SEQ ID NO: 197: 



960 



TTCGAGATAC GTTCACTTTT ATTAAACMAC CTATAGCTGT TTAATGAiTT CTAJ^AAAAA? 1020 



1080 



ATCTQGAGCT CAGQGGTTCA ACTGAGGGAA O-CATSTTSA GRATCATTCT TT ATTAATTA 
15 AATGCCAGGT AACCCGTTGA AATTATCAA^ AACATCTTCC ACGTACCA3A AASCPsCCTCA 1140 
GAGGATAGTT CTGTTATGGA GAAGATGAAA TSGTTT.-^A GTGTAGa-AC TArGGAAAGG 1200 
TGAGCTTAGA TTTGGATAGT AAAACCTCAA GACCCTAm AAAAAGTATT ITArGAATGC 1260 
AGCATAAATA ATTTAATTCA GTGTTAANAT GCCAAGGCTA GTATA.TTGAG CTa-ATGTGA 1320 
AAAGAAACTC ACATTGQGAG AATGCCACCT TTTCCTTATA AGATAGCTTT a-A3ATACCA 1380 
25 TTTTAGACAG ATGGAAATTG AATAC3CTTTA GAAAA3GCAA ATGTTTGArC TIGC-GGAAAA 
AAA 



1440 
1443 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 1282 base psdrs 

(B) TYPE: nucleic acid 

(C) STRANDmNESS : double 

(D) TOPOLOGY: . linear 

40 {xi) SEQUENCE DESCRIPTION: SEQ 12 TO: 197: 

GAAAAAAAAA AGTATGACCC AGTAGCTAGG CACCTTOGC CCCGCCAAGT CGACACATAA 60 

AAITAACICT CACAGTATCA TCTTAGAAGT GAAAGftAGCC CCTTTATCCT GCAGTGCCCC 120 

TCTACCACCA CCTACTCACA AAGAACATQG TGCTirCTGG CMX3GGAGAA AC-STTCAGTr 180 

TCCTATGGCT TGTATGTGTC CCCTCAAATT CAAGrTSTTGC CAATGTG;i:A GCArCAAGAG 240 

50 GTCGGGTCTT TAAGAGATCA CTAGGCCATG AGGGArTCTC TTAGGACTGG GATGAAGGCC 300 

CATAATAAAA GAGCTTTCAG GGAGCATCCT GCTAJS-CTTGC CTTCTGTATG TGi^^^CACA 360 

GCAAGAAAGC CCTAGTCAAC AAGTGCCAGC TCCn^SAITCT TAGACTTCCC ATCCTCCAGA 420 

ACTGTCAGAA ATACATTTCT GTTCCTTACA AATTACXXAG ?CTCCTG?AT TCTOTTATAG 480 

CAGCACAAAA TGAAGATAOC ATACCTGAAC ACCTSAACAT TCTTCACAAG GTACTAAATG 540 

60 CACTCCTTTA TICTOGTCTC AGTATTGTGT GCTTAATAAG GAAATGAGAA AGGGTGGATC 600 
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AGGGCATAGG ATCAACAAGT TACTGCTAGA CCTCTCACPJl TGCCACT.-AT Oa- -----iSATT 

CTATTTTCAT CATTNCTTGT CTCTTCGGAA GCTAACACCA TGCTA- A.-.rA GC-ZATTAAAT 
AGATCTCTAA AAACACCTTA AGTATTTGTC TAGAAATTT^ GCGCACTSTC CJCX-J'SiG^J-X: 
CAAAATTCMA AATAATTTCA AAGGGCCTAA AGCACTAIOT AATQ^AA^-.TT CA7:;.3TTTT 
10 TAATGGTACT ACCACTCTCA AATTTAAAAT GTCATC77AC GrTCCTCTTC CTCC-CATTC-G 
ATTTATTGCT AAAACCTGGT AAACACTTTA ATCCYTTTCA ATTCCArr.-^ CACTC-CrrCTT 
GTCCAGAATT ACTCGCAGAC TAATAGTCAG CTGACTTCrC CCCCTC-ZATC CCGATTTGCT 
GTCTAATTCT GGTTACAAAT AAGTAACTGC CAAACTAATC TTTCTAAAA2i GCAA3ACTGA 
TCTCGTCACT CCTTTGCTCA ACAATGTAAA AGCTCCCArT GrTCTCCCAAA TAA-ACCAGC 1140 
20 TTTCCACTGT GTATACAATA CATCCATGAT CTGTATCCAS Ci-TCArTTTG TATT-XSCTCA 1200 
CTTTATACAC CACCCCCCAT GCCACATCAA ATTAA^TTAT CCTGATAPvA? GCA^rTGCAA 1260 
AAAAAAAAAA AAAAAAACTC GA ^282 

25 



(2) INFORMATION FOR SEQ ID NO: 198: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 951 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doti>le 
35 (D) TOPOLOGY: linear 



660 
720 
780 
840 
900 
960 
1020 
1080 



ixi) SEQUENCE DESCRIPTION: SEQ ID :/0: 198: 

ATTTCGGAAC GAGGACTGAA GTGGGAGCGG CGGCAGGGTA G^A^ACAGPA GGG3GATCTA 60 

TGTGCSTAACT AAAGAATCTT TCTGTTTTGT TAATTATTC?? GTGTGCGrTGG CTTTATTGTT 120 

TCCITAAGAG AATCAAAAAC TGAAAAAAAT GAGAATACAG GAAATGGCTC "rrGTrrATTT 180 

45 TTTTGCTGTG TTTACAGCTT GTTAATGCTC TACTGTCrTT GTTTCAiiGAG ASATTTGTTC 240 

ACrcCCCAGC TCGTTTTGTG TCCTGAGCCC TATGCCCAGC CG^rCTTArA AATC-.TGCCT 300 

GTTTAGATGT TTGATTTTGT TCTGTTTGCT ATTGTTATCT TAAJ^GGT^TTA rAACTCTGAC 360 

ATCCCAGACA TCAAATTAAG CTCAAATTAA GCTCTCGTTT AAATCTTAA ACACCTAATT 420 

TATATTCTAA TTCATCCCAG CCACTGATGC ATGTACTTTA GCTACITC^ CTAAATAAGC 480 

55 ATATTAAOTT TCCACATCAG GCCATCAGAT CTTGAGAACC AACACSTTACC CSGAATTCOG 540 

TCTCTACTAA TGTITCACCT GCATGCAGCC TTCATTAACT TTGTAGCAAA ATA7AAAGTG 600 

A1CATTATGT AGTTTCIGGA TTAAAAAAAT TTGTGTGTGA ^iTtTC-CTTTG 'rAA;CTGCAT 660 

60 
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15 



25 



35 



45 



451 

GTCGAATTAA TGGGACAGTC TGCCCTTTGT GTTAGATGTT AGAGCAAAAG AAAGGGCTTA 720 
TAGTGTTAGT ATTGGAGCAC TTTGAAGATA GATATTTTCA GAAAAGATGT AGGATTTAAA 780 
AGTTAAATTT TAAATTrTAG AAAAAGATAT GATGGCAATT GGAAATAGTC ACAATGAAGT 840 
TCTTCATCCA GTAGGTOTTT AACAGTGTTA TTTTGCCACT GGTAATGTGT AAACTGTGAG 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1740 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: doxjtole 
20 (D) TOPOLOGY: linear 



55 



900 



TGATTTACAA TAAATGATTA TGAATTCAAA AAAAAAAAAA AAAAAACTCG A 951 



(2) INFORMATION FOR SEQ ID NO: 199: 



(Xi) SEQUENCE DESCRIPTION: 'SEQ ID NO: 199: 
TTATTATAAT AATGATGATG ATTCCAAGGA AAAAACCTAC AGCGAATGTT CCATTTCTAC 60 

CCCGCACGCA GACflCTCTCC CTAACACTGA TAACCTGAGC CCCCAC3CACT GGACGGAAGA 120 

ATGCTCGCGT CTCCGTGTGT ACTGGTTCAG GGTTCTGGCC CCAGCCTTGT CAGGACCCCC 180 

30 TGGTCTCCAG AGCCCCCACC CCTCCCGCAA CAAGCAGCTG ATGCCCCAGT GATTCTCTAT 240 
ACATTTTTCA CCTCGGCCAA TATGTCCAGG AAAACTGCTT ACTTCTCTTT TCTTCCCTQG ■ 300 

AGCCTTCATT GTTCACCCTT ACGTTCCAAT ATAGGAATTA ATGCTACAAA ATAAAAGTAA 360 

AGCTTACCTG AAAAGTGCAT AGTTTGGGGC AATGGTATCT ACATCTCCCA CTGTGGGAAA 420 

ACCAGCAAAG CATCAAAACT CTCAATTCTC CTGTTACCRA ATGCAGATCT GAATTATAAG 480 

40 ATOTTTATGT TTOACCATTG TTTCAACAAT GGGATTTTGT TACGAATTAT CCCTTTAACT 540 

GAAACCCTCA GTTTTACTCT TTACATTATT AGGAAAACAG GGATATCTTT TGAATCTAAA 600 

AATITCATGT ACAGCAICTG ATTTTTGAAG TTTACATGTA AAGTCACAGT ATAQGTGAAA 660 

TAACGTTTOT CATATTTTGA GAOGTATCCT GCAGCCATGT TTTTACGTGA GTGTTTTAGT 720 
CAAAGTACAT GGTAGACAGT CTTTCACAAT AAAAGGAAAA QGATiTiTlT TCCTCCAAAT 780 
50 GTACATTTAT CAACCTAATC ATTGATITrT TTAAAAAGAG ATTTCGCCCC AGTCrGGriT 840 
ATGAAACTTC ATIGCCCTAA ACrGTOCTGA TTGTTTTTAA TCAAGTTATA AATTTCCAAC 900 
CTAGATCATG TATCTACCAA CICTCCTGCA TTTTCCAAAA GGCATTGAGC TTAAATATTA 960 

GTCrrcCTTA GAGTAGCTTA TCCACTTACA TGCTGCGCTA AAGCCATGCC TTTGAAACTC 1020 

CITOPTTAAA ACATCATATC ATTTTrGTGG GCAGTTTCAG AAAAGAAAAC AAACAAACAA 1080 

60 AAATCGACCC TTTAATTATT ACriGCAACT CAACAGATCT CCCTGCCGTA CTGCCTTTTC 1140 
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CAGGAACTTT ACTTCAGGGC TGTCCAGATT GCAGTTGTGC CX:CX3TGTATG TGGATCTAGT 1200 

TCACAGftGTC TTTGGAAGCC AQCAGTCGTG CCCTCCGTAT ACTGTCCACT CATTTTATGT 1260 

5 

AGATTTGGTA TCCTCAGCAG CCAGTCTTAA CACCACTGTC ACGTAGTTAN CAGATTCATC 1320 



TTTTATGTAT TTAAAGTAAT CCATACTATG ATTTGGTTTT TCCCTGCACC ATTAATTCTG 1380 



10 GCATCAGATC AGTTTTTCTG TTGTCAAGTT CTACTGTGGT TTGACCCAAG ACCACAACCA 1440 



TGAGACCCTC AAGTAAAGAT AAGGTACACA TACATTATTT GAGTAACTGT TTCCTTGGGG 1500 



GCCAATCTGT GTATGCTTTT AGAAGTTTAC AGAATGCTTT TATTTTTGTC TATAACAAAC 1560 

15 

AGTCTGTCAT TTATTTCTGT TGATAAACCA TTTQGACAGA GrTGAGGACGT TTGCCCTCTT 1620 

ATCTCCTAGT GCTAACAATA CACTCCAGTC ATGAGCXX3GG CTTTACAAAT AAAGCACTTT 1680 

20 TGATGACTCA MAAAAAAAAA AAAAAAAAMC VCGGQGQGGG GCCGGTAACC CATTTNNCCC 1740 



25 (2) INFOPMATIQN FOR SEQ ID NO: 200: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1707 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: do\lble 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 
35 GCTTATAGAA GGGftGAGGAG CGAACATCGC AGCGCGTTGG CGGTTTTGGT GTGTCTCTGT 60 
GACCAIQGTG GTGGOGCTGC TCATCGTTTG CGACGTTCCC TCAGCCTCIG CCCAAAGAAA 120 



GAAGGAGATG GTGTTATCTG AAAAGGTTAG TCAGCTGATG GAATGGACTA ACAAAAGACC 180 

40 

TGTAATAAGA ATGAATGGAG ACAAGTTCCG TCGCCTTGTG AAAGCCCCAC CGAGAAATTA * 240 



CTCCGTTATC GTCATGTTCA CTGCTCTCCA ACTQCATAGA CAGTGTGTCG TTTGCAAGCA 300 



45 AGCTGATGAA GAATTCCAGA TCCTGGCAAA CTCCTQGCGA TACTCCAGTG CATTCACCAA 360 



CAGGATATTT TTTGCCATGG TQGATTTTGA TGAAGGCTCT GATGTATTTC AGATGCTAAA 420 

CATGAATTCA GCTCCAACTT TCATCAACTT TCCTGCAAAA GGGAAACCCA AACGGGGTGA 480 

50 . 

TACATATGAG TTACAGGTGC GGGGTTTTTC AGCTGAGCAG ATTGCCCGGT GGATCGCCGA 540 

CAGAACTGAT GTCAATATTA GAGTGATTAG ACCCCCAAAT TATGCTGGTC' CCCTTATGTT 600 



55 GGGATTGCTT TTGGCTGTTA TTGGTGGACT TGTGTATCTT CGAAGAGTAA TATGGAATTT 660 



CTCTTTAATA AAACTQGATG GGCTTTTGCA GCTTTGTGTT TTGTGCTTGC TATGACATCT 720 



GGTCAAATOT GGAACCATAT AAGAGGACCA CCATATQCCC ATAAGAATCC CCACACGGGA 780 

60 
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CATOIGAATT ATATCCATCG AAGCftGTCAA GCXCAGTTTG TAGCTGAAAC ACACATTGTT 840 

CTTCroirrA AlOGIGGACSr TACCTTAGGA ATCSCSTGCTTT TA1CTGAAC3C TGCTACCTCT 900 

5 GACATOGATA TIOGAAAGCG AAW3ATAATG TGTC3TGGCTG GTATTGGACT TGTTGTATTA 960 

ncvTcmrt ggatgctctc tatttttaga tctaaatatc atggctaccc atacagcttt 1020 

CTCATCM5TT AAAAAGCTCC CAGAGATATA TAGACACTGG AGTACTGGAA ATTGAAAAAC 1080 

GAAAATCXJTG TCTOTTTCAA AAGAAGAATG CAACrTGTAT ATTITGTATT ACCTCTITTT 1140 

TICAAGTCAT TTAAATAGTT AATCATTTAA CCAAAGAAGA TGTGTAC?rGC CTTAACAAGC 1200 

15 AATCCTCIOT GAAAATCTCA GGTATTIGAA AATAATTATC CTCTTAACCT TCTCTTCCCA 1260 

CTGAACTTTA TOGAACATTT AATTTAGTAC AATTAAGTAT ATTATAAAAA TTGTAAAACT 1320 

ACTACnror TTTAGTTAGA ACAAAGCTCA AAACTACTTT AGTTAACTTG GTCATCTGAT 1380 

20 

■m-ATATTGC CTTATCCAAA GATCGGGAAA GTAAGTCCTG ACCAGGTGTr CCCACATATC 1440 

CCTCTTACAG ATAACTACAT TAGGAATTCA TTCTTAGCTT CTTCATCTTT GrrGTGGATCT 1500 

25 GTATAcrrrA cgcatctttc cttttcagta gagaaattat gtgtctcatg tggtcttctg 1560 

AAAATCGAAC AOCATICrTC AGAGCACACG TCTAGCCCTC AGCAAGACAG TTGTTrCrCC 1620 

TCCTCCTTGC ATATTTCCTA CTCAAATACA GTGCTGTCTA TGATTGTrTT TGTTTTGTTG 1680 

30 

TTTTmGftG ATCACGOTAC TGQGCTC 



1707 



35 



45 



55 



(2) INFORMATION FOR SEQ ID NO: 201: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 779 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANI^CNESS : doiible 

(D) TOPOLOGIf: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: 

CICTCCCCAG TOPTTCCAGG TAATCACTTG GCACTCCAGA GAAAGTTTCA TRCTGTTGCG 60 

TOKSGICGCT CCAAGCCAAG CACCTOGCAT GCAGGTCAGC CCTTCCCAGC GGGCGTGGCG 120 

50 TCGTCCICTT CACAGATCCC AOGTTOCAGC CCCAAGGCCT CACCATITIG CGTTTTTrAG 180 

AAACCCMTT TCTTCGTCAT TTATAAMCT GCTTTATftGA TATCITIGAT CTTGGCATGC 240 

CTTGCTTTCC TCTCCCTTCC CTCmCCAA TCCTGGTTTC CTAACCTCCT CTTCTAGTAA 300 

TTCICAACTC AACTCAAAGT CCX»AGAATT TGGAATGGTA GGATGCTGTG CQGGGAQCTC 360 

GAGGCIGAGG CATAATCACT GCTTCGGrrTC TGCTCATCAG GGGftCACGCT CrCTrACTCA 420 

60 TGGCAGCCAT GPITCAnCT CACAGAGCCC CCCGAATACT CTGTCTATAG TGACACACTG 480 
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TAGGICTCAT AAATTTTAAG AAACCTGCTT TTAAGTACTA TTTATAGGTT TTTCTGTTAT. 540 

ACTTCCAACC TAGTTTTAAA ATACATGAGG ATTTTATGAA AGCTTTATAC AGACATTTAT 600 

AGGAAACTCA TTCTTTGATT TTAGGTGCCA TTTAAATTGA TAACACTTAC TTTATAAAAA 660 

GATGCnTTT GTGTCGATAG AGCCTTATAG TTTAAAATAT CTTCATATAT TGCCATTTGA 720 

10 TCAAATAAAT TTCTTAGTTA GAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAACTCGA 779 



15 (2) INFORMATION FOR SEQ ID NO: 202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1617 base pairs 

(B) TYPE: nxicleic acid 
20 (C) STRANDEDNESS : double 

<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202: 
25 GGCACAGCTT TCTGTCTCTT CCTCGCTCCC TCTCTTTCTC TCCTCCCTCT GCCTTCCCAG 60 

TCCATAAAGT .CTCTOTCGCT CCCGGAACTT GTTGGCAATG CCTATTTTTT GGCTTTCCCC 120 

CGCGTICTCT AAACTAACTA TTTAAAGGTC TGCGGTCGCA AATGGTTTGA CTAAACGTAG 180 

GATCGGACTT AAGTTGAACG GCAGATATAT TTCACTGATC CTCGCGGTGC AAATAGCGTA 240 

TCTGGTGCAG GCCGTGAGAG CAGCGGGCAA GTGCGATGCG GTCTTCAAGG GCTTTTCGGA 300 

35 CICTTTCCTC AAGCTGGGCG ACACATOGCC AACTACCCGC AGCCTGGGAC GACAAGACGA 360 

ACATCAAGAC CGTGTCCACA TACT^GGAGG ATTTCCACAG CTGCACGGTC ACAGCCCTTA 420 

CGGATTGCCA GGAAGGGGCG AAAGATATGT GGGATAAACT GAGAAAAGAA TCCAAAAACC 480 

TCAACATCCA AGGCAGCTTA TTCGAACTCT GCGGCAGCGG CAACGGGGCG GCGGGGTCCC 540 
TGCTCCCGGC CTTCCCGGTG CTCCTGGTGT CTCTCTCGGC AGCTTTAGCG ACCTGGCTTr 
45 CCTTCTGAGC GTOGGGCCAG CTCCCCCCGC GCGCCCACCC ACACTCACTC CATGCTCCCG 

GAAATCGAGA GGAAGATCCA TTAGTTCTTT GQGGACGTTG TGATTCTCTG TGATGCIGAA 720 

AflCACTCATA TAQGATTOTG GGAAATCCTG ATTCTCTTTT TTATTTCGTT TGATTTCTTG 780 
TCTTTTATTT GCCAAATCTT ACCAATCAGT GAGCAAGCAA GCACAGCCAA AATCGGACCT 
CAGCTTTAGT CCGTCTTCAC ACACAAATAA GAAAACGGCA AACCCACCCC ATTTTTTAAT 
55 TTTATTATTA TTAATTTTTT TTGTTGGCAA AAGAATCTCA GGAACGGCCC TGGGCACCTA 960 

CTATATTAAT CATGCTAGTA ACATGAAAAA TGATGGGCTC CTCCTAATAG GAAGGCGAGG 1020 
AGAGGAGAAG GCCAGGG6AA TGAATTCAAG AGAGATGTCC ACGGACGAAA CATACGGTGA 



30 



40 



50 



600 
660 



840 
900 



1080 
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ATAATTCACG CICACGTCGT TCTTCCACAG TATCTTGTTT TGATCATTTC CACTGCACAT 1140 

TTCTCXrrCAA GAAAAGCGAA AGGACAGACT GTTGGCTTTG TCTTTGGAGG ATAGGAGQGA 1200 

GAGAGC3GAAG GGGCTGAGGA AATCTCTCGG GTAAGAGTAA AGGCTTCCAG AAGACATGCT 1260 

GCTATCCTCA CTCAGGGGTT AGCTTTATCT GCTOTr OT TG ATGCATCOGT CCAAGTTCAC 1320 

TCCCTTTATT TTCCCTCCTC CXTCrTGTTT TAGCTGTTAC ACACACAGTA ATACCTGAAT 1380 

ATCCAACGGT ATAGATCACA AQGGGGGGAT GTTAAATGTT AATCTAAAAT ATAGCTAAAA 1440 

AAAGATTTTG ACATAAAAGA GCCTTGATTT TAAAAAAAAA AGAGAGAGAG ATGTAATTTA 1500 

15 AAAAGTTTAT TATAAATTAA ATTCAGCAAA AAAAGATTTG CTACAAAGTA TAGAGAAGTA 1560 

TAAAATAAAA GTTATTGTTT GAAAAAAAAA AAAAAAAAAW CTCGACCGCA AGGGAAT 1617 



10 



20 



35 



45 



55 



(2) INFORMATION FOR SEQ ID NO: 203: 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 1974 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D> TOPOLOGY: linear 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 

GAATTCGGCA CGAGGCTCAG GGAGCTGCAG CGCAGCAGAG TATCTGACGG CGCCAGGTTG 60 

CGTAGG1GCG GCACGAGGAG TTTTCCCGGC AGCGAGGAGG TCCTGAGCAG CATGGCCCGG 120 

AGGAGCGCCT TCCCTCCCGC CGCGCTCTGG CTCTGGAGCA TCCTCCTGTG CCTGCTGGCA 180 

CTCCGGGCQG AGGOCGGGCC GCCQCAGGAG GAGAGCCTGT ACCTATQGAT dsATGCTCAC 240 

40 CAGGCAAGAG TACICATAGG ATTTGAAGAA GATATCCTGA TTGTTTCAGA GGGGAAAATG 300 

GCACCTTTTA CACATGATTT CAGAAAAGCG CAACAGAGAA TGCCAGCTAT TCCTGTCAAT 360 

ATCCATTCCA TCAATTTTAC CTGGCAAGCT GCAGGGCAGG CAGAATACTT CTATGAATTC 420 

CTCTCCriGC GCTCCCTGGA TAAAGGCATC ATCGCAGATC CAACCGTCAA TGTCCCTCTG 480 

CTGGGAACAG TGCCTCACAA GGCATCAGTT GTTCAAGTTG GTTTCCCATG TCTTGGAAAA 540 

50 CAGGATCGGG TOGCAGCATT TGAAGTGGAT GTGATTGTTA TGAATTCTGA AGGCAACACC 600 

ATTCTCCAAA CACCTCAAAA TGCTATCTTC TTTAAAACAT GTCAACAAGC TGAGTGCCCA 660 

GGCGQGIGCC GAAATGGAGG CTTTTGTAAT GAAAGACGCA TCTGCGAGTG TCCTGATGGG 720 

TTCCACGGAC CTCACTOTGA GAAAGCCCTT TGTACCCCAC GATGTATGAA TGGTGGACOT 780 

TGTGTCACTC CTGGTTTCTG CATCTGCCCA CCTGGATTCT ATGGAGTGAA CTGTGACAAA 840 

60 GCAAACTGCT CAACCACCTC CTTTAATGGA GGGACCTGTT TCTACCCTGG AAAATGTATT 900 
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TSCCCrCCAG GACTAGAGGG AGAGCAGTGT GAAATCAGCA AATGCCCACA ACCCTGTCGA 960 

AATGGAGGTA AATX3CATTCG TAAAAGCAAA TGTAAGTKTT CCAAAGGTTA CCAQGGAGAC 1020 

5 

CTCTGTTCAA AGCCTGTCTG CGAGCCTGGC TGTGGTGCAC ATGGAACCTG CCATGAACCC 1080 

AACAAATGCC AATGTCAAGA AGGTTGC3CAT GGAAGACACT GCAATAAAAG GTACGAAGCC 1140 

10 AGCCTCATAC ATGCCCTCAG GCXi^GCAGGC GCCCAGCTCA GGCAGCACAC GCCTTCACTT 1200 

AAAAAGQCCX3 AGGAGCGGCG GGATCCACCT GAATCCAATT ACATCTGGTG AACTCCGACA 1260 

TCTGAAACGT TTTAAGTTAC ACXIAAGTTCA TAGCCTTTC3T TAACCTTTCA TGTGTTGAAT 1320 

GTTCAAATAA TGTTCATTAC ACTTAAGAAT ACTGGCCTCA ATTTTATTAG CTTCATTATA 1380 

AATCACTCAG CTCATATTTA CTCTTCCTTT TAAGTTTTCT AAGTACGTCT GTAGCATGAT 1440 

20 GGTATAGATT TTCTTGTTTC AGTGCTTTGG GACAGATTTT ATATTATGTC AATTGATCAG 1500 

GTTAAAATTT TCAGTGTCTA GTTGGCAGAT ATTTTCAAAA TTACAATGCA TTTATGGTGT 1560 

CTCGGGGCAG GGGAACATCA GAAAGGTTAA ATTGGGCAAA AATGCX?rAAG TCACAAGAAT 1620 

TTGGATGGTG CAGTTAATGT TGAAGTTACA GCATTTCAGA TTTTATTGTC AGATATTTAG 1680 

ATOnTOTTA CATTTTTAAA AATTGCTCTT AATTTTTAAA CTCTCAATAC AATATATTTT 1740 

30 GACCTTACCA TTATTCCAGA GATICAGTAT TAAAAAAAAA AAAATTACAC TGTCGTAGTG 1800 

GCATTTAAAC AATATAATAT ATTCTAAACA CAATGAAATA GGGAATATAA TGTATGAACT 1860 

TTTTGCATTG GCTTCAAGCA ATATAATATA TTGTAAACAA AACACAGCTC TTACCTAATA 1920 

AACAOTITAT ACTGTTTCTA TGTATAAAAT AAAGGTGCTG CTTTAGTTTT CTGA 1974 



25 



35 



40 



50 



(2) INFORMATION FOR SEQ ID NO: 204: 



(i) SEQUENCE CHWRACTERISTICS : 

(A) LENGTH: 1057 base pairs 
45 " (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: 

CGGCCTTCCG GGGCAACCGT TCGTCCCAAC NCGGGAAAGG GTCCTGGAGN CGGGAACTAG 60 

GAGCCTCGGA AGTCCAAGGG CGGAGCGCCC riTGCTAATA AGCCAATCAG AACGTGAGAC 120 

55 GCTCCGGTGG <aiC0C5TOCCG TCGAGCGCGG GGTGGAGTCT GGGTGACTTG GCTGGCGGGA 180 

TCAAGTGCAG CTGCTTCAGG CTGAQGTGGC AGATAGTGAG OGCTGGTOGC QGAGTTAAAG 240 

TYAAAGCAGG AGACTAfllWA TGAATAGCGC AGCGGGATTC TCACACCTAG ACCGTCQCGA 300 

60 
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457 

GCGGGTTCTC AAGTTAGGGG AGAGTTTCGA GAAGCAGCCG CGCTGCGCTT CCACACTGTC 360 

CGCTATGACT TCAAACCTGC TTCTATTGAC ACTTCTTCTG AAGGATACCT TGAGKTTGGC 420 

GAAGRTGAAC AGKTGACCAT WACTCTGCCM AATATAGAAA GTTGAAGGAA C3CAGTAAAAT 480 

TCAGTATCCT AAAGAACAAC AGCAACAACA ATGTGGAATT CASCCAGGAC TCCCAATCTT 540 

OTAAAACATT CTCCATCTGA AGATAAGATG TCCCCAGCAT CTCCAATAGA TGATATCGAA 600 

AGAGAACTGA AGGCAGAAGC TAGTCTAATG GACCAGATGA GTAGTTCSTGA TAGTTCATCA 660 

GATTCCAAAA GTTCATCATC TTCAAGTAGT GAGGATAGTT CTAGTGACTC AGAAGATGAA 720 

15 GATTGCAAAT CCTCTACTTC TGATACAC3GG NAATTGTGTC TCAGGACATC CTACCATGAC 780 

ACAGTACAGG ATTCCTGATA TAGATGCCAG TCATAATAGA TTTCGAGACA ACAGTGGCCT 840 

TCTCATGAAT ACTTTAAGAA ATGATTTGCA GCTGAGTGAA TCAGGAAGTG ACAGTGATGA 900 

CTGAAGAAAT ATTTAGCTAT AAATAAAAAT TTATACAGCA TGTATAATTT ATTTIGTATT 960 

AACAATAAAA ATTCCTAAGA CTGAGGGAAA TATGrrCTTAA CTTTTGATGA TAAAAGAAAT 1020 

25 TAAATTTGAT TCAGAAAAAA AAAAAAAAAA AACTCGA 1057 



30 (2) INFORMATION FOR SEQ ID NO: 205: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 721 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 205: 

40 GAATTCGGCA CGACTCATCC CTCTCCCTCT TTCACTCCCT TACTCTTACT CTGTTTTTTG 60 

TGCTCCAGAC AGACAGACCC TACCTdTTT GCTTCTTTTT TGTTTGTTTG TTTTGAGATG 120 

GAGICTCGCT CTTCTTCCCC AGGCTGGAGT GCAGTGGCGC AATCTCGGCT CACCACAACC 180 

TCTGCCTCCC GGGTTCAAGC AATTCTCCTG CCTCAGCCTC CCGAGAAGCT GGGGATTACA 240 

GGCATGCGCC ACCACACCCA GCTNAATTTT ATATTTTTAG TAGASATGGT GTTTCTCCAT 300 

50 C?rTGGrCACX5 CIGGCCTCAA ACTCCCAACC TCAGGTGATN CCGCCTGCTT TGGCCTCCCC 360 

AAAGTOCTGG GATTACAGGC GTGAGCCACT GCGCCCAGCC TCTTTTGCTC CTTTATACTC 420 

ATTAACTCAC GCCTGTAATC CCTGTTTTGG GAQGCCAAAG TGAGAAGGTT GCTIGAQGCC 480 

AAGACTTTGA GACTAGOCTG GGCAACACAG CAAGATGCCA TCTTTATAAT AAAAATAAAA 540 
ATAAAAATCA ATTAGCTGGG CATGGTQGAA CGCACCTGTA GTCCCAGCCA ATTOAGACSGC 



600 



60 TGAAGIGGGA GGATCATTCA GCCCAGGAGT TGAGGTTGCA GTGAGCCATG ATCATGTCAC 660 
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TACACTCAGC CTCQGCAATA GftGGGACATG TTGTCTCTAA AARAAAAAAA AAAAAACTCG 720 

721 

A 



10 



30 



40 



(2) INFOFMATIOM FOR SEQ ID NO: 206: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2465 base pairs 

(B) TYPE: nucleic acid 

(C) STRftNDEDNESS : dovible 

15 (D) TOPOIiOGif: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 206: 
CCACCATTTA TCCAACTCAA GAGGftGTTAC AGC3CAGCTCA C3AAAATTC3TT TCTATTACTG 60 

20 

AftCGrecm AAAACTCCTT TCAGACAGTT TOTCTGAACA TGAGAAGAAC AAGAACAAftG 120 

AGGGAGATCA TAAGRAAGAG QGAGCOTAAftG ACflGAGCTTT GAAAGGAGTT TTGOCSftGTQG 180 

25 GACTATTCGC AAAAGGATTA CTTCTCCGftG GAGATAGAAA TGTCAACCTT GTPrroCTCT 240 

GCTCAGAGAA ACCTTCAAAG ACATTATTAA GCCCTATTGC AGAAAACCTA CCCAAACAGC 300 

TTGCICTrAT AflBCCCTGAG AACTATGACA TAAAATGTGC TGTATCTGAA GCGGCAATAA 360 

TTTIGAATTC ATOK3TGGAA CCCAAAA-TOC AAGTCACTAT CACACTGACA TCTCCAATTA 420 

TTCGAGAAGA GAACATCftGC GAAQGAGATG TAftCCTCQGG TATQGTGAAA GACOCACOQG 480 

35 ACQTCTTOGA CAGGCAAAAA TCCCTTCACG CICTQGCTGC TCTACGCCAC GCTAAGTGGT 540 

TCCAGGCTAG AGCTAATCGT CTGC!«?rcCT GTGTGATTAT CATACGCATT CTTCGAGACC 600 
TCTGTCAGCG AGTTCCAACT TQGTCTGATT TTCCAAQCTG GGCTATGGAG TTACTAGTAG 
AGAAAGCAAT CAGCAGTCCT TCTAGCCCTC AGAGCCCTGG GGATGCACTG AGAAGAGTTT 

rrcaATGCAT TrcrrcsGGG attattctta aaggtagtcc tggacttctg gatccttgtg 

45 AAAAGGATCC CmXSAIACC TTCQCAACAA TGACTGACCA GCAGOGTGAA GACATCACAT 840 

CCACTGCACA GmOCAaro AGACTCCTtG CATTCCGCCA GATACACA^ 900 

TOGATCCATT ACCGCAAATC AGCCAACGTT TTAACATCCA CAACAACAGS AAACGAAGAA 960 

GAGATAGTOA TCGAISTTOAT GGATTTGAAG CTOAGGGGRA AAAM3ACAAA AAAGATTATG 1020 

ATAACnTTA AAAAfflCTCT GTAAATCTTC AGTGTTAAAA AAACAGATOC CCATITOrTG 1080 

55 GCIOTTTTTC ATIXaiAATA ATGTCTACAT TGAAAAATTT ATCAAGAATT TAAAGGATTT 1140 

CAJTOGAAGAA CCAASmTP CTATCATAIT AAAAAAaOTA CAGTGTTAGG TATEATTTGA 1200 

ATOGAAAGAC ACCCAAAAAA AAAAATGTGC TCCGACTAGG GGGAAAACAG TAGTTCCGAT 1260 

60 



660 
720 

780 
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10 



15 



20 



25 



30 



35 



40 



TTTTTCCCAT TATTTTTAOT TTATTTTCTG GTTGCCCTAG CTTCCCCCCC TATTTTTGTG 1320 

TCTTTTATTA ACTAGTGCAT TCTCTTATTA AATCTTCACT GTATTTAATG CAGGATGTGT 1380 

GCTTCACTTG CTCTOTGTAT TITCATATTT TAATTTAGAG GTTTTGTTTG CmTTGACA 1440 

CTAGTTOTAA GTTACTTTGT TATAGATGGT ATCCTTTACC CCTTCTTAAT ATTTTACAGC 1500 
AGTACGTTTT TTTGTAACGT GAGACTGCAG AGTTTGTTTT TCTATATGTG AAGGATTACA * 1560 

ACACAAAAAG TTATCCK3CC ATTCGAGrTGC TCAGAACTGA ATGTTTCTGC AGATCTTCTG 1620 

GCATTTGTCT CTAGTGTGAT ATATAAAGGT GTAATTAAGA CAGAGTTCTG TTAATCTAAT 1680 

CAAGTrrcCT GTTAGTTOTG CATTAGCAGT ATAAAAGCTA ATATATACTA TATGGTCTTG 1740 

CAACAGTTTT AAAGCCTCTG CATAATTGAT AATAAAAATG CATGACATTC TTGTTTTTAA 1800 

TAGACTTTTA AAATCATAAT TITAGGTTTA ACAOGTAGAT CTTTGTACAG TTGACTTTTT 1860 

GACATAGCAA GGCCAAAAAT AACTTTCTGA ATATTTTITr CTTGTGTATA AGTGGAAAGG 1920 

GCATTTTTCA CATATAAGTC GGCTAACCAA TATTTTCAAA AGAACTTCAT CATTOTACAA 1980 • 

CTAACAACAG TAACTAGCCC TTAATTATGG TGACAGTTCC TTATTGGTGT GTGTGAGATT 2040 

ACTCTAGCAA CTATTACAGT ATAACACAGA TGATCTTCTC CACACACCCC ATCACCCAGA 2100 

TAATTTAGAG TTCTCTTAAC AGTGAGGTTG ATAAAGTATT ACTGATAAAA AATTATCTAA 2160 
GGAAAAAAAC AGAAAATTAT TTGGTGTGGC CATCTTACCT GCTTATGTCT CCTACACAAA 2220 
GCTAAATATT CTAGCAGTCA TGTAATGAAA AATTACATCT TACTGTTGAT ATATGTATGC 2280 
TCTGGTACAC AGATOTCATT TTGTTGTCAC AGCACTACAG TGAAATACAC AAAAAATGAA 2340 
ATTCATATAA TGACTTAAAT GTATTATATG TTAGAATTGA CAACATAAAC TACTTTTGCT 2400 
TTGAAATX3AT GTATX3CITCA GTAAAATCAT ATTCAAATTT AAAAAAAAAA AAAAAAAAAA 2460 
CTCGA 2465 



45 



50 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTI(»I : SEQ ID NO: 207: 
GAATTCGGCA CGAGCTCAAG CIGGCAGGTG GTCGGGGGAG CGGCCGGAGA QGAGCTGCCG 
GGAGTTCGTG CCCIGCAGGA CATGACACCA GTGGCATATC ACGQCCATGG GGTCTCAGCA 
TTCCGCTGCT GCTCGCCCCT CCTCCTGCAG GCGAAAGCAA GAAGATGACA GGGACGGTOT 



60 
120 
180 
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GCTGGCTCAA CGAGfiGCAGG AAGAAGCCAT TGCTCAGTTC CXATATGTGG AATTCACCC5G 240 

GAfiAGATAGC ATCACXTOTC TCACC?rQCCA GGC5GACAGGC TACATTCCAA CAGAGCAAGT 300 

5 

AAATGAGTTG GTCGCTTTGA TCCCACACAG TGATCAGAGA TTGCGCCCTC AGCGAACTAA 360 

GCAATATGTC CTCCTCTCCA TCCTGCTTTG TCTCCTGGCA TCTGGTTTGG TGCTTTTCTT 420 
10 CCTOTTTCCG CATTCAC3TCC TTGTC5GATGA TGACGGCATC AAAGTGGTGA AAGTCACATT 



15 



25 



.35 



50 



{2) INFORMATION FOR SEQ ID NO: 208: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 872 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
55 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 

CAGTATTTCC CTCACTACTG TAAGCAAAAG TGGTATGTTT TTCTTTCTTT ATCTCTACTC 

60 



480 



TAATAAGCAA GACTCCCTTG TAATTCTCAC CATCATGGCC ACCCTGAAAA TCAQGAACTC 540 

CAACTTCTAC ACGGTGGCAG TGACCAGCCT GTCCAGCCAG ATTCAGTACA TGAACACAGT 600 

GOTGAATTTT ACCGQGAAGG CCGAGATCGG AGGACCGTTT TCCTATGTGT ACTTCTrCTG 660 

CACGGTACCT GAGATCCTGG TGCACAACAT AGTGATCTTC ATGCGAACTT CAGTGAAGAT 720 

20 TTCATACATT GGCCTCATGA CCCAGAGCTC CTTGGAGACA CATCACTATG TGGATTGTGG 780 

- AGGAAATTCC ACAGCTATTT AACAACTGCT ATTGGTTCTT CCACACAGCG CCTGTAGAAG 840 

AGAGCACAGC ATATOTTCCC AAGGCCTGAG TTCTGGACCT ACCCCCACGT GGTGTAAGCA 900 

GAGGAGGAAT TGGrTTCACTT AACTCCCAGC AAACATCCTC CTGCCACTTA QGAGGAAACA 960 

CCTCCCTATC GTACCATTTA TGTTTCTCAG AACCAGCAGA ATCAGTGCCT AGCCTGTGCC 1020 

30 CAGCAAATAG TTOGCACTCA ATAAAGATTT GCAGAATTTA ATACAGATCT TTTCAGCTGT 1080 

TCTTAGGGCA TTATAAATGG AAATCATAAC GTGGTTCTAG GTTATCAAAG CATGGAGTGA 1140 

TCTGGAGCTA GGATTGTCAG TGACCTGCAG GCCATTATCA GTGCCTCATC TGTGCAGAAG 1200 

TCGCAGCAGA GAGGGACCAT CCAAATACCT AAGAGAAAAC AGACCTAGTC AGGATATGAA 1260 

TTTCTTTCAG CTOTTCCCAA AGGCCTGGGA GCTTTTTGAA AAGAAAGAAA AAAGTGTGTT 1320 

40 GGCmnTT TTTTTTAGAA AGTTAGAATT GTTTTTACCA AGAGTCTATG TGGGGCTTGA 1380 

TTCACCCTTC ATCCATTCGC TGGAACATGG ATTGGGGATT TGATAGAAAA ATAAACCCTG 1440 

cttttgattc AAAAAAAAAA AAAAAAWAAA AAAAACTCGA 1480 

45 
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TCrCCTCTGT GGCCTTCTGG TCTACCCCTC TCTTCCTAGC CATTCAGTCT CTCTAGTCAC 120 

CTCCCTAGTA CXH'AGTGCTC TCTAAGTTTT TATTTAATTA GAACAACTCC ATTTCCATTT 180 

CAAGGTAGCT CAATCGGGGG AAAAGCCTCA TGATTTAAAC TGAAGTTAAC AACACAGCTT 240 

TTAAAATCAA AACTCATACT CCAACTTCTA AAGTATATTT GAC3CTGATTT GTTTCCAAAA 300 
CAAAGATATG CICTACCTAA AACTC5CTAAA ACAAAAATAT AAAGACAAGG ACTAGGTGAT ' 360 
TAAGC3GGAGA GAAAAATCAT YTCTTTTCCA GGAAACXTTT QCTAAAATAA GCAAAACTTG ■ 420 

AWrCTATCCT TCATGGAAAC TGACACAAAG AAAAGAAACT GATGGATTGC ACAGGCCTTG 480 



15 TTATAGAAAT AGATCTATAA AAAGATCTGT CCACAGGAAA TATACACCTT CTCCTGGTTC 540 



TCAACTTCAA TGGGGATTTG TCACCTAGGT CTCCATCTAT AGGAATACCT TCACATACCT 600 

ATCTATTCAT GCACATATTC TGAAAACAGG TACATACAAA ATTACAACAA AGGAAAAAAA 660 

TTCTATTX3AA CACTTAAAAA TAGAAACAGG CCAGGCACGG TGGCTCATGC TGTAATCCCA 720 

ACAATTTOGG AGGCTGAGGC TGGTGGATCA CCTGAGGTCA GGAGTGTGAG ACCAGCTTGG 780 

25 CCAACATGGT GAAACCCXX3T CACTACTAAA AATACAAAAA AAATTAGCCT GTGTGGTGQC 840 

ACACTCNTAC AATCCNGGCT GACTCGGGAA AN 872 



(2) INFOPMATION FOR SEQ ID NO: 209: 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LE^K3TH: 1779 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEBNESS : double . 

(D) TOPOIiOGY: linear 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 

AATTGCCAAG ACTGCACAAA ATTACAGTGC TAATGTATAT GGTTGCAGTT CACATAAAGA 60 

CAAAAGCATC TGTTATGAAA TGAGTAGTAA TATTQGGTGG TOSavrTTGrrT CTTAGCAGAC 120 

TTCGCTTCAT WTTGGTCTTC AGATAAAATG GCCAGCATAA ATGCTGTTTA TATTCACGTT 180 

TTCCTAGGTG TOTCTCTGCA GGCCACAGCA GCATGCCCTT GGTGTAGTCA GTGCCGAAAS 240 

50 GGGTCTOTTC CTTCTTCAGC CTGCCTGCAG GGATGGTCTC CmTAAAGC AGGTTGTCTG 300 

CAGCATTCAG TACACTGAAG GTAAGCTAAA CCATCAACAT CTCTGGTGTT TTAAGATGTT 360 

ATTTTATTCG AACAACTCAC AAATGAGGGA TGTTAGCTTT GTCGCAGAAT TCCCTGCATG 420 

TOTGATAACT GATCTTOTTT TATTTTTrOG CATTGCAACT GTGGCATAGT TACAATTTCT 480 

GTTIGKTCAT CACATTTAAA ATTGGRAGAG AACGCGCTTG AKGGATAGAG CGCCTTCAGK 540 

60 CTTACIGTITC TTATTAACTT TACTTTTrTT AAATCAACTT GCTATAGACT TTATATACAT 600 
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TTTGTTAAAT ATAGTTCCTA GOXSACATAGA AACGATC5CGT AGTTTTCATT TACTAATTAC 660 

AAATOTIGAG GOCTAATTCT GAAAGTCCTC ATATTTAAAG GCTAGACAAC C3TAATGAAAT 720 

TTTTAACTAT TTGTATGTCA TTTTGAAAGT GTACTGCTTT ATGGTAAAAG TGTTTTTCAT 780 

TTCTTCATTG TTTTCATTAT TTGTGATCAT GTTGTCTTTC AATACAQGCA TAAACCTTCC 840 

10 ACTCTTGAAC AAAGCAGCTG CTTTTTAAAA GCGGTAATTG CTTCTTTACC TTTTATTTCT 900 
TTTCTAAATG AAGCTTTTCT TTAAGAATGT GACTTTAAAG TGTTGTCTAT TGCATAAAAC 960 

ACTTGACACT CACTTATTGT AAACTGAAGA TTGTTCTACT GCATGTGAAG TGGACCATGC 1020 

15 

AGATTTCTGT ATOTTCTCAG TATGCATCAC TAGATAATAA AGTCTTTTGT GAACAAGGCA 1080 

TTICTAGCCA TTTTTAAAAG TTTTTCTCTT CAGTGCTQGT AAGTCAQGTA AACCATAAAT 1140 

20 AOTTAAAAGC AACCTTTTGT TTTTTTCCTG AAAGTTTTTA ATTGAAAGTA TTATTAGTTA 1200 

AAGATGTAAA CCTAGCCAAA ATTACCAGTT TATTAATAAT TAGGATCCTA ATTATTTCAA 1260 

AAAATCCTAC AAATATTGTC AGCTTTCAGT GTAGTCAGAT TATTCCTGTA GGTTATGGGG 1320 

TATAATTCAG GATTTAACTA ATGTTTCTGC TATrTTTCTCA CTTTTCCTTT TGATGGTGCG 1380 

GAAAGAGAAA AftCGAAAACG GGGCACAGGC CATTCGACGC CTTCTCCAAG GGGTCTGATT 1440 

30 TCCTCAGACA CCAGCTTCAC CTTCTTAACA AGGCACCTAA TTACAACAAG CATGCACATT 1500 

nGGTGCATT CAAGAATXSGA. AAATCAGAAT AGCAGCATTG ATTCTTCTGG TGCAGCTCAG J.560 

TGGAAGATGA TGACAACCAG AAGACATGAG CTAAGGGTAA GGGACTGTTC TGAAGAACCT 1620 

TTCCATTTAG TGATCAAGAT ATGGAAGCTG ATTTCTGAAA ATGCTCAGTC TGTACTCTAA 1680 

TTATTTATGG TACCATTTGA ATTGTAACTT GCATTTTAGC AGTGCATGTT TCTAATTGAC 1740 

40 TTACTQGGAA ACTGAATAAA ATATGCCTCT TATTATCAA 1779 



25 



35 



45 <2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2110 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 
55 GCGGCCGCTG CAGCCCGGAG CTGAGCTAGC CGTCCGAGCC GAGCCGTCCG AGCCQGGGAA 60 
GCCGGCGCC?r GCTGCCGCTC GTGGCGGCCA GAGGAGAGGA GAGGCAGCAG CATGGCGAGT 120 
GTCCTCTCCC GACQCdTOG AAAGCGGTCC CTCCTQGGAS CCCGGGTGTT GGGACCCAGT 180 

60 
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GCCTCGGAGG GGCCTCQGCT GCCCCACCCT CGGPiCCCACT GCTAGAAGGG GCCGCTCCCC 240 

AGCCTTTCAC CACCTCTCAT GACACCCCCT GCCAGGAGCA GCCCAAGGAA GTCCTTAAGG 300 

CrcCCAGCAC CTCGGGCCTT CAGCAGGTGG CCTTTMAGCC TGGGCAGAAG GTTTATGTGT 360 

GGTACGGGGG TCARGAGTCC ACAGGACTGG TGGWGCAGCA CAGCTGGATG GAGGGTCAGG 420 
TCACCGTCTG GCK5CTGGAG CAGAAGCTGC AGGTCTGCTG CAGGGTQGAG GAGGTGTGGC - 480 
TGGCAGAGCT GCAGQGCCCC TGTCCCCAQG CACCACCXCT GGAGCCCGGA GCCCAGGCCX: 
TXXXrCTACAG GCCCGTCTCC AGGAACATCX3 ATGTCCCAAA GAGGAAGTCG GACGCATQGA 

15 AATCGATCAG ATCATGGCGG CCATCGTGCT GACGTCCCTG TCCTGCAGCC CTGTTGTACA 660 

GAGTCCTCCC GGGACCGAGG CCAACTTCTC TGCTTCCCGT GCGGCCTGCG ACCCATGGAA 720 

GGAGAGTOGT GACATCTCGG ACAGCXSGCAN CAGCACTACC AGCGGTCACT GGAGTGGGAG 780 
CAGTGCyiGTC TCCACCCCCT CGCCCXXXTCA CCCCCAGGCC AGCCXTCAAGT ATTTGGGGGA 
TCCTTTIGGT TCTCCCCAAA CTGATCATGG CTTTGAGACC GATCCTGACC CTTTCCTGCT 
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25 QGACGAACCA GCTCCACGAA AAAGAAAGAA CTCTGTGAAG GTGATGTACA AGTGCCTGTG 960 



GCCAAACTOT GGCAAAGTTC TGCGCTCCAT TGTOSGCATC AAACX3ACACG TCAAAGCCCT X020 

CCATCTGGGG GACACAGTGG ACTCTGATCA GTTCAAGCGG GAGGAGGATT TCTACTACAC 1080 

AGAGGTGCAG CTCAAGGAGG AATCTGCTGC TGCTGCTGCT GCTGCTGCCG CAGACCCCXIA 1140 

GTCCCIGQGA CTCCCACCTC CGAGCCAGCT CCCACCCCCA GCATGACTGG CCTGCCTCTG 1200 

35 TCaXXrrCTTC CACCACCTCT GCACAAAGCC CAGTCCTCCG GCCCAGAACA TCCTGGCCCG 1260 

GAGTCCTCCC TCCCCTCAGG GGCTCTCAGC AAGTCAGCTC CTGGGTCCTT CTGGCACATT 1320 

CAGGCAGATC ATCCATACCA GGCTCTCCCA TCCTTCCAGA TCCCAGTCTC ACCACACATC 1380 

TACACCAOTG TCAQCTCGQC TOCTGCCCCC TCCGCCGCCT GCTCTCTMTC TCCGGTCCGG 1440 

AGCCGGTCGC TAAGCTTCAG CGAAGCCCCA GCAGCCAGCA CCTOCGATCA AATCTCATCT 1500 

45 GATCGTCACT TCTCCACCCC GGGCCCAGAG TCGTCCCAGG AAAGCCCGAG GQGAGGCTAA 1560 

GAACTGCCGC AAGTCTATGG CATOGAGCAC CGGGACCAGT GGTGCACGGC CTGCCGGTCG 1620 

AAGAAGGCCT GCCAGCGCTT TCTCGACTGA GdOTGCTGC AGGTTCTACT CrGTTCCrQG 1680 

CCCIGCCGGC AGCCACTCAC AAGAGGCCAG TGTGTCACCA GCCCTCAGCA GAAACCGAAA 1740 

GAGAAAGAAC GGAAACACGG AGTTTQGGCT CTCTrGGCTA AGGTCTAACA CTTAAAGCAA 1800 

55 TmCTCCCA TTCTCCGAAC ATTTTATTTT TTAAAAAAAA GAAACAAAAA TATTTTTCCC 1860 

CCTAAAATAG GAGAGAGCCA AAACTGACCA AGGCTATTCA GCAGTGAACC AGTCACCAAA 1920 

GAATTAATTA CCCTCCGTTT CCCACATCCC CACTCTCTAG QGGATTAGCT TCTGCGTCTC 1980 

60 
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AAAAGAAGGA WirAGCTCGTT CTGCTTCCTG CTGAGTCGGT GAATTCTTTG CTTTCTAAAC 2040 
TCTTCCAGAA AGGACTCTCA GCAAGATGAA TTTACrmC TTAAAAAAAA AAAAAAAAAA 2100 
5 AAAAACTCGA 
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10 (2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 938 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDECa^ESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 
20 GGCACAGGAA AAAAAAGAAA AAAGAAAAAA GAAAAAAGTT TTTGTAOCCA CAGATTAGCA 60 

TTTTCTTCAT CTTTGAAAAA AGTTTAAGCT ATGTCCTAAT TTAAAAATGA GCACAAACTA 120 

CTTAACAGAT GTCTCTTCCC TCTTCTCTTA CTTAAATTAT CTTTATTTTC ACCATCACCT 180 

CCCAGTGCCG AACACCTGAN CTCTGTGTTT TGTGGTTGGA TCCTGGGTTG CCAAGTTCCT 240 

ATTTQGTCAG TCCCK5GCCT GTGGGGCGGT CTCAGGAAGT GGCATGCTCT TCAMGRAGGA 300 

30 TCGTTCATYT CCACTATAAC CAWrTTGTTA ATAATAGTTG ATAATTCCCA GCTTTTACCA 360 

GATGARTTTT GACTTATTTT TCCTCCTTTG ACCTQTTCAA AGCTAACATA TCTCGGTCAG 420 

TTCGGAGAGG GTGGQGGATT TGAGAATGTG AGGAGGAGTG GGGTTAGAAT GGGTTTQCCT 480 

ATCTCGGCAA GGAAAGAGTT CCTAGTCGAT TGGGCACAAT GACAAAATGA TTCCATGGAT 540 

AGAATCGTCC CATOTTCCTG GAACACCTCA CGTGTTGTGA ACGCCTTAAA TTCCTGCCAT 600 

40 CCCTTCTCTG ATTCCCCACC TCCCTGTAGT TTCCACAGGA TTTATCTCTC TGTACCCCCG 660 

TCCTCCAACT CTACTCTCTC AGCCTCTCCT CCATCCCITA CTTCCCTTCT AAATTCCAGG 720 

AGATGACCTC ACITIGCAAA GCAAATTGGA GCCACCAAAT TGTAGCTCTC CTCQGTQGAA 780 
ACTGCATCTG TGCTCATCCC TGCACCTTCT TGCAGAAAGC CGCCCCCTCA QGCCAAGAOG 
. A!C?rGCCTGGC CCCCATOGGA GACTCAGACA CTTTGACCCC TrGTGACTTC AGCATCTCCC 
50 TdTTAAAGA TTCTCTCCCA ACATTCAGTC GTGCTOGA ^38 
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55 (2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1551 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRAND(EDNESS: double 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 212: 
AGGCIGGACT AAGCATAGAG AACCAGGAGA GAAAGAAAGA TTTAAGAGAC TG^GTAATAT 50 

TTinGftCAG ATCATTTAAG AAACTGAGTA ATTTTTTTTT TCTCCAAAAG aXATGGGTT 120 
ITr m iTgr ITlUn T T T T CTCTATTTGG CACTTTCTAG GGATTGGTCT ATAAAi-i-i-rr • 130 

TCAAAGATCA TAGGATAAAT TTCTTTGTAG CAACTTCCTA TTTTAGTGTT TATGTTAGGG 240 

GARCCCCARG TGTCCdGCT GATACGCCAT TAGGGCCACT TCTCAGCCTC TGGCTACATC 300 

15 ATAATCCTTT imTCTATC TTGCCAAAGT TTCCMGAAAA TTKAKGTTTT CTAATTTTAA 350 

AAAAATTCGT TGTGGAGATG GGATGGGACC TCTTTATAAG CCCTGAAAAT AAGTGATTTN 420 

TITTAAOTGC TATTCTGCTA TAAACCTGAT TCTCACTTTT TTCTGTAGAC AACiJGTTTTT 430 

TATAATATAT CTATTTTOTG TGGACATTAT TTCCTTTTAA CCAATACTGA AATTCCATAG 540 

TOTAWACTTT CTCCACATTT TCTTTGATTA ATACTTYCTT AAAATAGACA CTTGGATTGG 630 

25 CACCAGCTOT CACCAATAAA GCTGCCCTGA ACATTGTCAA TCAATCCTGT TAACCAATTT 650 

GAGAATTTTT CTGGAATOCT TAGTTAGGGA TGAAATTGCT QGGrTTATAGG TATG5CTATG 720 

CTTGATATAC TTTTCTCCAG AATGTCTACA CCTGrGTCTA CACCACATCT CCAGAGATAG 730 
GQGAATCTTA TGTCCCTGCT AACTGCTCTC GTTATTTAAT TTTCTGACAT TTGCCGCCGC 840 
CGCCGCCCCC TGCCCCCAAC ACACACATGG TATAAAGTGG TAGTTTCTTG TTTTAAATTG 900 
35 AACTTTTCAA TGATTTGAAT TTGGGCATTT CTTTGTATCC TGAGTTATTT TCGTTTCCCG 950 

TTATCTGAAT ATCCTTTTCC TATGCTTTAA CTACTTTTCT AATTTGTCCC TTTTTTOGGT 1020 

TATCAAATTC CAGGCCATTG TCTATTCCAT OTrCACTTTT GGGTATTGGA AACATCTTTC 1030 

CATTCTOTAG CCTCTCTGTT GAACATAAAT CTTGATTTTT ATGTAATCAG ATTTTTCTCC 1140 

TTACGGTTAT GTTCTTGGAA TTTTATTTAA GAAATCTTTT TCTATCCTGA GACCACAAAA 1200 

45 ATGTCCCCAC CATTTTCTTC TGTTTCATAG TTTTGCCTTG TATGITTAAT CCTTTAAGGC 1250 

ATCTCTACyrT CATTTTATAT GGTGTOAAAT AGTTCTTATT CATTTATTCA ACACATATTG 1320 

GTOGfiCTSCC TGCTCATGGT AGTACTCTTC AGAGTACTTT GTATATATTT GTGAACACAT 1330 

ATTCTIGCCC TCGAAGCTTA TGTTQTCNTT CAAGt3TAGAT CCOTACTCGG TTTCCACCTG 1440 

TirrCTTCAG CCCTCAGGAT GAATTCCACA ATTTTACACA TAGCACCAGT TAAGGAATAG 1500 

55 GCTTTATTCG AGAAAAGGAA GGCITATTAG ACCAGCATCA GCAAAAAAAA A 1551 

60 (2) INFORMATION FOR SEQ ID NO: 213: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 997 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID ::0: 213: 

10 AGAGAGTCCT CAACAGAACC TAATCATGCT GGCACCCT.iA TCTC^.r^rTT CTAGCCTCCA 60 

GAACTGAGAG AACATAAACT CCAGTTGrTT AAGCTACCC\ GJCrArGGTA TTTGrTATTA 120 

TAGCCCAAGC TAAGTCAGGT GGAAAGGCAG AAATATTTTG ACSAAGAPrC.-. rTTCTAC-.iA 130 

AACAGACyiTG TTCTAAATGA AATGGCCAGA TATTTCATC? TCTTCATACT AGTATTTArO 240 

AAAGTTTCAT TAAACACCAC TTGGCCAGCA CCCAGGCCTG CCACCCTCAG AACGGCAAAC 300 

20 AAAAGCAAAT GATTTGAGGA ACAAAAGAGT GGACACAGAG CTTCTCAGAA GATGGCTCCA 360 

TCTTCTGAGA TCATCTTCTG AGATCATCAA TITrCTGCAC CTGArGTCCT irrrCCAATtG 420 

TAGTAGATAA GAGCAAAGAC ACTTCCTGAT CCTGTGGAAA ATC-CTGGAGC CCTGCTGArG 430 

GAGAGGCTGA CACTGGGACC AACAGAAGGC CC-GACATTTA 'TnC-CTC-CAG CCCTTCTGZA 540 

CCTGGGCCCT CTTCAGGCCT TGTACCTTGC ACTCCCCATS CCAJC^T^in'AGC AXICTGGTAAG 600 

30' CTGAAGTTAG GTATTTGAAG AGATAATTTG CCCCCAACAA AGAATTACTr AAAAGAA.AAA 560 

GGAAACCACT AAATTCCACT TGACAAACCA GTTT3TTCA3 'rrrTGACTTT TGCAAATTirG 720 

AAACTTTCTC TTTGGCACCA TATGATTCTG TTACATTAIrG GCTCArCAAT GCTAAGAIAC 730 

ACAGCTAGGT CTACCAGCTG CCAGTGGTCA AGAATGAA^I^ AACCTCTCAG AGAGAGATZA 840 

GITTCTAATA ACCTAACAGT TTTCCTTGGS TATTACMAAA AJ^^J^-JL^AAA ITAGAATAAA 900 

40 ATCTCAGTGC CATGCAGGCA AGTACAGATA TGGAAJVTGAA AISCTCTG^CT ACAACTGCAA 



960 



GATT T GTTTG TTAATAAAAT TGATTGGGAT CACTCGA 997 



(2) INFORMATION FOR SEQ ID NO: 214: 



(i) SEQUQICE CHARACTERISTICS: 
50 (A) LENGTH: 1496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

55 (xi) SEQUENCE DESCRIPTIC»I: SEQ ID 130: 214: 

GAATTCGGCA CGAGTCACCA CAGATATCTT TGGCTTTCAG CCTC:y:CACA ATGCTGTCCA 60 
CTATGmTTT TTTAATCGAT TGACATCTCA TGAATCCAC^ AATTTAGCCG CrCTTCCATC 120 
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TTTTCCATCi TTCTCATAGC TTCATCACC5C ACGATGGAGG TCACTTCAGC ACTATCCGGA 180 

GCGGCGTCAC GGACAGATCR GTGAATTTCC TTTTCCTTTT TCTTGATGTA CCCSGATTGTC 240 
5 GACTCG1TAA CATTCAGCTC ATGGCCAACA GCACTGTAAC TCATGCCTGA TTGGAGCTTA ' 300 

TCCAACACGC QGAMTTTCTC CGTAAGGSAM ATCAMGGTCT TCTTTCC5CTT ACGAACACTG 360 
GGCARARCTT AARCACTACX3 CTTGGGGGCC ATTTTAGAAA GCAAAACCAC CCACAAAAftG - 420 

10 

CAGAAAAAAA ACyrGTCAGTA AACAC5ACTGN NGANftGGACT CTTTGTTTAC AGCACAGGAG 480 

CTGCGACTAG AAGGCGGCCSC TTCTCCCCAG TTCAAACTTC AGCTGGGAAC CTTACCTCCG 540 

15 . CCAACTCCAA ATTTTCACCC TCTCCGCATG CTCGGGAAAS AAACCCCCAG AACAGTACCG 600 

TGATGATTGA TTTTAGGGTT ACAAATACAT TTTAGCAAGT AAGTGAATTT GGCATTACGA 660 

ATTAATGATT AATCAAGGTC ACCTGTATTT CCATAGATAT GTAATTTTAT TTAAGCAGGT 720 

20 

TTATTATATT AAGGCX3GSGA GGCAGCGCCX3 AAGACTACAA GTTCCAGCAT GCACCGCGTC 780 
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CGGGCGGGTT CGGGCTCCCA GCGAGGGCTT CAGGGACGCC AGCCCGGAGG CATCGGCCGG 
25 AAGTGTCGTA GGGCAACCAC GTAGTACTCT CTGCGCATGT GCAAAGCGCT GTCGGGGGCC 

GCCCTAC3CTG CCGTCGCCGC CGCCQGGGCT CTATGGTCTC TCCCTAGAGC TTTGCCGTTG 960 

GAGGCGGCTG CTGCGGTCTT GTGAGTTTGA CXIAGCGTCGA GCGGCAGCAA CATGGAGGAA 1020 

30 

TTCGACTCCG AAGACTTCTC TACGTCGGAG GAGGACGAGG ACTACGTGCC GTCGGGTGAG 1080 

CGATTCCGCC TGAGGCGAGA AGCGAATTGC CCCGCCCCAC GCCTCACX?rG AGGCXXTGCTC 1140 

35 TGCCCCCGCG GGCGTCIGCC CTGTGGCCCA GGTGGTCCAS GGGGGCTCCT GTTCTCGAGC 1200 

GTCCGCTCCC TCAQGCCCCT CATQ:nx:GGC CGCTCCX3GCC CGAGGCGTGT GCGCGTGGCG 1260 

GTTCTCTCCT CCXrCTCCXGT TQGGCAGCTC CGGCCGCCGC CCCCTCTTGC AGCGCGGGAA 1320 

40 

CGGCACATGG ACACXSGCCCX: TTGTCGCTAG GGAaSCTCGT CGGTCAGCCC CGAACGACAA 1380 

CGC1GCTTCA GAAGTCGGGG CGGCAGTTCG AGCCTTGGAA GTTTTTTTCA GCCCTGGCCC 1440 

45 GAGAGAGCTG CTGGCCAACA ACCCGTCCAA GATAGAGCTG TCCGNTCTCC GNCTGG 1496 



50 (2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1308 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 



60 TTCGCANCNG QGAGAGGGAA AGAGGAGGAA ATGGGGTTTG AGGACCATGG CTTACCTTTC 
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(2) INFORMATION FOR SEQ ID NO: 216: 



468 



CTCCCTTTGA CCCATCACAC CCXMTTCCT CTTCTTTCCC TCTCCCCGCT GCCAAAAAAA 120 

AAAAAAAAGG AAACGTTTAT CATGAATCAA CAGGGTTTCA C3TCCTTATCA AAGAGAGA1G 180 

5 

TQGAAAGAGC TAAAGAAACC ACCCmOTT CCCAACTCCA CTTTACCCAT ATTTTATGCA 240 

ACACAAACAC TCTCCTTTTO GGTCCCTTTC TTACAGATGG ACCTCTTGAG AAGAATTATC 300 

10 GTATTCCACG TTTTTAGCCC TCAGGTTACC AAGATAAATA TATGTATATA TAACCTTTAT 360 

TATTGCTATA TCTTTGTGGA TAATACATTC AGGTGGTGCT GGGTGATTTA TTATAATCTG 420 

AACCTAGGTA TATCCTTTGG TCTTCCACAG TCATGTTGAG GTGGGCTCCC TGGTATGGTA 480 

AAAAGCCAGG TATAATCTAA CTTCACCCCA GCCTTTGTAC TAAGCTCTTG ATAGTGGATA 540 . 

TACTCTTTTA AGTTTAGCCC CAATATAGGG TAATGGAAAT TTCCTGCCCT CTQGGTTCCC 600 

20 CATTTTTACT ATTAAGAAGA CCAGTGATAA TTTAATAATG CCACCAACTC TGGCTTAGTT 660 

AAGTX3AGAGT GTCAACTCTG TQGCAAGAGA GCCTCACACC TCACTAGGTG CAGAGAGCCC 720 

AGGCCTTATG TTAAAATCAT GCACTTGAAA AGCAAACCTT AATCTGCAAA GACAGCAGCA 780 

AGCATTATAC GGTCATCTTG AATGATCCCT TTGAAATTTT TTTTTTGTTT GTTTGTTTAA 840 

ATCAAGCCTG AQGCTGGTGA ACAGTAGCTA CACACCCATA TTGTGTGTTC TGTGAATGCT 900 

30 AGCTCTCTTG AATTTCGATA TTGGrrTATTT TTTATAGAGT GTAAACCAAG TTTTATATTC 960 

TGCAATCCGA ACAGGTACCT ATCTGTTTCT AAATAAAACT GTTTACATTC ATTATGGQGT 1020 

ATCTATGACC TTCATTTTCC AAGAAATAGA ACTCTAGCTT AGAATTATGG ATGCTCTAAA 1080 

ATGTCAGAAT GGGAACTCTC CTCGAAGTTC TCCCAAACTC AGAGACAGCA CTGCCTTCTC 1140 

CTAAATGATT ATTCTTTTCT CCCTGTTTTC TGGTATTTTC TAGGCATCCT TCTCACCACA 1200 

40 QCXrATAACCC TTTTTTACTT CCATTAGGCC GTATAACTGG NGGGACNGCT GGTCGGTATA 1260 

TAATACTCGT WCCAACAMAG QGGTTCTGGA TGTACACMAG GTTATCIT 1308 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 1705 base pairs 

(B) TYPE: nucleic acid 
.(C) STRANDEDNESS: double 
<D) TOPOUJGY: linear 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

TCGCCATOGA AGCGCTAGAA GGTTTAGATT TTGAAACAGC AAAGAAGGAT TTCCTTGGAT 60 

CTOGAGACCC CAAAGAAACA AAGATGCTAA TCACCAAACA GGCTGACTGG GCCAGAAATA 120 

60 
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TCAAGC5AGCC CAAAGCCXXX: GTGGAGATGT ACATCTCAGC AGGAGAGCAC GTCAAGGCCA 
TCGAGATCTG -TOGrGACCAT GGCTGGGTTG ACATGTTGAT CGACATCGCC CGCAAACTGG ' 
ACAAGGCTGA GCGCGAGCCC CTCCTGCTGT GCGCTACCTA CCTCAAGAAG CTGGACAGCC 
CrGGCTAlGC TGCTGAGACC TACCTGAAGA TGGGTGACCT CAAGTCCCTG GTGCAGCTGC 
AGTGGAGACC CAGCX3CTCGG ATCAGGCCTT TGCTrTGGGT GAGAAGCATC CTGAGTTTAA 
GGATGACATC TACATGCCGT ATGCTCAGTG GCTAGCAGAG AACGATCGCT TTGAGGAAGC 
CCAGAAAGCG TTCCACAAGG CTGGGCGACA GAGAGAAGCX5 GTCCAGGTGC TGGAGCAGCT 
CACAAACAAT GCXX?rGGCX3G AGAGCAGGTT TAATGATGCT GCCTATTATT ACTGGATGCT 
GTCCATGCAG TGCCTCGATA TAGCTCAAGA TCCTQCCCAG AAGGACACAA TGCTTGGCAA 
GTTCTACCAC TTCCAGCGTT TCGCAGAGCT GTACCATGGT TACCATGCCA TCCATCGCCA 
CACGGAAGAT COTrTCAGTG TCCATCGTCC TGAAACTCTT TTCAACATCT CCAGGTTCCT 
GCTGCACAGC CTGCCCAAGG ACACCCCCTC QGGCATCTCT AAAGTGAAAA TACTCTTCAC 
CTTQGCrAAG CAGAGCAAGG CCCTCGGTGC CTACAGGCTG GCCCGGCACG CCTATGACAA 
GCTCCXyiGGC CTCTACATCC CTGCCAGATT CCAAAAGTCC ATTGAGCTGG GTACCCTGAC 
CATCCGCSCC AAGCCCTTCC ACGACAGTCA GGAGTTGGTG CrCTTGrGCT ACCGCTGCTC 
CACCAACAAC CCGCTGCTCA ACAACCTGGG CAACGTCTGC ATCAACTGCC GCCAGCCCTT 
CATCTTCTCC GCCTCTTCCT ACGACGTGCT ACACCTGGTT GAGTTCTACC TGGAGGAAGG 
GATCACTGAT GAAGAAGCCA TCTCCCTCAT CGACCTGGAG GTGCTGAGAC CCAAGCGGGA 
TGACAGACAG CTAGAGATTT GCAAACAACA GCTCCCAGAT TCTTGCGGCT AGTGGGAGAC 
CAAGQGACTC CATCGGAGAT NAGGACXrGT TCACAGCTAA QCTRAGCTIT GAGCAAGGTG 
GCTCARAGTT CXSIGCCAGTC GTGGTGAGCC GGCTGGTOCT GCX3CTCCATG AGCCGCCGGG 
ATCTCCTCAT CAAGCGATCG CCCCX:ACCCC TCAGGTGGCA ATACTTCCGC TCACTGCTGC 
CTCACGCCrc CATTACCATG TGCCCCTCCT GCTTCCAGAT GTTCCATTCT GAGGACTATG 
AGTTCCTCGT GCTTCAGCAT GGCTGCTGCC CCTACTGCCG CAGGTGCAAG GATGACCCTG 

gcccatcacc agcatcctog ggacggcctc caccctctgc cxxsctttggg gtctgctggg 

CTC1X5AAGGA GAATAAAGAG TTAAACTGTC AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA AAANA 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1705 
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(2) INFORMATION TOR SEQ ID NO: 217: 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

5, 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 

AGCAAATCAC CTTAACGATC TGGAATGAAA CTGTGACCAG TGCCGCCCTG GGTGGTTCTG 60 

10 GAGAGACTGC CGTCTTCTTG TTTGGCCATA GGTGCTGGGG CCCCGGCTTC AGTCACTGTC 120 

TCAGACAGKA GTCCCGATAA GCAGATCACC AGTCCTCCAC TGTCCTTCCT GTCGGCCTTG 180 

CTCCATGAGA AGATAGCTGC TTCCTCCCTC TTTTCCTACA CTGTAAATTA TTGTTTTACA 240 

ATTGAOTGYC TTAATAATAG TYTACAAATA CTATGTATTT ATGCAAAACT GTTAAAGTTC 300 

TCATCTGTTA TCATTGGATA CTTGGTCTTG TCAGTAGTGG TCAGCATTG6 GTTGTGAGCT 360 

20 TCmrCTACTC CATACGTCTT TATCCTGCTA TGCATTTTAC ATTGTGTGTT CACATCTATT 420 

CCAAGGAGCC TTCCTAGAAA CAACACTC3GC GGTTCCTGCA GGCCAGGCAG GCATTGGCCC 480 

AIGCTGTOTC CCATAGGAGC CAATGGAAAG AACGTAGCTT GGTCTGCTAG CCAGCCC?rGG 540 

QGIGGCGCAG GCCAGGCAGC CICTOCACCA GAGTCCAGCA CCTGCCCATT CCCCAGTCAC 600 

ACAATCATAC TCTTCTTTCA TAGAGATTTT ATTACCACCT AGACCACCCT AGTTTTCCTC 660 

30 TCTGTTAOTG TCCTGAGCTC TTTTGCAACA AAATGTAGGT ACAGACCAAT CCCTGTCCCT 720 

TCCCCAATCA GGAGCTCCAC ACCATGftCTT GTTTGGTTTT CCAGAAGCTG CCAGTGGGTT 780 

CCCGTGAATT GCOrTAAGAT ATCGATGATK TTTTTTATTG TTITrCTTCT TGTTTTTTrA 840 

AATAATATAT TTAAAGGCAG TATCTTTTGT ACTGTGAATT TGCAGTAGAA GATGCAGAAT 900 

GCAcnrrrr tttacttctg ttcgtgtgta ttgtatatag tgtgtgtgct tcttgtgatg 960 

40 AAAATAAACT TTTTCTTTAT AAAAAAAAAA AAAAAAAAC 999 



25 



35 



45 (2) INFOKMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENCyPH: 941 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 
55 GGCACGAGTA GCATTTCATT TAATCTGCAG GTATATTCTC CCAACAGTTT ATTGTCATGT 60 
GATGTCCTCA GCCAAGATTG TRAGGCAGAG AGGAGCTGTC CCAACCTACT ATACCACCGA 120 
GGCIGGAGAG ATCATATTTT TQGTATTAAA CTGGAGTCTC TCCATCCTTC ACATTGTTGA 180 

60 
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TCTCCTCTGT AGCAAACCGG AAAAGTCAGT GACAGAAGAT GCCGCTAGCG GTTTGAGCCA 240 

GAGAATGACA GCTCTGGTTT GGAGAAAAGG GCCGGATGGT GGCTCTAGAA AGCCCATCCT 300 

TCTGCTCTTC TTTTTTCTCC CCCTTATATT GTGCTTTCAT TCATTCATTC ATTCATCAAA 360 

CATTTOTTGA GCACCTATTA TGTGTCAAGC TCTGTGCTAG CCTCTGGAAA ACCTGCCCTC 420 
ATC?rAGCTCA CTCTCGAGTA GGAGAAACAA TGACTACACT ATGATAAGCA CGGGTTGTCA ■ 480 

QGGTCTCACA GAGCACKSGC CCCTCATCCA GACCGATGAG GTCAAAGAAG GCATCCAGGC 540 

GAGGATGGTG TCAGAGCTAA CTGAAGAATG AGAGGGAGCT GCACCASCAG GGGTTGGAAC 600 

15 TGAAGGTGGC AGTGCCIGGA GTCTTGATTC CAGCAGAGGG AGAGCAGTCT GTGAAAAGGC 660 

ACCAAGGGTG GGAGAGGGCA GAGCACATGG AGGAACTTCA GGTAGTTCTG GATGGCSCTG 720 

GGGCAAAGCT AGAGAGGTAA GAAGAATCTA CAAATGTTCC TCGAGTTACA TGAACTTCCA 780 

TCCCAATAAA CCCATTOGAA ACGAAAAATT TAAGTCAGAA GTGCATTTAA GGCTGGTCCG 840 
AGTAGAATCA TTTTTACAAC GAATTGATCA CAACCAGTTA CAGATGTCTT 



20 



45 



55 



900 



25 CCACTCCCAC TGClTCACXrr GACTAGCCTT TAAAAAAAAA A 941 



30 (2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 575 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: double. 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: 

40 TAA(?rGGAAT CCCCCGGQGT TGCAGGGAAT TCGGCACGAG GCATTCTGAG AAGCTTAAGA 60 

CATACTTTGA AGACAACCCT AGQGACCTCC AGCTGCTGCG GCATGACCTA CCTTTGCACC 120 

CCGCAGTOGT GAAGCCCCAC CTGGGCCATG TTCCTGACTA CCTGGTTCCT CCTGCTCTCC 180 

GTCGCCTOGT RCGCCCTCAC AAGAAGCGGA AGAAGCTGTC TrCCTCTTGT AGGAAGGCCA 240 

AGAGAGCAAA GTCCCAGAAC CCACTGOGCA GCTTCAAGCA CAAAGGAAAG AAATTCAGAC 300 

50 CCACAGCCAA GCCCTCCTGA GGTTGTTGGG CCTCTCTGGA GCTGAGCACA TTGTGGAGCA 360 

CAGGCTTACA CCCTTCGTGG ACAGGCGAGG CTCTGGTGCT TACTGCACAG CCTQAACAGA 420 

CAGT T CTXSGG GCCQGCAGTC CTGGGCCCTT TAGCTCCTTG GCACTTCCAA GCTGGCATCT 480 

TCCCCCTTCA CAACAGAATA AAAATTTTAG CTGCCCCAAA AAAAAAAAAA AAAAAAAAAA . 540 

CTCGAGGQGG GGCCCGTACC CAATTCGCCC TATAA 575 



60 
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(2) INFORMATION FOR SEQ ID NO: 220: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3018 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEia^ESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 

GCCAGCCTTA CAGGTTTTAC GTGAAATGAA AGCCATTGGA ATAGAACCCT CGCTTGCAAC 60 

15 ATATCACCAT ATTATTOGCC TGTTTGATCA ACCTGGAGAC CCTTTAAAGA GATCATCCTT 120 

CATCATTTAT GATATAATGA ATGAATTAAT GQGAAAGAGA TTTTCTCCAA AGGACCOGGA 180 

TGATGATAAG TTTTTTCAGT CAGCCATGAG CATATGCTCA TCTCTCAGAG ATCTAGAACT 240 

20 

TQCCTACCAA GTACATGGCC TTTTAAAAAC CGGAGACAAC TGGAAATTCA TTGGACCTGA 300 

TCAACATCGT AATTTCTATT ATTCCAAGTT CTTCGATTTG ATTTGTCTAA TGGAACAAAT 360 

25 TGATGTTACC TTGAAGTGGT ATGAGGACCT GATACCTTCA GCCTACTTTC CCCACTCCCA 420 

AACAATGATA CATCTTCTCC AAGCATTGGA TGTGGCCAAT CQGCTAGAAG TGATTCCTAA 480 

AATTTGG6AA AGATAGTAAA GAATATGGTC ATACTTTCCG CAGT6ACCTG AGAGAAGAGA 540 

30 

TCCTGATGCT CATGGCAAGG GACAAGCACC CACCAGAGCT TCAGGTGGCA TTTGCTGACT 600 

GTGCTGCTGA TATCAAATCT GCGTATGAAA GCCAACCCAT CAGACAGACT GCTCAGGATT 660 

35 GGCCAGCCAC CTCTCTCAAC TGTATAGCTA TCCTCTTTTT AAGGGCTGGG AGAACTCAGG 720 

AAGCCTQGAA AATGTTGGGG CTTTTCAGGA AGCATAATAA GATTCCTAGA AGTGAGTTGC 780 

TGAATGAGCT TATGGACAGT GCAAAAGTGT CTAACAGCCC TTCOCAGGCC ATTGAAGTAG 840 

40 

TAGAGCTGGC AAGTGCCTTC AGCTTACCTA TTTCTTGAGGG CCTCACCCAG AGAGTAATGA 900 

GTGATTTTGC AATCAACCAG GAACAAAAGG AAGCCCTAAG TAATCTAACT GCATTGACCA 960 

45 GTGACAGTGA TACTGACAGC AGCAGTGACA GCGACAGTGA CACCAGTGAA GGCAAATGAA 1020 

AGTGGAGATT CAGGAGCAGC AATGGTCTCA CCATAGCTGC TGGAATCACA CCTGAGAACT 1080 

GAGATATACC AATATTTAAC ATTGTTACAA AGAAGAAAAG ATACAGATTT GGTGAATTTG X140 

50 

TTACTGTGAG GTACAGTCAG TACACAGCTG ACTTATGTAG ATTTAAGCTG CTAATATGCT 1200 

ACTTAACCAT CTATTAATGC ACCATTAAAG GCTTAGCATT TAAGTAGCAA CATTGCGGTT 1260 

55 TTCAGACACA TQGTGAGGTC CATGGCTCTT GTCATCAGGA TAAGCCTGCA CACCTAGAGT 1320 

GTOGGTGAGC TGAOCTCACG ATGCTGTCCT CGTGCGATTG CCCTCTCCTG CTGCTGGACT 1380 

TCTGCCTTTG TTGGCCTGAT GTGCTGCTGT GATGCTGGTC CTTCATCTTA GGTGTTCATG 1440 

60 
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CAOTTCTAAC ACMTTGGGG TTGGGTCAAT J^STTTCCTAA TTTCAGGATA TTTCGATGTC 
AGAAATAACG CATCTTAGGA ATGACTAAAC AAGATAATGG CAGTTTAGQC TGCACAACTG 
GTAAAATCAC TGTAGATAAA TGTTGTAATT AGTGTACACG TTTGTATTTT TGTTAATATA 
GCCGCTGCCA TAGTTTTCTA ACTTCAACAG CXZATCAATGT TrCATGTCTC CCTTTrmT 
TTGTCTATAG CTGTTACCTA TTTTAGTGGT TGAAATGAGA GCTAGTGATG ACAGAAGGAT 
GTOGAATCTC TTCTTGACAT CATTGTCTAT TGCTGGTAAT CAAGTTGGTA ACGACTACTT 
CTAGCAGCTC TTACCACTAT GACTTAAGTG GTCCTGGAAG GCAGTAAGTG GAGGTTTGCA 
GCATTCCTGC CTTCATGAGG GCTTCTACCA CTGACCACTT TGCACGTACC TGGCTCOCAG 
ATTTACTTAG GTACCCCACG AGTCGTCCAC ATAAGCAGCT TCATCTTTAC CTTGCCAGAG 
TTCACAATTA TGGGATACTC TAGTCTACTT ATACTTGTGT TCCCATCTGT CTGCCATCCT 
CTCAAGGCCA GGACCCAGTC ATACATCCTT AGAAACCAAA GTATGGTTTT TGTTTTCTCT 
TGGAATCTCA GGTCTTAAGG CATTTAATTG AGGGACAAAA AAAAAAAAAA GCCGATATAG 
TAGCTAGCTA CTTAAGCATC CATGGGTATT GCTCCATATC AAAGCAGATT TGCAGGACAG 
AAAGAGTAAA TTAGCCTTCA GTCrTGGTTT ACAGCTTCCA AGGAGAGCCT TGGSCACCTG 
AAATOTTAAC TCGGTCCCTT CCTGTCTCTA GTTCATCAGC ACCTGCAGAT GCCTGACTCT 
TOTTAGCCTT ACTATTCAAT ACAGTCCTTA GATTCACGGT ATGCCTCTTC CTATCCAGGC 
ACCTATTCTG AATCACCATG TTGCTCTCCA GCTAGAGTTG ATAGGAGAAA ATCCATTTGG 
GTAGATGGCC TATGAATTTG TAGTAGACTT TCAAAATGAG TGATTTGTTA GCTTGGTACT 
TTTAACTrrG TQGTACAGAT CCTCCAAACC CATACTCTGA GCAATTAACT GCCTTGAACA 
TAGAGAAAAA TTAAQGCCTC ACAGGATGAG TCTCXATrCT CTGTAAATGC TTAITrTATC 
ATACTCTTTA GCCTCTAACT ATOAGTAAAA TGTTCTCTTC GGiCCXXXSTGT QGTGACTCAC 
ACCTCTAACC TCAGCACTTT GQGAQGCAGA GGTQGGAQGA TCACTTAGGT CCAGGAGTTC 
GAGACTAGCC TO3GCAACAT AGTGAGACAC CX3GATCTACA AAAAAATAAA AAGCCAGACT 
GGTGCTATCT ATCTCTOTCC CAGCTAATTG GGAGGGTGAG ATGQGAGGAT TGTTTGAGCC 
TAGGAGAGGG AQGT1X3CAGT GAGCCGTOAT CX3CACCACTG CACTCCAGCC TGGGCAACAG 
AGCAAGACCC TOTCTTQGAG AAACCAGAAT TTTQGAAGAG CAAATGQGGC TGAGTGCAGT 
QGCTCATGCC TGTAATOC 



1500 
1560 
1620 
1680 
•1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3018 



(2) INF0HMATIC3N FOR SEQ ID NO: 221: 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 968 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTIC^I: SBQ ID NO: 221: 
GGCACGAGGG CCGCGGGACA TCCACGGGGC GCGAGTGACA CGCX3GGAGGG AGAGCAGTGT 
TCTGCTGGAG CCGATGCCAA AAACCATGCA TTTCTTATTC AGATTCATTG TTTTCTTTTA 
TCTGTGGGGC CTTTTTACTG CTCAGAGACA AAAGAAAGAG GAGAGCACCG AAGAAGTGAA 
AATAGAAGTT TTGCATCGTC CAGAAAACTG CTCTAAGACA AGCAAGAAGG GAGACCTACT 
NAAATGCCCA TTATGACGGC TACCTGGCTA AAGACGGCTC GAAATTCTAC TGCAGCCGGA 
CACAAAATGA AGGCXrACCXTC AAATGGTTTG TTCTTGGTGT TGGGCAAGTC ATAAAAGGCT 
TAGACATTQC TATGACAGAT ATGTGCCCTG GAGAAAftGCG AAAAGTAGTT ATAGCCCCTT 
CATTTQCATA CQGAAAGGAA. GGCTATQCAG AAGGCAAGAT TCCACCGGAT GCTACATTGA 
TTTTTGAGAT TGAACTTTAT GCTGTGACCA AAGGACCACG GAGCATTGAG ACATTTAAAC 
AAATAGACAT GGACAATGAC AGGCAGCTCT CTAAAGCCGA GATAAACCTC TACTTGCAAA 
QGGAATTTGA AAAAGATGAG AAGCCACGTG ACAWSTCATA TCAGGATGCA GTTTTAGAAG 
ATATTTTTAA GAAGAATGAC CATGATGGTG ATGGCTTCAT TTCTCCCAAG GAATACAATG 
TATACCAACA CX3ATGAACTA TAGCATATTT GTATTTCTAC TTTTTTTTTT TAGCTATTTA 
CTGTACTTTA TGTATWAAAC AAAGTCMCTT TTCTCCMAGT TGKATTTGCT ATTTTTCCCC 
TATGAGAAGA TATTTTGATC TCCCCAATAC ATTGATTTTG GTATAATAAA TGTGAGGCTG 
TTTTGCAAAC TTAAAAAAAA ATTTAAAAAA ACTGGAGGGG QGCCCGTACC CAANTCGCCG 
NATATGAT 



(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEISIGTH: 1404 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: sisQ ID NO: 222: 
CGTTTTCCGG CCGTGCGTTT GTGQCCGTCC GGCCTCCCTG ACATGCAGCC CTCTGGACCC 
CGAGGTTGGA CCCTACTGTG ACACACCTAC CATGCGGACA CTCTTCAACC TCCTCTQGCT 



TGCCCTGGCC TGCAGCCCTG TTCACACTAC CCTGTCAAAG TCAGATGCCA AAAAAGOCGC 
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CTCAAAGACG CTGCTGGAGA AGAGTCAGTT TTCAGATAAG CCGGTGCAAG ACCGGGGTTT 240 

GGTGGTGACG GACCTCAAAG CTGAGAGTGT QGTTCTTGAG CATCGCAGCT ACTGCTCGGC 300 

AAAGGCCCGG GACAGACACT TTGCTGQGGA TGTACTGGGC TATGTCACTC CATGGAACAG 360 

CCATGGCTAC GATGTCACCA AGGTCTTTGG GAGCAAGTTC ACACAGATCT CACCCGTCIG 420 

GCTGCAGCTG AAGAGACGTG GCCGTGAGAT GTTTGAGGTC ACGGGCCTCC ACGACGTGGA - 480 

CCAAGGGTGG ATGCGAGCTG TCAGGAAGCA TGCCAAGGGC CTGCACTITAG TGCCTCGGCT 540 

CCTGTTTGAG GACTQGACTT ACGATGATTT CCGGAACX5TC TTAGACA3TG AGGATGAGAT 600 

15 AGAGGAGCTG AGCAAGACCG TGGTCCAGGT GGCAAAGAAC CAGCATTTCG ATGGCTTCGT 660 

GGTGGAQGTC TGGAACCAGC TGCTAAGCCA GAAGCGCGTG GGCCTCATCC ACATGCTCAC 720 

CCACTTGGCC GAGGCTCTGC ACXIAGGCCCG GCTGCTGGCC CTCCTGGTCA TCCCGCCTGC 780 

20 

CATCACCCCC GGGACCGACC AGCTGGGCAT GTTCACXX:AC AAGGAGTTTG AGCAGCTGGC 840 

CCCCGTGCTG GATGGTTTCA GCCTCATGAC CTACGACTAC TCTACAGCGC ATCAGCCTGG 900 

25 CCCTAATGCA CTCCTGTCCT GGGTTCGAGC CTGCXrrCCAG GTCCTGGACC CGAAGTCCAA 960 

GTGGCGAAGC AAAATCCTCC TGGGGCTCAA CTTCTATGGT ATGGACTACG CGACXrrcCAA 1020 

GGATGCCCX3T GAGCCTGTTG TCX3GGGCCAG GTACATCCAG ACACTGAAGG ACCACAGGCC 1080 

30 

CCGGATQGTG TGGGACAGCC AGGYCTCAGA GCACTTCTTC GAGTACAAGA AGAGCCGCAG 1140 

TGGGAGGCAC GTCGTCTTCT ACCCAACCCT GAAGTCCCTG CAGGTGCGGC TGGAGCTGGC 1200 

35 CCX3GGAGCTG GGCGTTGGGG TCTCTATCTG GGAGCTGGCC AGGGCCTGGA CTACTTCTAC 1260 

GACCTGCTCT AGGTOGGCAT TQCGGCCTCC GCGGTGGACX; TGTTCTTTTC TAAGCCATGG 1320 

AGTGAGTGAG CAGGTQTGAA ATACAGGCCT NCACTCCX3TT TGCTGTGAAA AAAAAAAAAA 1380 

AAAAAAAAAA AAAAAAAAAA AAAA 1404 



40 
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(2) INFORMATION FOR SEQ ID NO: 223: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 707 base pairs 
50 (B> TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 

CAGTCGACAC TAGTGGATCC AAAQAATTCG GCACGAGGGC AGGTCCAGGG 60 

GTCAGAAATC AGCTCTATTG ACGAATTCTG CCGCAAGTTC CGCCTGQACT GCCCGCTC3GC 120 

60 CATGGAGCGG ATCAAQGAGG ACCGGCOCAT CACCATCAAG GACGACAAGG GCAACCTCAA 180 
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CCGCTGCATC GCAGACGTGG TCTCGCTCTT C-.TCACGGTC Ar:XSrJC:----3C TGCGCCTGGA 240 

GATCCGCGCC ATGGATGAGA TCCi-GCCCGP. CCTGCGAGAG CT'SATS'^jSA CCArCSCACCG 300 

5 

CATGAGCCAC CTCCCACCCG fiCTTTWjCG CCSCCiGACG GTCAGCCSST GGCTGCAGAC 360 

CCTGAGCQGC ATGTCGGCGT CAG?,TGAGCT SS^rGACTCA CASSTGCtTTC AG.--rGCTGTT 420 

10 CGACCTGGAG TCAGCCTACA ACGCCTTCA;^. CCSCTTCCTG G^.TOCCTG?.^ CCCGGGGCAC 480 

TAGCCCTTGC ACAGAAGGGC AGAGTCTQAG GCGATGGCTC CTGGTCCCCT G7CC3CCACA 540 

CAGGCCGTGG TCATCCACAC AACTCACTGT CTGCAGCTGC CTGTCT'SirrG TCT'^CTTTG 600 

15 

GTGTCAGAAC TTTTGGGCCG GGCCCCTCCC Cl^^CAATAAAG ATGCTCTCCG ACCTTCAAAA 660 

AAAAAAAAAA AAAAACTCRG GGGGGGCCCG GCCCCAATCC CCCC^C: 707 

20 



(2) INFORMATION FOR SSQ ID NO: 224: 

25 (i) SEQUENCE CTARACTEPalSTICS: 

(A) LE^TGTH: 1334 base pairs 

(B) TY?B: nucleic acid 

(C) STPANDH3NESS: double 

(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCllPTICJ:: S3Q 13 y/j: 224: 
GGGGAACTGC AGTGACAGCA GGAGTAAGAG TG3GAC-GCftG GACAG^^-CtG GGiCACAGGT 60 
35 ATGGAGAGGG GGTTCAGCGA GCCr.AG;ySAG GGCAGACTAT CA3GG7CCCG GCGGTGAGAA 120 
TCCAGGGAGA GGAGCGGAAA CAGnAGAGGG GCAG^^AGACC GGGGCwTTG TGGTTTGCAG 180 
AGCCCCTCAG CCATGTTGGG AGCCAAGCC=. CACTGGCTAC CAGGTCCCCT ACACAGTCCC 240 

40 

GGGCTGCCCT TQGTrCTGGT GCTTCTGGCC CTGGGGQCCG GGTGGGCCCA GG«iSGGGTCA 300 
GAGCCCGTCC TGCTGGAGGG GGASTGCCTG GrGGTCTGTG AGCCTGGCCG AGCTGCTGCA 360 
45 GGGGGGCCCG GGGGAGCAGC CCTGGGftGAG GCaCCCCCTG GGCGftGTGGC ATTTGCTGCG 420 
GTCCGAAGCC AMCACCATGA GCC-JGCAGGG GAAACCC-GCA ATGGCACGAK TC-GC-GCCATC 480 
TACTTCGACC AGGTCCIGGT GAACGAGGGC GSrGGCTTTG ACCGGGCCTC TGGCTCCTTC 540 

50 

GTAGCCCCTG TCCQGGGrTGT CTACAGCTTC CGGTTCCATG TGGTGSiGGT GC.-rAACCQC 600 
CAAACTGTCC AGGT6AGCCT GATGCTGAAC ACGTGGCCTG TCATCTOJSC CTTTGCCAAT 660 
55 GATCCTGACG TGAOCOQGGA GGCMCCACC AGCTCTGTGC TACTGCCCTT GGSCCCTGGG 720 
GACCGAGTGT CTCTGCGCCT GCGrTCGGQGG AATCTACTGG GT0GT2SSAA A^TACTCAAGT 780 
TTCTCTQGCT TCCTCATCTT CCCTCTCTGA GGACCCAAGT YTITCAAGCA C-JiOAATCCA 840 
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C3CCX:CTGACA ACTTTCTTCT GOXTCTCTT GCCCCAGAAA CAGCAGAGGC AQGAGAGAGA 900 

CTCCCTCTGG YTCCTATCCC ACVTCTTTGC ATGGGAMCCT GTGCCAAACA CCCAAGTTTA 960 

AGARAARARY. ARARCTGWGG CAGC3TATACA GAGCTGGAAG TQGACCATGG AAAACATSGA 1020 

TAACCATGCA TCVTCTTGCT TGGCCACCTC CTGAAACTGT CCACCTTTGA AGTTTGAACT 1080 

TTAGTCCCTC CAMACTCTGA CTGCTGCCTC CTTCCTCCCA GCTCTCTCAC TGAGTTATVT - 1140 

TCACTGTACC TGTTCCAC5CA TATCCCCACT ATCTCTCTTT CTCCTGATCT GTGCTGTCTT 1200 

ATTCTCCTCC TTAGGCTTCC TATTACCTGG GATTCCATGA TTCATTCCTT CAGACCCTCT 1260 

15 CCTC3CCAGTA TGCTAAACCC TCCCTCTCTC TTTCTTATCC CGCTGTCCCA TTGGCCCAGC 1320 

CTGGATGAAT CTATCAATAA AACAACTAGA GAATGGTGGT CAAAAAAAAA AAAAAAAAAC 1380 

TCGA 1384 

20 



10 



25 



35 



55 



(2) INFORMATION FOR SEQ ID NO: 225: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 760 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
30 (D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 

GGGTCGACCC ACGCGTCCGC TGACCAGTCC GTTATAGATA CTTCTTCCTA TACCAAAACT 60 
GTTTAAACAG GTGCCACCAC AAQGGATGTC GTCCTTACTC TCTGCGQGTC TTCAAGCATC , 120 

CCTTTGTGGG AAARGTCTCT GGGCAAGCAC GTGGTATTTG GTCTGCTGCT TGCTTCCCTT 180 

40 TTTCCACCAG GGATGTTGTG ATCATAAGTC AAAACAACAG TATATTCCAA ATCTCAAAAG 240 

CTATTGTGGC CTGAC3CACAA TTGAAATCTA GCAGAGTTTT TCCTATGTAG CTTTAGAGTA 300 

ACTCTTCTGC TTCTCTGTCA CTTACAATTC AGGTTCTGCC TTTGCCTAAG AGCATGAGCA 360 

45 

GAAGftGTCCT CATGTGACGC TTAGTTCTAT TGCAGTCCTG GGTGAAACTA TTTAAGCWAT 420 

GGGGCTGCTK CTCCCCANWT CCTCCCTAAC AATTCGTTGT GTGGACTTCT CATCTAAAAG 480 

50 GTTAGTGGCT TTTGCTTQGG ATCAGTGCTC TCTATTGATG TTCTTGCTGG TCTCCAGACA 540 

CATTCCTGTT GCATTAAGAC TTGAAAGACT TGTAGATGTG TGATGTTCAG GCACAGGATG 600 

CTGAAAGCTA TGTTACTATT CTTAGTTTGT AAATTGTCCT TTTGATACCA TCATCTTGTT 660 

TTCTTTTTGT AGGTATAAAT AAAAACACTG TTGACAATAA AAAAAAAAAA AAAAAAAAAA 720 

AAAAAAAAAA AAAAAAAAAA NAAAAAAAAA AAAAAAAAAA 760 
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(2) INFORMATION FOR SEQ ID NO: 226: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2057 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

10 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 226: 
CCGAGCCGGC TGCGCCGGGG GAATCCGTGC GGGCGCCTTC CGTCCCRGTC CCATCCTCGC 60 
15 CGCGCTCCAG CACCTCTGAA GTTTTGCAGC GCCCAGAAAG GAGGCGAGGA AGGAGGGAGT 120 



GTGTGAGAGG AGGGAGCAAA AAGCTCACCC TAAAACATTT ATTTCAAGGA GAAAAGAAAA 180 



AGGGGGGGCG CAAAAATGGC TGGGGCAATT ATAGAAAACA TGAGCACCAA GAAGCTGTGC 240 

20 

ATTGTTGGTG GGATTCTGCT CGTGTTCCAA ATCATCGCCT TTCTGGTGGG AGGCTTGATT 300 

GCTCCAGGGC CCACAACGGC AGTGTCCTAC ATGTCGGTGA AATGTGTGGA TGCCCGTAAG 360 

25 AACCATCACA AGACAAAATG GTTCGTGCCT TGGGGACCCA ATCATTGTGA CAAGATCCGA 420 



GACATTGAAG AQGCAATTCC AAGGGAAATT GAAGCCAATG ACATCGTGTT TTCTGTTCAC 480 
ATTCCOCTCC CCCACATGGA GATGAGTCCT TGGTTCCAAT TCATGMTGTT TATCCTGCAG 540 



CTGGACATTG CCTTCAAGCT AAACAACCAA ATCAORGAAA ATGCAGAAGT CTCCATGGAC 600 



GTTTCCCTGG CTTACCGTGA TGACGCGTTT GCTGAGTGGA CTGAAATGGC CCATGAAAGA 660 
35 GTACCACGGA AACTCAAATG CACCTTCACA TCTCCCAAGA CTCCAGAGCA TGGAGGGCCG 720 



GTTACTATGA ATGTGATGTC CTTOqTTTCA TGGAAATTGG GTCTGTGGCC CATGAAQTTT 780 



TACCTTTTAA ACATCCGGCT GCCTGTGAAT GAGAAGAAGA AAATCAATGT QGGAATTQGG 840 

40 

GAGATAAAGG ATATCCGGTT GGTQGQ6ATC CACCAAAATG GAGGCTTCAC CAAGGTGTGG 900 



TTTGCCATGA AGAOCTTCCT TACQCCCAGC ATCTTCATCA TTATGGTGTG GTATTGGAGG 960 
45 AGGATCACCA TGATGTCCCG AOCCCCAGTG CTTCTGGAAA AAGTCATCTT TGCCCTTGGG 1020 



ATTTCCATGA CCTTTATCAA TATCCCAGTG GAATGGTTTT CCATCGGGTT TGACTGGACC 1080 



TGGATGCTGC TGTTTGGTGA CATCCGACAG GCATCTTCTA TGCRATGCTT CTKTCCTTCT 1140 

50 

GGATCATCTT CTGTGGCGAG CACATGATGG ATCflGCACGA GCGGAAOCAC ATCGCAGGGT 1200 



ATTGGAAGCA AGTCGGACCC ATTGCCGTTG GTCCTTCTGC CTCTTCATAT TTGACATGTG 1260 

55 TGAGAGAGGG GTACAACTCA CGAATCC3CTT CTACAGTATC TGGACTACAG ACATTGQGAA 1320 

CAGAGCTGGC CATQGCTTTC ATCATCGTGG CtGGAATCTG CCTCTGOCTC TAACTTCCTG 1380 

TTTCTATGCT TCATGGTATT TCAGGTGTTT CGGAACATCA GTGGGAAGCA GTCCAGCCTG 1440 

60 
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CCA3CrArS.2L SCAJA3TCC3 ^CjGCTACAC 7ATG«GGGGC TAATTTTTAG GTTCAAGTTC 1500 

CTCrt'TSCTrA TCACCTTGGC rrC-CSCTCCC ATGACTGTCA TCTTCTTCAT CGTTAGTCAG 1560 
5 GT.=ACGSAA3 GCCArTGGG?-. AATGGGGCSG CGTCACASTC CCAAGTGAAC AGTGCCTTTT " 1620 

TC-.CAGGCir CTATC-C-GArO rC-GAATCTGT ATGrCTTTGC TCTGATGTTC TTGTATGCAC 1680 

a^TCCwvTA.-. AAACTATGGA a--;y3ACC--S? CC-«-:TGXVLT GCAACTCCCA TGTAAATCGA 1740 

10 

QGGAAGATTG TGCnTGTTT GTTTCGGAAC TTrATCA.-JGA ATTGTTCAGC GCTTCGAAAT 1800 

ATTCCTTCAr CAATGACAAC C-CAG-CTTCTG GTATTTGAG? CAACAAGGCA ACACATGTTT 1860 

15 ATG^r-GTITG CATTTGCAGr rTTCACAGTC ACArTGAZTG TACTTGTATA CGCACACAAA 1920 

TACACTCAT? TA3CC??rA.r CTCAAAATGT TAAATATAA3 GAAAAAAGCG TCAACAATAA 1980 

ATATTCTTTG ASTATrGTCT TACTTCTCTT AAAAAAAAAA AAAAAAACTC GTGCCGAATT 2040 

20 

CGGCACGAGC GSZACGA 2057 



25 

(2) i:3t)?:iA7r3N for S2Q ID >J0: 227: 

(A) LENGTH: 2 03-; base pairs 
30 (B) Ti?E: nucleic acid 

(C) STPAICSEJJSSS: doiibie 

(D) lir.ear 

(xi) SSQLilSCZ -rSSGPZ??::::: SH; TZ 227: 

35 

GGCAGA3GGC CATTTCCTGC A.2.AGAGCCAA ACCCCCATTC CTCTGTGCCC CTCCTCTCCC 60 

ACCAA^TGCr rrATAAAAAT AGcfCTTGTr ACCGGAAATA ACTGTTCATT TTTCACTCCT 120 

40 CCCTCCTAGG TCACACTTTT C-iGAAAAAGA ATCTGCAICC TGGAAACCAG AAGAAAAATA 180 

TGAGAGGGGG AATCAtCCrrG TGATGTGTGT SCTGCCTTTG GCTGAGTGTG TGGAGTCCTG 240 
CTCAGGTCTT ASSTACAGTG -TOTTGATCG TGGTGGCTTG AGGGGAACCX5 CTTGTTCAGA . 300 

45 

GCTGTGACTG C3GCTGCACT GGAGAGAAGC TGCCCTTGGC TGCTCCTAGC GCCGGGCXnT 360 

CTCTCCTCG? C=SCArcCAG AGCAGCCAGT GTCCGGGAGG CAGAAGGTAC CGGGGCAGCT 420 

50 ACTGGAGGAC rSTGCGGGCC T3CCTGGGCT GCCCCCTCCG CXX3TGGGGCC CTGTTGCTGC 480 

TGTCCATCTA nTCTACrA.C tCCCTCCCAA ATGCXSGTCGG CCCGCCCTTC ACTTGGATGC 540 

TTGCCCTCC7 GGGCCTTCTC GCAGGCACTG AACATCCTCC TGGGCCtCAA GGGCCTGGCC 600 

55 

CpyGCrGAGA rCTCTSGyGT GTOTGAAAAA GGGAATTTCA ACGTGGCCCA TGGGCTGGCA 660 

TGGTCAXATT ACATCGGATA TCTGCGGCrG ATCCTGCCAG AGCTCCAGGC CCGGATTCGA 720 

60 ACTTACAATC ASCAT7ACAA CAACCTGCTA CGGGGTGCAG TGAGCXAGCG GTGTNATATT 780 
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CTCCTCCCAT TQGACTGTGG GGTGCCTGAT AACCTGAGTA TGGCTGACCC CAACATTCGC 840 

TTCCTQGATA AACTGCCCCA GCAGACCQGT GACaTTGCTG GCATCAAGGA TCGC3GTTTAC 900 

5 

AGCAACAGCA TCTATGAGCT TCTGGAGAAC GGGCAGCGGG CGGGCACCTG TGTCCTGGAG 960 

TACGCXIACCC CCTTGCAGAC TTTGTTTGCC ATGTCACAAT ACAGTCAAGC TGGCTTTAGC 1020 

10 GGGGAGGATA GGCTTGAGCA GGCCAAACTC TTCTGCCGGA CACTTGAGGA CATCCTGGCA 1080 

GATGCCCCTG AGTCTCAC3AA CAACTGCCGC CTCATTGCCT ACCAGGAACC TGCAGATGAC 1140 

AGCAGCTTCT CGCTGTCCCA GGAQC3TTCTC CX3GCACCTGC GGCAGGAGGA AAAGGAAGAG 1200 

15 

GTTACTGTGG GCAGCTTGAA GACCTCAGCG GTGCCCAGTA CCTCCACGAT GTCCCAAGAG 1260 

CCTGAGCTCC TCATCAGTGG AATGGAAAAG CXTCCTCCCTC TCCGCACGGA TTTCTCTTGA 1320 

20 GACCCAGGGT CACCAGGCCA GAGCCTCCAG TGGTCTCCAA GCCTCTGGAC TGGGGGCTCT 1380 

CTTCAGTGGC TGAATGTCCA GCAGAGCTAT TTCCTTCCAC AGGGGGCCTT GCAGGGAAGG 1440 

GTCCAGGACT TGACATCTTA AGATGCX5TCT TGTCCCCTTG GGCCAGTCAT TTCCXXTCTC 1500 

25 

TGAGCCTCGG TGTCTTCAAC CTGTGAAATG GGATCATAAT CACTGCCTTA CCTCCCTCAC 1560 

QGTTGTTGTG AGGACTGAGT GTCTGGAAGT TTTTCATAAA CTTTGGATOC TAGTGTACTT 1620 

30 AGGGGGTGTG CCAQGTGTCT TTCATGGGGC CTKTAGACC CACTCCCCAC CCTTCTCCCC 1680 

TTCCTTTGCC CGGGGACXXTC GAACTCTCTC AATGGTATCA ACAGGCTCCT TCGCCCTCTG 1740 

GCTCCTGGTC ATGTTCCATT ATTGGGGAGC CCCAGCAGAA GAATGGAGAG GAGGAGGAGG 1800 

35 

CTGAGTTTGG GGTATTGAAT CCCCCGGCTC CCACCCTGCA GCATCAAGGT TGCTATGGAC 1860 

TCTCCTGCCG GGCAACTCTT GCGTAAOXrAT GACTATCTCT AGGATTCTGG CACCACTTCC 1920 

40 TTCCCTQGCC CCTTAAGCCT AGCTGTGTAT OGGCACCCCC ACCCXIACTAG AGTACTCCCT 1980 

CTCACTTGCG GTTTCCTTAT ACTCCACCCC TTTCTCAACG GTCCTTTTTT AAAGCACATC 2040 

TCAGATTAAA AAAAAAAAAA AAAAAAAAAA AGGGGGGGCN GCNT 2084 



45 



50 



(2) INFORMATION FOR SEQ ID NO: 228: 



(i) SEQUENCE CHARACTERISTICS: 

(A) I.lE^3GTH: 2143 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
55 (D) TOPOLOGY: linear 



60 



(xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 228: 
TOGACCCACG CGTCCGGTTG AATTCCTTGA CCTGCAAACA CATATTTATT AGCCTGACTC 60 
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481 



AAACAATGAA GCTATTAAAA CTTCGGAGGA ACATTGTAAA ACTCTCTTTG TATCGGCATT 120 

TCACCAACAC GCTTATTTTG GCAGOXSGCAG CATCCATTGT GTTTATCATC TGGACAACCA 180 

TGAAGTTCAG AATAGTGACA TCTCAGTCGG ACTQGCQGGA GCTGTGGGTA GACGATGCCA 240 

TCTGGCGCTT GCTGTTCTCC ATGATCCTCT TTGTCATCAT GGTTCTCTGG CGACCATCTG 300 

CAAACAACCA GAGGTTTGCC TTTTCACCAT TGTCTGAGGA AGAGGAGGAG GATGAACAAA 360 

AGGAGCCTAT GCTGAAAGAA AGCTTTGAAG QAATGAAAAT GAGAAGTACC AAACAAGAAC 420 

CCAATGGAAA TAC3TAAAGTT AACAAAGCAC AGGAAGATGA TTTGAAGTGG GTAGAAGAGA 480 

15 ATGTTCCTTC TTCTGTGACA GATGTAGCAC TTCCAGCCCT TCTGGATTCA GATGAGGAAC 540 

GAATCATCAC ACACTTTGAA AGGTCCAAAA TGGAGTAAGG AATGGGAAGA TTTGCAGTTA 600 

AAGATQGCTA CCATCAGGGA AGAGATCAGC ATCTGTGTCA GTCTTCTGTA CGGCTCCATG 660 

20 

GGATTAAAG6 AAGCAATGAC ATCCTGATCT GTTCCTTGAT CTTTGGGCAT TGGAGTTGGC 720 

GAGAGGTGTC AGAACAAAGA GAACATCTTA CTGAAAACAA GTTCATAAGA TGAGAAAAAT 780 

25 CTACGAGCTT CTTATTTACA ACACTGCTGC CCCCTTTCCT CCCAGACTCT GACATGGATG 840 

TTCATGCAAC TTAAGTGTGT TGTTCCTGAA CTTTCTGTAA TGTTTCATTT TTTAAATCTG 900 

ACAAACTAAA AAGTTTAACG TCTTCTAAAA GATTGTCATC AACACCATAA TATGTAATCT 960 

30 

CCAGGAGCAA CTGCCTGTAA TTTTTATTTA TTTAGGGAGT TACATAGGTG ATGGGGGAAA 1020 

TTGTTAACTA CCTTTCATTT TCCTGGGAAG TCAAGGTTAC ATCTTGCAGA GGTTGTTTT6 1080 

35 AGAAAAAAGG GCXrCTTCTGA GTTAAGGAGC CATAGTTCTA TCAATGATCA AAAGAAAAAA 1140 

AAAAAAAAGA GAAACTGTTA CAGTATGATT CAGATCATTT AAAAAAGCAA AATCAAGTGC 1200 

AATTTTGTTT ACAAATGGTG TATATTAAAG ATTTTTCTAT TTCAGATGTA CTTTAAAGAG 1260 

40 

AAATATTAGC TTAACTCTTT TGACATCTGC TATTGTGACA CATCCCATTG CTGGCAATGT 1320 

QGTGCACACT COGAAACTTT TAACTACTGT TTTGTAAGCX: TCCAAGQGTG QCATTGCAGG 1380 

45 GTCCTTAGGC AATGTTTTGT TTGCCTTTAT GCAGAGAGGT GCTCCAAGTG CIGTGATTGA 1440 

GCACCX3TGCT AGAGGAACTG TAATGCTTCA GAAGTTGTAG CTTTATACAAA GGAAACAGGT 1500 

CCTGCTQGCT TAATTTAAAC AGTTATTGCA TGAAGTAGCG TGGAGGCCCT GGACTGCTGC 1560 

50 

TCGTTCTTTA GGATGGACTG TTCTQGTATC TCGTATTGGT TTAGAGACTG TTAATAAGGG 1620 

ACATCACAAG GTGATGGGAT TCATTTGAAG CACTCTATTT CTGTTTTAAT GGTTTTATCC 1680 

55 AATTTTGCCT TCCCAAGATT TTTGTTCTAC ATAAAAAGTT CATGCCACTT TTTAATATAA 1740 

AAAAATTTAA CAAAATTAAT GTATTrTTCT CATTTTTTTC AAACTTTTTC TAAAGACTCT 1800 

TTCTGTCAAA CTCATGAAAA ATTTCTTTCT ATGQCTTTTA TTCTAGATTC TCTTATTTTC 1860 
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TGTTAAAACC AATGACCACA TGACCACAAT CTTCACTAAC TCATACTGCA GTGAAAGTGT 
TAACCCTTAG GTAGTTTCTC TACAACTCTT TGCTATGGTG ATTTTTAAAA AAGTTTCCTA 
GGGAAGTATC TCTGAGGGAA CAGGCAATCT GAAGGAACTG ACTATATTCT CCATGGCTAA 
GTCCATTAGG CCAAAAGNCT GGGTQGGTAT TGGTTGTCAN GCTGTCTATT GGCATATTAA 
AAACGTAGGC CX3GANGGAAT AATTAGGTTG TNATGCCC5GC GC3G 

(2) INFORMATION FOR SEQ ID MO: 229: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1025 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 
CCTGGCCCAC ATTGCTTCAT TGGCCTGGCC ATGOGCCTGT ACTATGGCAG CCGCTAGTCC 
CTGACAACTT CCACCCTGAT TCCQGACCCT GTAGATTGGG CGCCACCACC AGATCCCCCT 
COCAGGCCTT CCTCCCTCTC CCATCAGCAG CCCTGTAACA AGTGCCTTGT GAGAAAAGCT 
GGAGAAGTGA QGGCAiSCCAG GTTATTCTCT GGAGGTTGGT GGATGAAGGG GTACCCTAGG 
AGATGTGAAG TGTGGGTTTG GTTAAGGAAA TGCTTACCAT CCCCCACCCC CAACCAAGTT 
CTTCCAGACT AAAGAATTAA GGTAACATCA ATACCTAGGC CTGAGAAATA ACCCCATCCT 
TGTTGGGCAG CTCCCTGCTT TGTCCTGCAT GAACAGAGTT GATGAAAGTG GGGTGTGGGC 
AACAAGTGGC TTTCCTTGCC TACTTTAGTC ACCCAGCAGA GCCACTGGAG CTGGCTAGTC 
CAGCCCAGCC ATGGTGCATG ACTCTTCCAT AAGGGATCCT CACCCTTCCA CTTTCATGCA 
AGAAGGCCCA GTTGCCACAG ATTATACAAC CATTACCCAA ACCACTCTGA CAGTCTCCTC 
CAGTTCCAGC AATGCCTAGA GACATGCTCC CTGCCCTCTC CACAGTGCTG CTCCCCACAC 
CTAGCCTTTG TTCTGGAAAC CCCAGAGAGG GCTGGGCTTG ACTCATCTCA GGGAATGTAG 
CCCCTQGGCC CTGGCTTAAG CCGACACTCC TGACCTCTCT GTTCACCCTG AGGGCTGTCT 
TGAAGCCCGC TACCCACTCT GAGGCTCCTA GGAGGTACCA TGCTTCCCAC TCTGQGGCCT 
GOCCCTGCCT AGCAGTCTCC CAGCTCCCAA CAGCCTGGGG AAGCTCTGCA CAGAGTGACC 
TGAGACCAGG TACAGGAAAC CTGTAGCTCA ATCAGTGTCT CTTTAACTGC ATAAGCAATA 
AGATCTTAAT AAAGTCTTCT AGGCTGTAGG GTGGTTCCTA CAACCACAGC CAAAAAAAAA 
AAAAA 
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(2) INFORMATION FOR SEQ ID NO: 230: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1250 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 
GCCCACGCGT CCGCCCACGC GTCCGGCGGT GCGGAGTATG GGGCGCTGAT GGCCATC3GAG 60 
15 GGCTACTGGC GCTTCCTGGC GCYGCTGGGG TCGGCACTGC TCGTCC5GCTT CCTGTCGGTG 120 
ATSTTCQCCC TCGTCTGGGT CCTCCACTAC CGAGAGGGGC TTGGCTGGGA TGGGAGCGCA 180 
CTAGAGTTTA ACTGGCACCC ACTGCTSATG GTCACCQGCT TCGTCTTCAT CCAGGGCATC 240 

20 

GCATCATCGT CTACAGACTG CCGTGGACCT GGAAATGCAG CAAGCTCCTG ATGAAATCCA 300 
TCCATGCAGG GTTAAATGCA GTTGCTGCCA TTCTTGCAAT TATCTCTGTG GTQGCOGTGT 360 
25 TTGAGAACCA CAATGTTAAC AATATAGCCA ATATGTACAG TCTGCACAGC TGGGTTGGAC 420 
TGATAGCTGT CATATGCTAT TTGTTACAGC TTCTTTCAGG TTTTTCAGTC TTTCTGCTTC 480 
CATGGGCTCC GCTTTCTCTC CGAGCATTTC TCATGCCCAT ACATGTTTAT TCTGGAATTG 540 

30 

TCATdTTGG AACAGTGATT GCAACAGCAC TTATGGGATT GACAGAGAAA CTGATTTTTT 600 

CCCTGAGAGA TCCTGCATAC AGTACATTCC CGCCAGAAGG TGTTTTCGTA AATACGCTTG 660 

35 GCCTTCTGAT CCUX3 G TGTIC GGGGCCCTCA. TTTTTTGGAT AGTCACCAGA CCGCAATGGA 720 

AACGTCCTAA GGAGCCAAAT TCTACCATTC TTCATCCAAA TGGAGGCACT GAACAGGGAG 780 

CAAGAGGTTC CATGCCAGCC TACTCTQGCA ACAACATGGA CAAATCAGAT TCAGAGTTAA 840 

40 

ACARTGAAGT AGCAGCAAGG AAAAGAAACT TAGCTCTGGA TGAGGCTGGG CAGAGATCTA 900 

CCATGTAAAA TGTTGTAGAG ATAGAGCCAT ATAACGTCAC GTTTCAAAAC TAGCTCTACA 960 

45 GTTTTGCTTC TCCTATTAGC CATATGATAA TTGGGCTATG TAGTATCAAT ATTTACTTTA 1020 

ATCACAAAQG ATGGTTTCTT GAAATAATTT GTATTGATT6 AQGCCTATGA ACTGACCTGA 1080 

ATTQGAAAGG ATGTGATTAA TATAAATAAT AGCAGATATA AATTGTGGTT ATGTTACCTT 1140 

50 

TATCTTGTTG AGGACCACAA CATTAGCACG GTGCCTTGTG CAKAATAGAT ACTCAATATG 1200 

TGAATATGTG TCTACTAGTA GTTAATTGGA TAAACTGGCA GCATCCCTGA 1250 

55 

(2) INFORMATION FOR SEQ ID NO: 231: 
60 (i) SEQUENCE CHARACTERISTICS: 
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<A) LENGTH: 1811 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS: double 

(D) TOPOLOGY: linear 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 
CNGNCAGTAC CGGTCNGATT CCCGGGTCGA CCCACGCGTC CGCTGCATTC CAGGGCCTTT 60 
10 CAGTGGCTTT .CATTCTGAAG TTCCTGGATA ACATGTTCCA TGTCTTGATG GCCCAQGTTA 120 



CCASTQTCAT TATCACAACA GTGTCTGTCC TGGTCTTTGA CTTCAGGCCC TCCCTGGAAT 180 

TTTTCTTQGA AGCCSCATCA GTCSTYCTCT CTATATTTAT TTATAATGCC AGCAAGCCTC 240 

15 

AAGTTCCGGA ATACGCACCT AGGCAAGAAA GGATCCGAGA TCTAAGTGGC AATCTTTGGG 300 

AGCGTTCCAG TGGGGATGGA GAAGAACTAG AAAGACTTAC CAAACCCAAG AGTGATGAGT 360 

20 CAGATCAAGA TACTTTCTAA CTGGTACCCA CATAGTTTGC AGCTCTCTTG AACCTTATTT 420 



TCACATTTTC AGTGTTTGTA ATATTTATCT TTTCACTTTG ATAAACCAGA AATGTTTCTA 480 



AATCCTAATA TTCTTTGCAT ATATCTAGCT ACTCCCTAAA TGGTTCCATC CAAQGCTTAG 540 

25 

AGTACCCAAA GGCTAAGAAA TTCTAAAGAA CTGATACAGG AGTAACAATA TGAAGAATTC 600 

ATTAATATCT CAGTACTTGA TAAATCAGAA AGTTATATGT GCAGATTATT TTCCTTGGCX: 660 

30 TTCAAGCTTC CAAAAAACTT GTAATAATCA TGTTAGCTAT AGCTTGTATA TACACATAGA 720 

GATCAATTTG CCAAATATTC ACAATCATGT AGTTCTAGTT TACATGCCAA AGTCTTCCCT 780 

TTTTAACATT ATAAAAGCTA GGTTGTCTCT TGAATTTTGA GGCCCTAGAG ATAGTCATTT 840 

35 

TGCAAGTAAA GAGCAACQGG ACCCTTTCTA AAAACGTTGG TTGAAGGACC TAAATACCTG 900 

GCCATACCAT AGATTTGGGA TGATGTAGTC TCTQCTAAAT ATTTTGCTGA AGAAGCAGTT 960 

40 TCTCAGACAC AACATCTCAG AATTTTAATT TTTAGAAATT CATGGGAAAT TGGATTTTTG 1020 



TAATAATCTT TTGATGTTTT AAACATTGGT TCCCTAGTCA CCATAGTTAC CACTTGTATT 1080 



TTAAGTCATT TAAACAAGCC ACGGTGGGGC TTTTTTCTCC TCAGTTTGAG GAGAAAAATC 1140 

45 

TTGATGTCAT TACTCCTGAA TTATTACATT TTGGAGAATA AGAGGGCATT TTATTTTATT 1200 

AGTTACTAAT TCAAGCTGTG ACTATTGTAT ATCTTTCCAA GAGTTGAAAT GCTGGCTTCA 1260 

50 GAATCATACC AGATTGTCAG TGAAGCTGAT GCCTAGGAAC TTTTAAAGGG ATCCTTTCAA 1320 



AAGGATCACT TAGCAAACAC ATGTTGACTT TTAACTGATG TATGAATATT AATACTCTAA 1380 



AAATAGAAAG ACCAGTAATA TATAAGTCAC TTTACAGTGC TACTICACAC TTAAAAGTGC 1440 

55 

ATGGTATTTT TCATGGTATT TTGCATGCAG CCAGTTAACT CTCGTAGATA GAGAAGTCAG 1500 

GTGATAGATG ATATTAAAAA TTAGCAAACA AAAGTGACTT GCTCAGGGTC ATGCAGClfeG 1560 

60 GTGATGATAG AAGAGTGGGC TTTAACTGGC AGGCCTGTAT GTTTACAGAC TACCATACTG 1620 
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TAAATATGAG CTTTATGGTG TCATTCTCAG AAACTTATAC ATTTCTGCTC TCCTTTCTCC 1680 

TAAGTTTCAT GCAGATGAAT ATAAGGTAAT ATACTATTAT ATAATTCATT TGTGATATCC 1740 

5 

ACAATAATAT GACTGGCAAG AATTGGTGGA AATTTGTAAT TAAAATAATT ATTAAACCTA 1800 

AAAAAAAAAN N * . l^ll 

10 



(2) INFORtlATION FOR SEQ ID NO: 232: 

15 (i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 2271 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEISIESS : doioble 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 
dGACCTCAT GGCGTAGAGC CTAGCAACAG CGCAGGCTCC CAGCCGAGTC CGTTATGGCC 60 
25 GCTGCCGTCC CGAAGAGGAT GAGGGGGCCA GCACAAGCGA AACTGCTGCC CGGGTCGGCC 120 
ATCCAAGCCC TTCTGGGGTT GGCGCGGCCG CTGGTCTTGG CGCTCCTGCT TGTGTCCGCC 180 
GCTCTATCCA GTCTTOTATC ACGGACTGAT TCACCGAGCC CAACCGTACT CAACTCACAT 240 

30 

ATTTCTACCC CAAATGTGAA TGCTTTAACA CATGAAAACC AAACCAAACC TTCTATTTCC 300 
CAAATCAIGCA CCACCCTCCC TCCCACGACG AGTACCAAGA AAAGTGGAGG AGCATCTGTG 360 
35 GTCCCTCATC CCTCGCCTAC TCCTCTGTCT CAAGAGGAAG CTGATAACAA TGAAGATCCT 420 
AGTATAGAGG AGGAGGATCT TCTeATCSCTG AACAGTTCTC CATCCACAGC CAAAGACACT 480 
CTAGACAATG GCGATTATGG AGAACCAGAC TATGACTGGA CCACGGGCCC CAGQGACGAC 540 

40 

GACGAGTCTC ATNGACACCT TGGAAGAAAA CAGGGGTTAC ATGGAAATTG AACAGTCAGT 600 
GAAATCTTTT AAGATGCCAT CCTCAAATAT AGAAGAGGAA GACAQCCATT TCTTTTTTCA 660 
45 TCTTATTATT TTTGCTTTTT GCATTGCTGT TGTTTACATT ACATATCACA ACAAAAGGAA 720 
GATTTTTCTT CTGGTTCAAA GCAGGAAATG GCGTGATQGC CTTTGTTCCA AAACAGTGGA 780 
ATACCATCGC CTAGATCAGA ATGTTAATGA GGCAATGCCT TCTTTGAAGA TTACCAATGA 840 

50 

TTATATTTTT TAAAGCACTG TGATTTGAAT TTGCTTATGT AATTTTATTT GCTTGACTTT 900 
TTATATGATA TTOIGCAAAT GTTTGCCATA GGCAATTGGT ACTTAAATGA GAGGTGAGTC 960 
55 TCTCTTTTGC CTTGGTGCTT TGGAAATTAA ATGTCACAAA CGAGTATATA ATTTTTTATC 1020 
TGTACTTTTA GAGCTGAGTT TAATCAGGTG TCCAAAATGT GAGTTAAACA TTACCTTATA 1080 
TTTACACTCT TAGTTTTTAT TGTTTTAGAT TTATTATGCT TCTTCTQGAA GTATTAGTGA 1140 

60 
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TGCTACTTTT AAAAGATCCC AAACTTGTAA CTAAATTCTG ACATATCTGT TACTGCTGAC 1200 

TCACATTCAT TCTCCGCCAT TCAAATACTA TTTTTTATCC ACATTTTTTT TTGTTCCCAA 1260 

ACTCTAATGT ACAAGGATAT GTGTGATAAT GCTTTGGATT TGAGTAATAT TTTTTTTTCT 1320 

TCCAAGAAAA CTGCTTTQGA TATTTTTAGA TAATTTAAAC ATAATTTAGG ATAATGATAT 1380 

TGCTCAATCT GACCACAATT TTAGGTAAAA CATTAAATGT GTCAAGAAAT CTTGGCAACA 1440 

GAGACTCTCC AGCTTGCAGT GGACATAGAT AAAATGTTAC AGAGATACTA TTTTTTTGGT 1500 

TGGAATTACT ATATTAAATT TAGAAGCAGA AACTGGTAAA ATGTTAAATA CATGTACAAT 1560 

15 TGCTTTTAGT TAGCAATTGA TTGTAGCATG GGTTCCTCCA AGGTTTCAAG CAATGGGCAG 1620 

AGTTTAAAAT TATATCAGAT TCGTTTACTT CGTTTATTAT TTTACAGTAA ATTTGAATAA 1680 

ATCTTAGGGG TCATTATCAC TTAAATAATA CTGTACCTAG GOXrTTTCAAA TTAAAATTAT 1740 

20 

ACCTGAATGA AGrTGTTTGT ATACATAAAG GATATTTC?rG TACAATTACC TTTTTTCCCC 1800 

CACACTTCTT TTCTTTOTTT TTGTTTTTTA TGGCAACTGG AAAGTATTTA CTATGGGATT 1860 

25 CATTTATGTC TGTCTTTCTA TCATAAAGAA TTGATCAATA TGTAAATATG TGATTTGAAC 1920 

CATGGTTGAC TTACAAGTGT CACTACAGCT TTTTAGAAAA CATAGCCCTA ATATATGTTA 1980 

AGCAGGACCC GGGTGAGCCA GTGGGCTTGC GCTTTATGTA GAGCTGGAAG AAGGCCGTCC 2040 

ATCCTGTCTC TTCGGCGGAC AGTOTACTTT CCTAATAGQG AAGGGAAGCA CAATQGAAAT 2100 

ACCCCTGAAC CGTTTTATTG CAGTAATTTT TTTCATATCT GAAACTATTA TTTAATATTT 2160 

35 TGAATAAGAT TTTAAAAAAT AAATGGCAAA GATATAAATC TAAAAAAAAA AAAAAAAAAA 2220 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAANANA N 2271 



30 



40 



(2) INFORMATION FOR SEQ ID NO: 233: 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 1338 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 

CTTCCGGTTC TCCQGGCAGC TGCCACTGCT GTAGCTTCTG CCACCTGCCA CGACCGGGOC 60 

TCTCCCTGGC GTTTGGTCAC CTCTGCTTCA TTCTCCAOCG OGCCTATGGT CCCTCTTGGA 120 

55 

GCCAGCGTGG CCaJGCCTGGC GGCTCCCGGG TGGTGAGAGA GCGGTCOGGG AACGATGAAG 180 

GCCTCGCAGT GCTGCTGCTG TCTCAGCCAC CTCTTGGCTT CCGTCCTCCT CCTGCTGTTG 240 

60 CTCCCTGAAC TAAGCGGGYC CCTGC3MAGTC CTGCTGCAGG CAGCCGAGGC CGCGCCAGGT 300 
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YTTGQGCX:rrC CTGACCCTAG ACCAGGACAT TACOGCCGCT GCCACCGGGC CCTWACCCXT 
GCXrCAGCAGC CGGGCCGTGG TCTGGCTGAA GCTGCGGGGG CXXK33GGGCT CCGAGGGAGG 
CAATGGCAGC AACCCTGTGG CCGGGCTTGA GACGGACGAT CACX3GAGGGA AGGCXX3GGGA 
ARGCTCGGTG GGTGGCGGCC TTGCTGTGAG CCCCAACCCT GGCGACAAGC CCATGACCCA 
GCGGGCCCTG ACCGTGTTGA TGGTGGTGAG CGGCGCGGTG CTGGTGTACT TCGTGGTCAG 
GACGGTCAGG ATGAGAAGAA GAAflCCGAAA GACTAGGAGA TATGGAGTTT TQGACACTAA 
CATAGAAAAT ATGGAAITCA CACCTTTAGA ACAGGATGAT GAGGATGATG ACAACACGTT 
GTTTGATGCC AATCATCCTC GAAGATAAGA ATGTGCCTTT TGATGAAAGA ACTTTATCTT 
TCTACAATGA AGftGTGGAAT TTCTATGTTT AAGGAATAAG AAGCCACTAT ATCAATGTTG 
GGGGGGTATT TAAGTTACAT ATATTTNAAC AACCTTTAAT TTGCTGTTGC AATAAATACC 
GTATCCTTTT ATTATATCTT TATATGTATA GAAGTACTCT GTTAATGGGC TCAGAGATGT 
TQGGGATAAA GTATACTGTA ATAATTTATC TGTTTGAAAA TTACTATAAA ACGGTGTTTT 
CTGRTCGGTT TT TG TTTCCT GCTTACCATA TGATTGTAAA TTGTTTTATG TATTAATCAG 
TTAATGCTAA TTATTTTTGC TGATGTCATA TGTTAAAGAG CTATAAATTC CAACAACCAA 
CTGGTGTGTA AAAATAATTT AAAATYTCCT TTACTGAAAG GTATTTCCCA TTTTTGTGGG 
GAAAAGAAGC CAAATTTATT ACTTTGTGTT GGGGTTTTTA AAATATTAAG AAATGTCTAA 
GTTATTGTTT GCAAAACAAT AAATATGATT TTAAATTCTC TTAAAAAAAA AAAAAAAAAC 
CCCGGGGGGG GGCCTGGN 

(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENCnCHi 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 

Met Leu Ser Thr Gly lie Glu Val Ala Arg Pro Pro Ala Thr Leu Leu 
1 5 10 15 

Gly Leu Met Phe Val Leu Thr Gly Met Pro Arg Gly Leu Arg Xaa 
20 25 30 

(2) INFORMATION FOR SEQ ID MO: 235: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 
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<B) TOPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 

Met Asn Val Val He Val He He Leu Phe Ser Phe Asp Ser Val Gly 
1 5 10 15 

Thr Met Phe Ser Cys Asn Arg He Pro Lys He Thr Val Leu Asn Lys 
20 25 30 

Leu Lys Phe Xaa Cys Glu Val Leu Leu Arg He Gin Thr He Gin Gly 
35 40 45 

Phe Tyr Arg Cys Thr Arg He Ser Arg Tyr Lys Gly He Phe Pro Asp 

50 55 60 ' . 

Phe Cys Gin Ser Gin Cys Met Gly Cys Asn Pro Glu Ser Xaa Met Ala 
65 70 75 80 

Val Pro Ala Leu Val Thr Pro He Leu Ala His Arg Lys Lys Glu Lys 
85 90 95 

Gly Met Cys Leu Phe Thr Leu He He Ala Pro Thr Arg Cys Thr His 
100 105 110 

Tyr Phe Cys Xaa 
115 



(2) INFORMATION FOR SEQ ID NO: 236: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCBIPTION: SEQ ID NO: 236: 

Met Ser Ser Ala Lys He Val Arg Gin Arg Gly Ala Val Pro Thr Tyr 
1 5 10 15 

Tyr Thr Thr Glu Ala Gly Glu He He Phe Leu Val Leu Asn Trp Ser 
20 25 30 

Leu Ser He Leu His He Val Asp Val Leu Cys Ser Lys Pro Glu Lys 
35 40 45 

Ser Val Thr Glu Asp Ala Ala Ser .Gly Leu Ser Gin Arg Mfet Thr Ala 
50 55 60 

Leu Val Trp Arg Lys Gly Pro Asp Gly Gly Ser Arg Lys Pro He Leu 
65 70 75 80 

Leu Leu Phe Phe Phe Leu Pro Leu He Leu Cys Phe His Ser Phe He 
85 90 95 . 

His Ser Ser Asn He Cys Xaa 
100 
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(2) INFORMA.TIC»(I FOR SEQ ID NO: 237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEl^GTH: 42 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 

Met lie Leu Phe Pro Gin Xaa Ala Leu Arg Leu Gly Xaa Trp Pro Arg 
1 5 10 15 

Thr Trp Ser lie Leu Xaa Lys Tyr Ser Val Asn Phe Phe Ser Ala Tyr 
20 25 30 

Ser Pro Met Gly Ala Val Gly Thr Glu Phe 
35 40 



(2) INFORMATKXI FOR SEQ ID NO: 238: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids . 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 

Met lie lie Leu Leu Leu Phe Met Leu Leu Asn Asn Val Val Leu Val 
15 10 15 

Gin Glu Asp Asn Cys Gin Arg Lys Asn Thr Val Gin Glu Arg Arg Xaa 
20 25 30 

Trp Ser Gin Trp Xaa 
35 



(2) INFORMATION FOR SEQ ID NO: 239: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 

Met Ala Ala Xaa Pro Pro Gly Cys Thr Pro Pro Xaa Leu Leu Asp lie 
1 5 10 15 

Ser Trp Leu Thr Glu Ser Leu Gly Ala Gly Gin Pro Val Pro Val Glu 
20 25 30 

Cys Arg His Arg Leu Glu Val Ala Gly Pro Arg Lys Gly Pro Leu Ser 
35 40 45 

Pro Ala Trp Met Pro Ala Tyr Ala Cys Gin Arg Pro Thr Pro Leu Thr 
50 55 60 

His His Asn Thr Gly Leu Ser Glu Leu Leu Glu His Gly Val Cys Glu 
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65 70 75 80 

Glu Val Glu Arg Val Arg Arg Ser Glu Arg Tyr Gin Thr Met Lys Val 
85 90 95 

Arg Arg Ala Gly Leu Gly Pro Thr Pro Gly Met Ser Cys Pro Gly Asn 
100 105 110 

Asp Asn Thr Val His Thr Met His Gly Glu Ala Asn Arg Gly Ser Xaa 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 240: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 andno acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: 

Met Ser He Leu Cys Cys Pro Xaa Leu Cys Leu Phe Phe Ser Phe Cys 
15 10 15 

He Ser Ser Gly Ser Cys Pro Phe Ser His Val Ser Gin Leu Ser Phe 
20 25 30 

He Ala Thr Phe Ser Gin Ser Ser Pro Val Leu Leu Val Pro Ala Tyr 
35 40 45 

Asn Thr Tyr Leu Ser Phe Leu Ala Phe Leu Asp Cys Ala Ser Leu Thr 
50 55 60 

Ser Thr Xaa 
65 



(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 

Met Ser Thr Phe Gin Leu Leu Leu Leu He Leu Ala Gin Ser Thr Tyr 
1.5 10 15 

Lys He Lys Ser Lys Pro Leu His Met Thr Asn His Thr Leu Leu Asn 
20 25 30 

Ser Pro Gly Leu Asn Pro Ser Ser Pro Thr Leu Asn Phe Lys Thr Gin 
35 40 .45 



Gin His Glu Ser Val Ser Tyr Ala Cys Cys His Met Arg Ser Leu His 
50 55 60 
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His Ala Phe Ala Xaa 
65 



(2) INFORMATIC»I FOR SEQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 
(D) TOPOIOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 

Met Val Ser Val Val Leu lie Phe Ser Phe Leu Ser Leu Thr He Ser 
15 10 15 

Thr Thr Ala Ser Ala Tyr Asn Gly Asn Asp Thr Gin Gly Trp Asn Asp 
20 25 30 

Lys Phe His Xaa Xaa Ser Val Lys Thr Gin Thr Xaa 
35 40 



(2) INFORMATION FOR SEQ ID 1«): 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 

Met He Ser Asp Ala Gly Ala Gly Phe Gly Val Phe Leu Leu Val Pro 
15 10 15 

Arg Ala Gly His Cys Trp Gly Ala Gly Lys Pro Leu Pro Ser Cys Pro 
20 25 30 

Ser Val Ala Ser He Pro Ser Trp Val Leu Pro Ser Phe Leu Glu Arg 
35 40 45 

Gly Arg Xaa 
50 



(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 244: 

Met Val Gin Thr He Gin Asp Phe Leu Ser Leu Phe Ser Thr Pro He 
15 10 15 



Phe Leu Leu Leu Leu Met Phe Glu Thr Leu Ser Leu Ala Pro Ala Trp 
20 25 30 
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Leu Lys Pro Leu Arg Vsu rhr £er His Ser Xaa 
35 4C 



(2) INFOFMATICM rCR SE^ ZD IC: 145: 

(A) L£:37n-£: 51 a=ir-C acids 
(3) Ty?£: 3mir.3 acii 

(xi) secih:x3 r£5:n?:i?ric?:: szq id :vC: 245: 

Met He Leu Me^ Pro Gly l^u Sly riir Ser Arg Gin Arg Ser Val Pro 
15 i: .15 

Phe Val Pro Thr Leu Asr. Ala £er "Hir Pro Gly Ala Met Thr Gly Pro 
20 25 30 

Thr Ala Thr Leu Thr Ser Cys 31r. Trp Thr Thr Ala Cys Arg Val Ser 
35 45 

Trp Ala Asn Gly Trp Thr Ser leu Arg Thr Phe Arg Zaa 
50 55 50 



(2) INFORMATION FCK SZQ ZD l*Z'. 245: 

(i) SECCZ^X3 CHA?ACTH?-mC3: 

(A) LH:J3TH: 35 szzzlc acitSs 

(3) Ti?Z: aiair.z acid 

(D) TCPCIOGY: lir^ar 
(xi) SEQUSrCE IESCRI?rrC«: SHQ ID 24S: 

' Met Ser His His Ala Gin ?so Arg Phe Leu Leu lie Thr Yj&z Leu Leu 
1 5 10 15 

Gin Glu Ala Lys Pro Val Ser Asr. lis Pro His Leu Leu Glu Ser Trp 
20 25 30 

Tyr Phe Gly Xaa 
35 



■ (2) INFORMATICS FCR SZQ ID IC: 247: 

(i) SEQUENCE CHAPACTSCSTICS : 

(A) LSXHH: 3= ardno acids 

(B) Tr=Z: amino acid 
(D) TCPIIOGY: liiiear 

(xi) SEQUENCS ZZSCBl?71C3^i SZQ ID 2K5: 247: 

Met Asn Ser Leu Phe Trp Met He Leu Leu Pro Val Ser Gin Asp Gin 
1 5 * 10 15 

Val Val Glu Gly Leu Gin Gly Gly Phe Ser Gin He His Met Arg He 
20 25 30 
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l>eu Arg Lys His Leu Xaa 
35 



(2) INFORMATION FOR SBQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) ZjENGTH: 211 amino acids y 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 

Met Ser Arg Ser Xaa Asp Val Thr Asn Thr Thr Phe Leu Leu Met Ala 
1 5 10 15 

Ala Ser lie Tyr Leu His Asp Gin Asn Pro Asp Ala Ala Leu Arg Ala 
20 25 30 

Leu His Gin Gly Asp Ser Leu Glu Cys Thr Ala Met Thr Val Gin He 
35 40 45 

Leu Leu Lys Leu Asp Arg Leu Asp Leu Ala Arg Lys Glu Leu Lys Arg 
50 55 60 

Met Gin Asp Leu Asp Glu Asp Ala Thr Leu Thr Gin Leu Ala Thr Ala 
65 70 75 80 

Trp Val Ser Leu Ala Thr Gly Gly Glu Lys Leu Gin Asp Ala Tyr Tyr 
85 90 95 

He Phe Gin Glu Met Ala Asp Lys Cys Ser Pro Thr Leu Leu Leu Leu 
100 105 110 

Asn Gly Gin Ala Ala Cys His Met Ala Gin Gly Arg Trp Glu Ala Ala 
115 • 120 125 

Glu Gly Leu Leu Gin Glu Ala Leu Asp Lys Asp Ser Gly Tyr Pro Glu 
130 135 140 

Thr Leu Val Asn Leu He Val Leu Ser Gin His Leu Gly Lys Pro Pro 
145 150 155 160 

Glu Val Thr Asn Arg Tyr Leu Ser Gin Leu Lys Asp Ala His Arg Ser 
165 170 175 

His Pro Phe He Lys Glu Tyr Gin Ala Lys Glu Asn Asp Phe Asp Arg 
180 185 190 

Leu Val Leu Gin Tyr Ala Pro Ser Ala Glu Ala Gly Pro Glu Leu Ser 
195 - 200 205 

Gly Pro Xaa 
210 



(2) INFORMATION FOR S?Q ID NO: 249: 
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{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 548 aitdno acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 

Met Glu Asp Ser Glu Ala Leu Gly Phe Glu His Met Gly Leu Asp Pro 
1 5 10 15 

10 Arg Leu Leu Gin Ala Val Thr Asp Leu Gly Trp Ser Arg Pro Thr Leu 
20 25 30 

He Gin Glu Lys Ala He Pro Leu Ala Leu Glu Gly Lys Asp Leu Leu 
35 40 . 45 

15 

Ala Arg Ala Arg Thr Gly Ser Gly Lys Thr Ala Ala Tyr Ala He Pro 
50 55 60 

Met Leu Gin Leu Leu Leu His Arg Lys Ala Thr Gly Pro Val Val Glu 
20 65 70 75 80 

Gin Ala Val Arg Gly Leu Val Leu Val Pro Thr Lys Glu Leu Ala Arg 
85 90 95 

25 Gin Ala Gin Ser Mfet He Gin Gin Leu Ala Thr Tyr Cys Ala Arg Asp 
100 105 110 

Val Arg Val Ala Asn Val Ser Ala Ala Glu Asp Ser Val Ser Gin Arg 
115 120 125 

30 

Ala Val Leu Met Glu Lys Pro Asp Val Val Val Gly Thr Pro Ser Arg 
130 135 140 

He Leu Ser His Leu Gin Gin Asp Ser Leu Lys Leu Arg Asp Ser Leu 
35 145 150 155 160 

Glu Leu Leu Val Val Asp Glu Ala Asp Leu Leu Phe Ser Phe Gly Phe 
165 * 170 175 

40 Glu Glu Glu Leu Lys Ser Leu Leu Cys His Leu Pro Arg He Tyr Gin 
180 185 190 

Ala Phe Leu Met Ser Ala Thr Phe Asn Glu Asp Val Gin Ala Leu Lys 
195 200 205 

45 

Glu Leu He Leu His Asn Pro Val Thr Leu Lys Leu Gin Glu Ser Gin 
. 210 215 220 

Leu Pro Gly Pro Asp Gin Leu Gin Gin Phe Gin Val Val Cys Glu Thr 
50 225 230 235 240 

Glu Glu Asp Lys Phe Leu Leu Leu Tyr Ala Leu Leu Lys Leu Ser Leu 
245 250 255 

55 He Arg Gly Lys Ser Leu Leu Phe Val Asn Thr Leu Glu Arg Ser Tyr 
260 • 265 270 

Arg Leu Arg Leu Phe Leu Glu Gin Phe Ser He Pro Thr Cys Val Leu. 
275 280 285 

60 
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Asn Gly Glu Leu Pro Leu Arg Ser Arg Cys His lie lie Ser Gin Phe 
290 295 300 

Asn Gin Gly Phe Tyr Asp Cys Val lie Ala Thr Asp Ala Glu Val Leu 
5 305 310 315 320 

Gly Ala Pro Val Lys Gly Lys Arg Arg Gly Arg Gly Pro Lys Gly Asp 
325 330 335 

10 Lys Ala Ser Asp Pro Glu Ala Gly Val Ala Arg Gly He Asp Phe His 
340 345 350 

His Val Ser Ala Val Leu Asn Phe Asp Leu Pro Pro "Thr Pro Glu Ala 
355 360 365 

Tyr He His Arg Ala Gly Arg Thr Ala Arg Ala Asn Asn Pro Gly He 
.370 375 380 

Val Leu Thr Phe Val Leu Pro Thr Glu Gin Phe His Leu Gly Lys He 
20 385 390 395 400 

Glu Glu Leu Leu Ser Gly Glu Asn Arg Gly Pro He Leu Leu Prp Tyr 
405 410 415 

25 Gin Phe Arg Met Glu Glu He Glu Gly Phe Arg Tyr Arg Cys Arg Asp 
420 425 430 

Ala Met Arg Ser Val Thr Lys Gin Ala He Arg Glu Ala Arg Leu Lys 
435 440 445 

30 

Glu He Lys Glu Glu Leu Leu His Ser Glu Lys Leu Lys Thr Tyr Phe 
450 455 460 

Glu Asp Asn Pro Arg Asp Leu Gin Leu Leu Arg His Asp Leu Pro Leu 
35 465 470 475 480 

His Pro Ala Val Val Lys Pno His Leu Gly His Val Pro Asp Tyr Leu 
485 490 495 

40 Val Pro Pro Ala Leu Arg Gly Leu Val Arg Pro His Lys Lys Arg Lys 
500 505 510 

Lys Leu Ser Ser Ser Cys Arg Lys Ala Lys Arg Ala Lys Ser Gin Asn 
515 520 525 

45 

Pro Leu Arg Ser Phe Lys His Lys Gly Lys. Lys Phe Arg Pro Thr Ala 
530 535 540 

Lys Pro Ser Xaa 
50 545 



(2) INFORMATICS FOR SEQ ID NO: 250: 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 299 amino acids 

(B) TYPE: azcdno acid 
(D) TOPOLOGY: linear 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 
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Met Thr Thr Val Pro Pro Ser Pro Arg Pro Met Ser Arg Pro. Ser Glu 
1 5 10 15 

Arg Asn Met Arg Arg Pro Arg Gly Pro Ser Pro Leu Pro Ala Ser Pro 
20 25 30 

Arg Asn Ser Thr Pro Asp Glu Pro Asp Val His Phe Ser Lys Lys Phe 
35 i 40 45 

Leu Asn VaX Phe Met Ser Gly Arg Ser Arg Ser Ser Ser Ala Glu Ser 
50 55 60 



Phe Gly Leu Phe Ser Cys lie lie Asn Gly Glu Glu Gin Glu Gin Thr 
15 65 70 75 80 

His Arg Ala lie Phe Arg Phe Val Pro Arg His Glu Asp Glu Leu Glu 
85 90 95 

20 Leu Glu Val Asp Asp Pro Leu Leu Val Glu Leu Gin Ala Glu Asp Tyr 
100 105 110 



25 



Trp Tyr Glu Ala Tyr Asn Met Arg Thr Gly Ala Arg Gly Val Phe Pro 
115 120 125 

Ala Tyr Tyr Ala lie Glu Val Thr Lys Glu Pro Glu His Met Ala Ala 
130 135 140 



Leu Ala Lys Asn Ser Asp Trp Val Asp Gin Phe Arg Val Lys Phe Leu 
30 145 150 155 160 

Gly Ser Val Gin Val Pro Tyr His Lys Gly Asn Asp Val Leu Cys Ala 
165 170 175 

35 Ala Met Gin Lys He Ala Thr Thr Arg Arg Leu Thr Val His Phe Asn 
180 185 190 



40 



Pro Pro Ser Ser Cys Val Leu Glu He Ser Val Arg Gly Val Lys lie 
195 200 205 

Gly Val Lys Ala Asp Asp Ser Gin Glu Ala Lys Gly Asn Lys Cys Ser 
210 215 220 



His Phe Phe Gin Leu Lys Asn He Ser Phe Cys Gly Tyr His Pro Lys 
45 225 230 . 235 240 

Asn Asn Lys Tyr Phe Gly Phe He Thr Lys His Pro Ala Asp His Arg 
245 250 255 

50 Phe Ala Cys His Val Phe Val Ser Glu Asp Ser Thr Lys Ala Leu Ala 
260 265 270 



55 



Glu Ser Val Gly Arg Ala Phe Gin Gin Phe Tyr Lys Gin Phe Val Glu 
275 280 285 

Tyr Thr Cys Pro Thr Glu Asp He Tyr Leu Glu 
290 295 



60 
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(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 andno acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIGN: SEQ ID NO: 251: 

Leu Leu Tyr Leu Leu Lys Val Xaa Val lie Phe Val Phe Ser Ser Ser 
1.5 10 15 

Lys Gly Val Thr Leu Val Ser Met Asn Leu Thr Ser Phe Phe Val Ser 
20 25 30 

Ser Val Leu Ala Cys Phe Ser Xaa 
35 40 



(2) INFORMATION FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 594 amino acids 
^ (B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ. ID NO: 252: 

Met Pro Ala Ser Ser Leu Glu Ser Arg Ser Phe Leu Leu Ala Lys Lys 
15 10 15 

Ser Gly Glu Asn Val Ala Lys Phe lie lie Asn Ser Tyr Pro Lys Tyr 
20 25 30 

Phe Gin Lys Asp lie Ala Glu Pro His lie Pro Cys Leu Met Pro Glu 
35 40 45 

Tyr Phe Glu Pro Gin lie Lys Asp lie Ser Glu Ala Ala Leu Lys Glu 
50 55 60 

Arg He Glu Leu Arg Lys Val Lys Ala Ser Val Asp Met Phe Asp Gin 
65 70 75 80 

Leu Leu Gin Ala Gly Thr Thr Val Ser Leu Glu Thr Thr Asn Ser Leu 
85 90 95 

Leu Asp Xaa Leu Cys Tyr Tyr Gly Asp Gin Glu Pro Ser Thr Asp Tyr 
100 105 110 

His Phe Gin Gin Thr Gly Gin Ser Glu Ala Leu Glu Glu Glu Asn Asp 
115 120 125 

Glu Thr Ser Arg Arg Lys Ala Gly His Gin Phe Gly Val Thr Trp Arg 
130 135 140 

Ala Lys Asn Asn Ala Glu Arg He Phe Ser Leu Met Pro Glu Lys Asn 
145 150 155 160 



Glu His Ser Tyr Cys Thr Met He Arg Gly Met Val Lys His Arg Ala 
165 170 175 
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Tyr Glu Gin Ala Leu Asn Leu Tyr Thr Glu Leu Leu Asn Asn Arg Leu 
180 185 190 

His Ala Asp Val Tyr Thr Phe Asn Ala Leu He Glu Ala Thr Val Cys 
195 200 205 

Ala He Asn Glu Lys Phe Glu Glu Lys Trp Ser Lys He Leu Glu Leu 
210 215 220 

Leu Arg His Met Val Ala Gin Lys Val Lys Pro Asn Leu Gin Thr Phe 
225 230 235 240 

Asn Thr He Leu Lys Cys Leu Arg Arg Phe His Val Phe Ala Arg Ser 
245 250 255 

Pro Ala Leu Gin Val Leu Arg Glu Met Lys Ala He Gly He Glu Pro 
260 265 270 

Ser Leu Ala Thr Tyr His His He He Arg Leu Phe Asp Gin Pro Gly 
275 280 285 

Asp Pro Leu Lys Arg Ser Ser Phe He He Tyr Asp He Met Asn Glu 
290 295 300 

Leu Met Gly Lys Arg Phe Ser Pro Lys Asp Pro Asp Asp Asp Lys Phe 
305 310 315 320 

Phe Gin Ser Ala Met Ser He Cys Ser Ser Leu Arg Asp Leu Glu Leu 

325 330 335 

Ala Tyr Gin Val His Gly Leu Leu Lys Thr Gly Asp Asn Trp Lys Phe 
340 345 350 

He Gly Pro Asp Gin His Arg Asn Phe Tyr Tyr Ser Lys Phe Phe Asp 
355 360 365 

Leu He Cys Leu Met Glu Gin. He Asp Val Thr Leu Lys Trp Tyr Glu 
370 375 380 

Asp Leu He Pro Ser Ala Tyr Phe Pro His Ser Gin Thr Met He His 
385 390 395 400 

Leu Leu Gin Ala Leu Asp Val Ala Asn Arg Leu Glu Val He Pro Lys 
405 410 415 

He Trp Lys Asp Ser Lys Glu Tyr Gly His Thr Phe Arg Ser Asp Leu 
420 425 430 

Arg Glu Glu He Leu Met Leu Met Ala Arg Asp Lys His Pro Pro Glu 
435 . 440 445 

Leu Gin Val Ala Phe Ala Asp Cys Ala Ala Asp He Lys Ser Ala Tyr 
450 455 460 



Glu Ser Gin Pro He Arg Gin Thr Ala Gin Asp Trp Pro Ala Thr Ser 
465 470 475 480 



Leu Asn Cys He Ala He Leu Phe Leu Arg Ala Gly Arg Thr Gin Glu. 

485 490 495 
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Ala Trp Lys Met Leu Gly Leu Phe Arg Lys His Asn Lys lie Pro Arg 
500 505 510 

Ser Glu Leu Leu Asn Glu Leu Met Asp Ser Ala Lys Val Ser Asn Ser 
515 520 525 

Pro Ser Gin Ala lie Glu Val Val Glu Leu Ala Ser Ala Phe Ser Leu 
530 535 540 

Pro lie Cys Glu Gly Leu Thr Gin Arg Val Met Ser Asp Phe Ala lie 
545 550 555 560 

Asn Gin Glu Gin Lys Glu Ala Leu Ser Asn Leu Thr Ala Leu Thr Ser 
565 570 575 

Asp Ser Asp Thr Asp Ser Ser Ser Asp Ser Asp Ser Asp Thr Ser Glu 
580 585 590 

Gly Lys 



(2) INFORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 253: 

Met Lys Leu Asn Leu Cys lie Pro Asn Trp Ala Arg Cys Pro Leu Leu 
15 10 15 

Leu Leu Phe Pro Gin Leu Leu Pro Phe Gin Gly Glu Asp Asp Asp Pro 
20 25 30 

Leu Lys Ala Lys Ala Ala Asn Leu Val Glu Ala Val Pro Trp Gly He 
35 40 45 

Lys Ala Pro Ser Phe Gin Val Thr Cys Leu Val Arg Val Gin Leu Gin 
50 55 60 

Ser Cys Thr Pro Ser Arg Pro Ser Thr Leu Leu Ala Thr Ser Gin Ser 
65 70 75 80 

Pro Gly Arg He Ser Cys Tyr Ser Pro Leu Ser His Leu Pro Pro Val 
85 90 95 

Thr Thr Ser He Gin Pro Ser Pro Val Met Val Pro Phe Gin Tyr Gin 
100 105 110 

Ala Phe Leu Leu Gin Val Lys Glu Pro Ala Ala Gin Thr Leu Leu Gly 
115 120 125 



Gin Gin Xaa 
130 
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(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYWEi amino acid 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 

Met Arg Tyr His Ala Gin Leu lie Phe Cys He Phe Cys Xaa Phe Val 
15 10 15 

Phe Val Xaa Lys Xaa 
20 



(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 

Met Asn Asp Asn Ser Pro Asn His Ser Ser Ser Tyr Leu Pro Leu Pro 
15 10 15 

Leu Thr He Val He Leu Gin Thr Gly His Lys Gly Thr Leu Xaa 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 

Met His Phe Leu Phe Arg Phe He Val Phe Phe Tyr Leu Trp Gly Leu 
1 5 10 15 

Phe Thr Ala Gin Arg Gin Lys Lys Glu Glu Ser Thr Glu Glu Val Lys 
20 25 30 

He Glu Val Leu His Arg Pro Glu Asn Cys Ser Lys Thr Ser Lys Lys 
35 40 45 

Gly Asp Leu Leu Asn Ala His Tyr Asp Gly Tyr Leu Ala Lys Asp Gly 
50 55 60 

Ser Lys Phe Tyr Cys Ser Arg Thr Gin Asn Glu Gly His Pro Lys Trp 
65 70 75 80 

Phe Val Leu Gly Val Gly Gin Val He Lys Gly Leu Asp He Ala Met 
85 90 95 



Thr Asp Met Cys Pro Gly Glu Lys Arg Lys Val Val He Pro Pro Ser 
100 105 110 
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Phe Ala Tyr Gly Lys Glu Gly Tyr Ala Glu Gly Lys He Pro Pro Asp 
115 120 125 

Ala Thr Leu He Phe Glu He Glu Leu Tyr Ala Val Thr Lys Gly Pro 
130 135 140 

Arg Ser He Glu Thr . Phe Lys Gin He Asp Met Asp Asn Asp Arg Gin 
145 150 155 160 

Leu Ser Lys Ala Glu He Asn Leu Tyr Leu Gin Arg Glu Phe Glu Lys 
165 170 175 

Asp Glu Lys Pro Arg Asp Lys Ser Tyr Gin Asp Ala Val Leu Glu Asp 
180 185 190 

He Phe Lys Lys Asn Asp His Asp Gly Asp Gly Phe He Ser Pro Lys 
195 200 205 

Glu Tyr Asn Val Tyr Gin His Asp Glu Leu Xaa 
210 215 



(2) INFORMATIC»I FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERIffTICS: 

(A) XjENGTH: 50 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 

Met Trp Val He Arg Val Phe Gin Lys Thr Phe Leu Phe Phe Val Leu 
1 5 10 15 

Phe Trp Ser Val His Cys He Ser Asp Lys Phe Gly Cys Leu Trp His 
20 25 30 

Val Cys Met Lys Arg Glu Gly Asp Xaa Asn Cys Leu Ser Phe Ser Xaa 
35 40 45 

Leu Xaa 
50 



(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 

Met Pro Ser Gin Thr Glu Xaa Phe Ala Ala Cys Gly Gly His Ser Leu 
1 5 10 15 



Leu Leu Val Xaa Leu Pro Leu Gly Leu Fro Phe Cys Pro Arg Ala Aleu 
20 25 30 
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Leu Cys Asp Leu Pro Phe Ser Leu Pro Ser Phe Pro Gly Gin Ala Arg 
35 40 45 

Arg Gly Gly Ala Glu Lys Gin Gly Ala Glu Gly Arg Gly Leu Gin Val 
50 55 60 

Lys Pro Arg Gly Gin Arg Thr Phe Gin Val Ser Arg Thr Ala Pro Ala 
65 70 75 80 

Ala Pro Arg Ser Arg Gin Pro Arg Pro Pro Ala Ala Leu Pro Ala Leu 
85 90 95 

Gly Phe Gly Gly Arg Gly Val Ala Lys Gly Arg Phe Leu Cys Phe Trp 
100 105 110 

Cys Leu Tyr Met Leu Arg lie Asp Gin Xaa 
115 120 



(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TVPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 

Met Thr Ala Phe Cys Ser Leu Leu Leu Gin Ala Gin Ser Leu Leu Pro 
1 5 10 15 

Arg Thr Met Ala Ala Pro Gin Asp Ser Leu Arg Pro"- Gly Glu Glu Asp 
20 . 25 30 

Glu Gly Met Gin Leu Leu Gin Thr Lys Asp Ser Met Ala Lys Gly Ala 
35 40 45 

Arg Pro Gly Ala Xaa Arg Gly Arg Ala Arg Trp Gly Leu Ala Tyr Thr 
50 55 60 

Leu Leu His Asn Pro Thr Leu Gin Val Phe Arg Lys Thr Ala Leu Leu 
65 70 75 80 

Gly Ala Asn Gly Ala Gin Pro Xaa 
85 



(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 

Met lie Gin Val Ser Val Pro Leu Leu Thr He Met He Phe Leu Leu 
15 10 15 



Tyr Leu Gin He Gly Pro Gly Lys Leu Xaa 
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20 25 



(2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEECTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 

Met Leu Leu Asp Pro Phe lie Leu Leu Phe Cys Leu Phe Ser Thr Ala 
15 10 .15 

Ala Gin Ser Cys Leu Glu Phe He Tyr He Gin Phe Xaa 
20 25 



(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 

Met Lys Phe Leu Ser He Leu Leu Asp Asp Asn Asn Phe Xaa Leu Met 
1 5 10 15 . 

Leu Met Leu Ala Pro Phe Gly Cys Leu Ala Phe Glu Arg Ser Met Lys 
20 25 30 

Met Arg Asn Gly Ala Leu Gly Leu Glu Glu Val Xaa 
35 40 



(2) INFORMATION FOR SEQ ID NO: 263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 363 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 

Met Arg Thr Leu Phe Asn Leu Leu Tzp Leu Ala Leu Ala Cys Ser Pro 
1 5 10 15 

Val His Thr Thr Leu Ser Lys Ser Asp Ala Lys Lys Ala Ala Ser Lys 
20 25 30 

Thr Leu Leu Glu Lys Ser Gin Phe Ser Asp Lys Pro Val Gin Asp Arg 
35 40 45 

Gly Leu Val Val Thr Asp Leu Lys Ala Glu Ser Val Val Leu Glu His 
50 55 60 



Arg Ser 



Tyr Cys Ser Ala Lys Ala Arg Asp Arg His 



Phe Ala Gly Asp 
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65 



70 



75 



80 



Val Leu Gly Tyr Val Thr Pro Trp Asn Ser His Gly Tyr Asp Val Thr 
85 90 95 

Lys Val Phe Gly Ser Lys Phe Thr Gin He Ser Pro Val Trp Leu Gin 
100 105 110 

Leu Lys Arg Arg Gly Arg Glu Met Phe Glu Val Thr Gly Leu His Asp 
115 120 _ 125 

Val Asp Gin Gly Trp Met Arg Ala Val Arg Lys His Ala Lys Gly Leu 
130 135 , 140 

His He Val Pro Arg Leu Leu Phe Glu Asp Trp Thr Tyr Asp Asp Phe 
145 150 155 160 

Arg Asn Val Leu Asp Ser Glu Asp Glu He Glu Glu Leu Ser Lys Thr 
165 170 175 

Val Val Gin Val Ala Lys Asn Gin His Phe Asp Gly Phe Val Val Glu 
180 185 190 

Val Trp Asn Gin Leu Leu Ser Gin Lys Arg Val. Thr Asp Gin Leu Gly 
195 200 205 

Met Phe Thr His Lys Glu Phe Glu Gin Leu Ala Pro Val Leu Asp Gly 
210 215 220 

Phe Ser Leu Met Thr Tyr Asp Tyr Ser Thr Ala His Gin Pro Gly Pro 
225 230 235 240 

Asn Ala Pro' Leu Ser Trp Val Arg Ala Cys Val Gin Val Leu Asp Pro 
245 250 255 

Lys Ser Lys Trp Arg Ser Lys He Leu Leu Gly Leu Asn Phe Tyr Gly 
260 • 265 270 

Mfet Asp Tyr Ala Thr Ser Lys Asp Ala Arg Glu Pro Val Val Gly Ala 
275 280 285 

Arg Tyr He Gin Thr Leu Lys Asp His Arg Pro Arg Met Val Trp Asp 
290 295 300 

Ser Gin Xaa Ser Glu His Phe Phe Glu Tyr Lys Lys Ser Arg Ser Gly 
305 310 315 320 

Arg His Val Val Phe Tyr Pro Thr Leu Lys Ser Leu Gin Val Arg Leu 
325 330 335 

Glu Leu Ala Arg Glu Leu Gly Val Gly Val Ser He Trp Glu Leu Gly 
340 345 350 



Gin Gly Leu Asp Tyr Phe Tyr Asp Leu Leu Xaa 
355 360 



(2) INFORMATION FOR SEQ ID NO: 264: 
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(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 
(D) T0P0U>3Y: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264.: 

Leu Pro Thr Lys He Leu Val Lys Pro Asp Arg Thr Phe Glu He Lys 
15 10 15 

He Gly Gin Pro Thr Val Ser Tyr Phe Leu Lys Ala Ala Ala Gly He 
20 25 30 

Glu Lys Gly Ala Arg Gin Thr Gly Lys Glu Val Ala Gly Leu Val Thr 
35 40 45 

Leu Lys His Val Tyr Glu He Ala Arg He Lys Ala Gin Asp Glu Ala 
50 55 60 

Phe Ala Leu Gin Asp Val Pro Leu Ser Ser Val Val Arg Ser He He 
65 70 75 80 

Gly Ser Ala Arg Ser Leu Gly He Arg Val Val Lys Asp Leu Ser Ser 
85 90 95 

Glu Glu Leu Ala Ala Phe Gin Lys Glu Arg Ala He Phe Leu Ala Ala 
100 105 110 

Gin Lys Glu Ala Asp Leu Ala Ala Gin Glu Glu Ala Ala Lys Lys Xaa 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 265: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino £Lcid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265: 

Met Leu Leu Gin He His Pro Leu Leu Pro Ser Pro Thr He Pro His 
1 5 10 15 

He Leu Leu Leu Phe Leu Tyr Pro Thr Phe Ser He Leu Glu His Ser 
20 25 30 

Cys Ser Tyr Cys He Glu Tyr Leu Trp Val Cys Leu Leu Phe Cys Leu 
35 40 45 

Ser Leu Trp Phe Leu Xaa 
50 



(2) INFORMATICS FOR SEQ ID NO: 266: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 

Met Cys Leu Trp Cys Cys Gly Asp Val Cys Ser Gly Leu Ser Ser Leu 
15 10 15 

Leu Ser Leu Cys Val Cys Cys Val Val Leu Ala Val Cys 
20 25 



(2) INFORMATION FOR SEQ ID NO: 267: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 

Glu Gly Leu Arg Leu Leu Leu Ser Leu Pro Ala Ala Leu Pro Arg Ser 
15 10 15 

Cys Cys His Pro Arg Trp Leu Pro Val Xaa 
20 25 



(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 221 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 

Met Phe His Gly He Pro Afa Thr Pro Gly He Gly Ala Pro Gly Asn 
15 10 15 

Lys Pro Glu Leu Tyr Glu Glu Val Lys Leu Tyr Lys Asn Ala Arg Glu 
20 25 30 

Arg Glu Lys Tyr Asp Asn Met Ala Glu Leu Phe Ala Val Val Lys Thr 
35 40 45 

Met Gin Ala Leu Glu Lys Ala Tyr He Lys Asp Cys Val Ser Pro Ser 
50 55 60 

Glu Tyr Thr Ala Ala Cys Ser Arg Leu Leu Val Gin Tyr Lys Ala Ala 
65 70 75 80 

Phe Arg Gin Val Gin Gly Ser Glu He Ser Ser He Asp Glu Phe Cys 
85 90 95 

Arg Lys Phe Arg Leu Asp Cys Pro Leu Ala Met Glu Arg He Lys Glu 
100 105 110 

Asp Arg Pro He Thr He Lys Asp Asp Lys Gly Asn Leu Asn Arg Cys 
115 120 125 
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lie Ala Asp Val Val Ser Leu Phe lie Thr Val Met Asp Lys Leu Arg 
130 135 140 

Leu Glu He Arg Ala Met Asp Glu lie Gin Pro Asp Leu Arg Glu Leu 
145 150 155 160 

Met Glu Thr Met His Arg Met Ser His Leu Pro Pro Asp Phe Glu Gly 
165 170 175 

Arg Gin Thr Val Ser Gin Trp Leu Gin Thr Leu Ser Gly Met Ser Ala 
180 . 185 190 

Ser Asp Glu Leu Asp Asp Ser Gin Val Arg Gin Met Leu Phe Asp Leu 
195 200 205 

Glu Ser Ala Tyr Asn Ala Phe Asn Arg Phe Leu His Ala 
210 215 220 



(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 amino acids 

(B) TYPE: axrdno acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 

M6t Lys Xaa 
1 



(2) INFORMATION FOR SEQ ID NO: 270.: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 

Met Gin Ala Pro Phe Xaa His Phe Ser Phe Arg Met Phe Ser Asn Leu 
1 5 10 .15 

Tyr Cys Phe Ser Asp Phe Gin Pro Asn lie Ser Pro Cys Pro Leu Cys 
20 25 30 

His C^s lie Leu Pro Xaa His His His Val Phe Leu Leu Leu Ala Val 
35 40 45 

Xaa 



(2) INFORMATION FOR SEQ ID NO: 271: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTIOfl : SEQ ID NO: 271: ■ 

Met Lys Leu Val Thr Met Phe Asp Lys Leu Ser Arg Asn Arg Val He 
1 5 10 15 

Gin Pro Met Gly Met Ser Pro Arg Gly His Leu Thr Ser Leu Gin Asp 
20 25 30 

Ala Met Cys Glu Thr Met Glu Gin Gin Leu Ser Ser Asp Pro Asp Ser 
35 40 45 

Asp Pro Asp Xaa 
50 



(2) INFORMATION FOR SEQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 

Met Ala Val Gly Glu Ala Val Phe Val Pro Leu Gin His Pro Pro Leu 
1.5.. 10 15 

Leu His Gly Ser Pro- He Pro Lys Leu Leu Pro Gly Pro Leu Leu Xaa 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 273: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUEDXZE DESCRIPTIC»T: SEQ ID NO: 273: 

Met Asn Gly Cys His Arg Arg Lys Arg Leu His Leu Cys Lys Thr He 
1 5 10 .15 

Tyr Leu Leu Trp Phe Val Phe Ser Phe Leu Leu Ser Asn Glu Val Val 
20 25 30 

Ser Ser His Trp His He Leu Arg Ala Val Gin He He Cys Thr Leu 
35 40" 45 

Phe His Arg Xaa He Ser Ala Phe Xaa 
50 55 



(2) INFORMATION FOR SEQ ID NO: 274: 
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(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 
• (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 

Met Gly Trp Val Ser Ser Pro His Val Lys Arg Arg Glu Cys Val Leu 
15 10 15 

Lys Lys Pro Phe Phe Xaa 
20 



(2) INFORMATION FOR SEQ ID NO: 275: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 

Met Phe Asn Phe Phe Lys Asn Pro Leu Leu Thr Cys Leu Phe lie Ser 
1.5 10 15 

Cys Tyr Leu Tyr Leu Ser Leu Leu Val Asn Lys Val Leu Phe Ala Glu 
20 25 30 

Glu Gly Leu Cys Cys Thr Tyr Cys Thr Thr Ser Asn Thr Gly Glu Gly 
35 40 45 

Gly Val Xaa 
50 



(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUE^K:E CHARACTERISTICS: 

(A) LENGTH: 2 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 

Met Xaa 
1 



(2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTEEIISTICS : 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 



Met Leu Cys Thr He Leu Thr Val Val He He He Ala Ala Gin Thr 
1 5 10 15 
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Thr Arg Thr Thr Gly lie Pro Lys Asn Ala Pro Gly Pro Ala Pro Leu 
20 25 30 

5 Cys Ala Pro Arg Ser Pro Arg Leu Phe Leu Gin Xaa Tyr Arg Gly Pro 
35 40 45 

Asn Gly Arg Pro Ala His Pro Phe Leu Gly Pro Ser Asp Leu Asp Thr 
50 55 60 



10 



15 



Ser Xaa 
65 



(2) INPORMATIC»I FOR SEQ ID NO: 278: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 257 aitdno acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 

Met Leu Gly Ala Lys Pro His Trp Leu Pro Gly Pro Leu His Ser Pro 
25 1 5 10 15 

Gly Leu Pro Leu Val Leu Val Leu Leu Ala Leu Gly Ala Gly Trp Ala 
20 25 ,30 

30 Gin Glu Gly Ser Glu Pro Val Leu Leu Glu Gly Glu Cys Leu Val Val 
35 40 45 

Cys Glu Pro Gly Arg Ala Ala Ala Gly Gly Pro Gly Gly Ala Ala Leu 
50 55 60 

35 

Gly Glu Ala Pro Fro Gly Arg Val Ala Phe Xaa Ala Val Arg Ser His 
65 70 • ' 75 80 

His His Glu Pro Ala Gly Glu Thr Gly Asn Gly Thr Ser Gly Ala lie 
40 85 90 95 

Tyr Phe Asp Gin Val Leu Val Asn Glu Gly Gly Gly Phe Asp Arg Ala 
' 100 105 110 

45 Ser Gly Ser Phe Val Ala Pro Val Arg Gly Val Tyr Ser Phe Arg Phe 
115 120 125 

His Val Val Lys Val Tyr Asn Arg Gin Thr Val Gin Val Ser Leu Met 
130 . 135 140 

50 

Leu Asn Thr Trp Pro Val lie Ser Ala Phe Ala Asn Asp Pro Asp Val 
•145 150 155 160 

Thr Arg Glu Ala Ala Thr Ser Ser Val Leu Leu Pro Leu Asp Pro Gly 
55 165 170 175 

Asp Arg Val Ser Leu Arg Leu Arg Arg Gly Xaa Ser Thr Gly Trp Leu 

180 185 190 

60 Glu lie Leu Lys Phe Leu Trp Leu Pro His Leu Pro Ser Leu Lys Asp 
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195 2:0 205 

Pro Ser Leu Ser Ser Thr Arg lie Gin Pro Leu Thr Thr Phe Phe Cys 
210 215 220 

Pro --eu leu Pro Zaa Lys Gin ICsa Lys Gin Xaa Xaa Xaa Ser Leu Trp 
225 25: 235 240 

Leu I#eu Ser His Leu Phe Ala Trp Glu Pro Val Pro Asn Thr Gin Val 
245 " 250 255 

Xaa 



(2) =NFOSM?CIC»I FCP. 5BZ ID NG: 279: 

(i) SSQfHNCE C-:?i?ACTH?J3TICS: 

(A) I^3GtH: 102 axaino acids 

(3) 1Y?H: amine acid 

(3) TOPCLCG"/: linear 
(xi) SECUESCE riSCHIrCIGN: SEQ ID NO: 279: 

Mew Ala Pro Arg Ala Leu Pro Gly Ser Ala Val Leu Ala Ala Ala Val 
i . ' 5 10 15 

Phe Val Gly Gly Ala Val Ser Ssr Pro Leu Val Ala Pro Asp Asn Gly 
20 25 30 

Ser Ser Arg Thr Leu His Ser Arg Thr Glu Thr Thr Pro Ser Pro Ser 
35 40 . 45 . 

Asn Asp Thr Gly Asn Gly His Pro Glu Tyr lie Ala Tyr Ala Leu Val 
50 55 60 

Pro Val Phe Phe lis Mez Giy leu Phe Gly Val Leu lie Xaa Pro Xaa 
55 7j 75 80 

Xaa 2aa Lys Lys Lys Gly Tyr Arg Cys Thr Thr Glu Ala Glu Gin Asp 
85 90 95 

lie Glu Glu Glu Lys Gly 2Caa 
100 



(2) 2NFCH^iXnCH FOR SEQ ID NO: 280: 

(i) SEQL-ENC2 CiARACISISTICS : 

^A) .LEI3GTH: 33 amino acids 

(3) TV?£: amino acid 

(D) TOPCLOGY: linear 
(xi) SEQUENCE ESSC31IPTIGN: SEQ ID NO: 280: 

Met Pro Val Thr Leu Ser Ser leu Gly Phe Trp Val Leu Leu Ser Leu 
1 5 * 10 15 



Leu Phe Pro Trp Arg Thr Asp Gin Gly Cys Gly Pro Ala Thr Cys Tyr 
20 25 30 
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(2) INFOPMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»I : SEQ ID NO: 281: 

Met Val Leu Gly Leu Leu Leu Leu Leu Xaa Phe Phe Ser Phe Ser Ser 
15 10 15 

Ser Pro Ser Pro Ser Ser Ser Leu Leu Leu Leu Ser Ser Phe Phe Phe 
20 25 30 

Gin Ser Leu Ala Leu Ser Pro Arg Leu Glu Xaa 
35 40 



(2) INFORMATION FOR SEQ ID NO: 282: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIOJ: SEQ ID NO: 282: 

Glu Trp Leu Val Phe Thr Phe Leu Leu Val Phe Gly Ser Pro Leu Gly 
15 10 15 

Lys Gly Pro Leu Xaa 
20 



(2) INFORMATim FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID. NO: 283: 

Met He Arg Ala Leu Ser Leu Phe Leu Leu He Phe Asp Ala Ala Leu 
15 10 15 

Phe Ser Leu Ser Val Phe Val Phe He Gly His Leu Leu Pro Met Pro 
20 25 30 

Lys . Gly Thr Gly Leu His Ser Cys Ala Lys His Leu He Lys Ser Leu 
35 40 45 

Lys Glu Asn Val Leu Pro Leu Met Asn Tyr Pro Asp Cys Lys Leu Lys 
50 55 60 
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He Asn He Ser Pro Xaa 
65 70 



(2) INFORMATION FOR SEQ ID NO: 284: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284: 

Met Gly Lys Leu He Arg Leu Ser Val Met Val Met Ser Val Arg Arg 
1 5 10 15 

Leu Phe Ser He Tyr Trp Val Leu Ser Thr Val Pro Asp Ala Val Gly 
20 25 30 

Ser Arg Gly Gly Met Glu Glu Glu Cys Ser Arg Gly Leu Cys Cys Val 
35 40 45 

Ala Gly Gin His Lys Gin Ala Lys Gly Lys Arg Gin Ala Trp Asn Lys 
50 55 60 

Gly Gly Glu Tyr Gin Cys Val Thr Tyr Cys Xaa 
65 70 75 



(2) INFOPMATIO? FOR SEQ ID NO: 285: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 

Met Pro Ala Leu Val Thr Leu Leu Leu Leu Phe Pro Leu Leu Pro Leu 
15 10 15 

Met Glu Ala Ser Cys His Val Met Arg Cys Pro Met Glu Arg Pro Thr 
20 25 30 

Xaa 



(2) INFORMATION FOR SEQ ID NO: 286: 

(i) SEQUENCE CHARACTERISTICS': 

(A) l£NGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 

Glu Ala Pro Trp Gly Leu Leu Lys Leu Leu Leu Leu Leu Ala Val Phe 
15 10 15 
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Xaa 



(2) INFORMATION FOR SEQ ID NO: 287: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 

Met Gin Gin Lys Gin Lys Lys Ala Asn Glu Lys Lys Glu Glu Pro Lys 
15 10 15 

Xaa 



(2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SBQfJmCB DESCRIPTION: SEQ ID NO: 288: 

Met Gin Arg Lys Val Ser Asp Phe lie lie His Gin Arg Leu Thr Val 
1 5 10 15 

Asn Leu Cys Val lie Ser Phe Phe Phe Phe Leu Pro lie Cys lie Phe 
20 25 30 

Ser Leu Ala Lys Lys Xaa, 
35 



(2) INFORMATION FOR SEQ ID NO: 289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289: 

Met Ala Leu Leu lie Ser Ser Leu lie Trp Ser Xaa 
1 5 10 . 



(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 

Met Gin Met Phe Thr Val Ser Leu Leu Leu Ser Leu Leu Leu Arg Ser 
1 5 10 .15 

Thr Asp Gin Asn His Leu Gin Leu Leu Val Gly Arg Glu Asp His Tyr 
20 25 30 

Gly Gly Xa& 
35 



(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 

Met Ser Glu Ser Ala Cys lie Leu Asn Asn Gin Lys Glu Leu Xaa 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 292: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 

Met Asp Leu Asp Arg Val Lys Ala Glu Ala Thr Glu Asp lie Thr Ser 
1 5 10 15 

Gly Val Leu Cys Leu Leu Phe Leu Arg Leu Pro Pro Asn Ser Cys lie 
20 25 30 

Phe Pro Ser Ala Val Leu Gly Ser Thr Arg Thr Xaa 
35 40 



(2) INFORMATION FOR SEQ ID NO: 293: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 136 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 

Val Val Gly Thr Gly Thr Ser Leu Ala Leu Ser Ser Leu Leu Ser Leu 
15 10 15 

Leu Leu Phe Ala Gly Met Gin Met Tyr Ser Arg Gin Leu Ala Ser Thr 
20 25 30 

Glu Trp Leu Thr He Gin Gly Gly Leu Leu Gly Ser Gly Leu Phe Val 
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35 ^40 45 

Phe Ser Leu Thr Ala Phe Asn Asn Leu Glu Asn Leu VaX Phe Gly Lys 
50 55 60 

Gly Phe Gin Ala Lys He Phe Pro Glu He Leu Leii Cys Leu Leu Leu 
65 70 75 80 

Ala Leu Phe Ala Ser Gly Leu He His Arg Val Cys Val Thr Thr Cys 
85 90 95 

Phe He Phe Ser Met Val Gly Leu Tyr Tyr He Asn Lys He Ser Ser 
100 105 iio 

Thr Leu Tyr Gin Ala Ala Ala Pro Val Leu Thr Pro Ala Lys Val Thr 
lis 120 125 

Gly Lys Ser Lys Lys Arg Asn Xaa 
130 135 



(2) INFORMATION FOR SEQ ID NO: 294: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LOIGTH: 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294: 

Met Phe He Phe Leu Phe Leu Cys Val Leu Ser Arg Lys He Gin Glu 
1 5 10 15 

Glu Tyr Tyr Arg Leu Phe Lys Asn Val Pro Cys Cys Phe Gly Cys Leu 
20 25 30 

Arg Xaa 



(2) INFORMATION FOR SEQ ID NO: 295: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEE^GTH: 137 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 

Met Arg Thr Pro Gly Pro Leu Pro Val Leu Leu Leu Leu Leu Ala' Gly 
15 10 15 

Ala Pro Ala Ala Arg Pro Thr Pro Pro Thr Cys Tyr Ser Arg Met Arg 
20 25 30 

Ala Leu Ser Gin Glu He Thr Arg Asp Phe Asn Leu Leu Gin Val Ser 
35 40 45 

Glu Pro Ser Glu Pro Cys Val Arg Tyr Leu Pro Arg Leu Tyr Leu Asp 
50 55 60 
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lie His Asn Tyr Cys Val Leu Asp Lys Leu Arg Asp. Phe Val Ala Ser 
65 70 75 80 

Pro Pro Cys Trp Lys Val Ala Gin Val Asp Ser Leu Lys Asp Lys Ala 
85 90 95 

Arg Lys Leu Tyr Thr lie Met Asn Ser Phe Cys Arg Arg Asp Leu Val 
100 105 110 

Phe Leu Leu Asp Asp Cys Asn Ala Leii Glu Tyr Pro He Pro Val Thr 
115 120 125 

Thr Val Leu Pro Asp Arg Gin Arg Xaa 
130 135 



(2) INFORMATION FOR SEQ ID NO: 296: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 

Met Trp Leu Leu Lys Pro Ser Ala His Ser Pro Val His Xaa Leu Val 
15 10 15 

h&x Leu Phe Pro Arg Gly Trp Ser Gin Pro Gly Thr His Lys Arg Gin 
20 25 30 

He Leu Val Asn Xaa Ala Ser Leu Pro Gly Gly Cys Leu Leu Pro Trp 
35 40 45 

He Trp Ser Gly Ala Ala Leu Arg Phe Xaa 

50 55 . . 



(2) INFORMATION FOR SEQ ID NO: 297: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY:, linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 

Met Ser Arg Arg Ala Glu Ala Ser He Phe Val Leu Pro Lys Thr Leu 
15 10 15 

Leu Phe Val Leu Phe Pro Ala Phe Pro Ser Pro Ala Val Gly Cys Pro 
20 25 30 

Val Pro Xaa 
35 



(2) INFORMATION FOR SEQ ID NO: 298: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298: 

Ser Cys Tyr lie Thr Pro Trp Ser Lys lie Gin Ser Phe Ser Leu Ser 
1 5 10 15 

Leu Phe Gin Phe He Leu Gin Glu Val Asn He Thr Leu Pro Glu Asn 
20 25 30 

Ser Val Trp Tyr Glu Arg Tyr Lys Phe Asp He Pro Val Phe His Leu 
35 40 45 

Asn Gly Gin Phe Leu Met Met His Arg Val Asn Thr Ser Lys Leu Glu 
50 55 60 

Lys Gin Leu Leu Lys Leu Glu Gin Gin Ser Thr Gly Xaa Xaa 
65 70 .75 



(2) INFORMATION FOR SEQ ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 299: 

Met Phe Val Leu Phe Ser Leu Pro Lys Tyr Ala Gly Leu Arg Leu Pro 
15 10 15 

He Pro Gly Leu Ser Ala Leu Leu Val Phe Leu Leu Ser Leu Phe Ser 
20 25 30 

Arg Arg Ala Gin Val Glu Leu Thr Thr Gly Arg Glu Thr Leu Pro Lys 
35 . 40 45 

Asn Leu Gin Gly Tyr Phe Pro Glu Phe Gly Phe Gin Val Gin Asn Phe 
50 55 60 

Leu Ser Cys Lys He Tyr Ala Ala Ser Gin Lys Gin Pro Leu Pro Pro 
65 70 75 80 

Leu Tyr Gin Leu Arg Phe Tyr Leu Lys His Met Gly Leu Pro Xaa 
85 90 " 95 



(2) INFORMATICS) FOR SEQ ID NO: 300: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 
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Met Ser Ser His Trp Thr Leu Lys lie Leu Leu Val Pro Leu Phe Tyr 
1 5 10 15 

Leu Ser Leu Glu Phe Pro Ser Gly Phe Val Leu Cys Leu Ala Asn Asp 
20 25 30 

Leu Gly Tyr His Phe Ser Ser Arg Val Arg Ser Xaa 
35 40 



10 



(2) INFORMATION FOR SEQ ID NO: 301: 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301: 

20 Mfet Leu Val Val Asn He Asn Leu Val Phe Leu Leu Phe Phe He Phe 
1 5 10 15 



25 



Leu Cys Tyr Leu Asp Ala Cys He Asn Val Phe Cys Phe Tyr Xaa 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 302: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 



35 



Met Pro Val Leu Pro Gly Arg Thr Thr Ala Leu Leu Ser Leu Thr Leu 
1 5 10 15 



Ala Phe Ala Val Pro Cys Ser Gly Val Glu Ala Gly Pro Cys Val Pro 
40 20 25 . 30 

Arg Ser His Gly Cys Ser Ser Trp Glu Ala Ser Val Cys Val Thr Ser 
35 40 45 

45 Ser Thr Pro Gly Gly Ser Trp Arg Ala Arg Ala Leu Phe Pro Ser Ala 
50 55 60 



50 



. Ala Trp His Arg Xaa Ala Ala Trp Asp Ser Pro Trp Thr Gin Thr Gly 
65 70 .75 80 

Asp Phe Ala Arg Gly Ala Met Gly Gly Ala Gly Ala Leu Pro Gly Gly 
85 90 95 



Cys Val Cys He Ser Gly Arg Pro Arg Ala Gin Lys Leu Pro Ala Leu 

55 100 105 no 

Xaa 



60 
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' (2) INFORMATION FOR SEQ ID NO: 303: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 

Thr His He His Thr His He He He Cys Ser Ser Val Xaa 
15 10 



(2) INFORMATION FOR SEQ ID NO: 304: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 

Met Glu Asn Phe Phe Phe Ser Phe Tyr Leu Phe Leu He Thr Leu He 
15 10 15 

Pro Asn Gly Arg Thr Leu Ser Thr Thr Ala Asp His Cys Lys He Pro 
20 25 30 

Cys He Xaa 
35 



(2) INFORMATION FOR SEQ ID NO: 305: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: ^35 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 305: 

Met Glu Leu Trp Glu Leu Ala Leu Cys Leu Leu Val Ala Leu Ser Ala 
1 5 10 15 

His Met Phe Thr Val Gin Leu Leu Ala Asp Leu Gly Phe Leu Phe Gly 
20 25 30 

Gly Phe Xaa 
35 



(2) INFORMATION FOR SEQ ID NO: 306: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIOI: SBQ ID NO: 306: 
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Met Gly Ala 31y lie leu Ala Leu L&j. Leu Pro Leu Glu Ser Val Leu 
1 5 10 . 15 

Thr Cys Ser rrp lie Ser Val Ser Thr Ser Glu Arg Gin Leu Trp Gin 
2C 25 30 

Ser Ser Gin Lys Ala Thr He Leu Ser Leu Lys Leu Asp Ser Cys Phe 
35 , 40 45 

Cys Gly His Ser Gly leu Lys Gly Lys Asn Glu Asp Thr Asp Ser Ser 
50 55 60 

Val Pro He He Pre Ser Lys Thr His Thr His Leu Gly Lys His Leu 
55 70 75 80 

He Xaa 



(2) INFCH5ff.TICK ?Cr. SEQ ID TO: 307: 

(i) SEQL-ZNCH CHARACTHPJISTICS: 

{A) iSXaTIH: 72 airdno acids 

(3) TYPE: aaino acid 

(D) TCPCLCXr/: linear 
{xi) SHQUSvCE CHSCHIPTICN: SEQ ID NO: 307: 

Xet Phe Tyr Phe Val Leu Phe He Tyr Ser Ser Ser Glu Thr Trp Ser 
1 5 10 15 

Gly Ser Val Ala Gli: As? Gly Val His Gly Val He He Gly His Cys 
20 25 30 

Ser Val Glu Leu Prs 'Gly Ser Gly Asp Pro Pro Ala Ser Ala Xaa Leu 
35 40 45 

Val Ala Gly Thr He Gly Thr Cys Pro Thr Met Pro Gly Phe Val Tyr 
50 55 60 

Phe Leu Asn Asp Val Xaa Asn Xaa 
55 ' 70 



(2) IMFCSMATICK FC?. S5Q ID NO: 308: 

(i) SEQL^ZNCZ CHAHACTERISTICS: 

(A) LE>?GrH: 34 amino acids 

(B) TYP3: amino acid 
(D) TOPOLOGY: linear 

<xi} SEQIUEISCS DESCRIPTICN: SEQ ID NO: 308: 

Met Asp Ser Thr Leu Arg Gin Gly Arg Xaa Leu Leu Thr Leu Val Pro 
1 5 10 ' 15 

Ala Ser Leu Phe Ser Leu Thr Leu Gly Gly Pro Gly Pro Trp Lys Asp 
20 25 30 



Pro Xaa 
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(2) INFORMATI(»I FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TVPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 

Met Gin Val Val Gly Ser Trp Pro Gly Arg Val Gly Val Val Gly Leu 
1 5 10 15 

Ala Phe Ser Leu Val lie Pro Pro Pro Ala lie Cys lie Ala Gly Pro 
20 25 30 

Ala Pro Gly Leu Gly Gly Gly Glu Arg Gin Gin Lys Gly Leu Gly Arg 
35 40 45 

Gly Gly Gly Gly Leu Arg Asn Cys Pro Gly Arg Val Gly Met Ala Ala 
50 55 60 

Glu Pro Gly Ala Leu Leu Cys Leu Thr Ser Arg Asp Gly Ser Leu Leu 
65 70 75 80 

Leu Ser Cys Val Arg Pro His His Val lie Lys Pro Lys Gly Thr Ala 
85 90 95 

Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Xaa Xaa 
100 105 110 

Gly Gly Xaa 
115 



(2) INFORMATICS FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 

Met Asp Leu Pro Gin Phe He Tyr Leu Phe He Phe Cys Phe Cys Cys 
1 5 10 15 

Leu Ala He Val Asn Asn Ala Ser He Asn He His He Gin Val Ser 
20 25 30 

Met Trp Leu Tyr Val Phe He Ser Leu Gly Tyr Leu His Gly Ser Arg 
35 40 45 

He Leu Gly His Asn He He Leu Cys Leu Thr Ser . Gin Arg He Ala 
50 55 60 

Lys Arg Phe Phe He Val Ala Ala Ser Phe Thr Phe Pro Pro Ala Met 
65 70 75 80 
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Tyr Lys Asp Phe Tyr Phe Ser lie Ser lieu His Leu Pro Thr Leu Leu 
85 90 95 



5 Phe Xaa Xaa Xaa Phe Val Phe Ser Leu Leu Pro Pro 
100 105 



10 . (2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 

Pro Ser Leu Ser Ser Ser Pro Pro Pro Leu Leu Gin Val 
5 10 15 

Phe Phe Phe Ser Pro His Trp Ala Ala Lys Val Val Pro 
20 25 30 

Gin Trp Lys Xaa Arg His Pro Gin Val Ser Ser Gin Leu Leu Leu Cys 
25 35 40 45 

Phe Leu Arg Val Asn Cys Gin Phe Leu Phe Leu Gin Glu He Leu Phe 
50 55 . 60 

30 Xaa 
65 



Met Cys Ser 
1 

20 

Phe Phe Phe 



35 (2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: "50 amino acids 

(B) TYPE: amino acid 
40 (D) TOPOLOGY: linear 

(xi) SEC^JENCE' DESCRIPTION: SEQ ID NO: 312: 

Leu Ser Arg Trp Lys He Phe Tyr Thr Leu Leu He Leu Phe 
5 10 15 

Phe Ser He Thr Ser Glu Xaa Glu Thr Phe Tyr Mtet He He 
20 25 30 

He His His Asn Pro Thr Gin He Thr Ala Ser Cys Ser * Phe Thr Phe 
50 35 40 45 

Leu Xaa 
50 

55 

(2) INFORMATION FOR SEQ ID NO: 313: 



Met Cys 
1 

45 

Xaa Xaa 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LQIGTH: 293 amino acids 
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(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 

5 Met Glu Arg Pro Asp Trp Glu Thr Ala He Gin Lys Pro Leu Cys Ser 
15 10 15 

Leu Pro Ala Gly Ser Gly Asn Ala Leu Ala Ala Ser Leu Asn His Tyr 
20 25 30 

10 

Ala Gly Tyr Xaa Gin Val Thr Asn Glu Asp Leu Leu Thr Asn Cys Thr 
35 40 45 • 

Leu Leu Leu Cys Arg Arg Leu Leu Ser Pro Met Asn Leu Leu Ser Leu 
15 50 55 60 

His Thr Ala Ser Gly Leu Arg Leu Phe Ser Val Leu Ser Leu Ala Trp 
65 70 75 80 

20 Gly Phe He Ala Asp Val Asp Leu Glu Ser Glu Lys Tyr Arg Arg Leu 

85 90 95 

Gly Glu Met Arg Phe Thr Leu Gly Thr Phe Leu Arg Leu Ala Ala Leu 
100 105 110 

25 

Arg Thr Tyr Arg Gly Arg Leu Ala Tyr Leu Pro Val Gly Arg Val Gly 
115 120 125 

Ser Lys Thr Pro Ala Ser Pro Val Val Val Gin Gin Gly Pro Val Asp 
30 130 135 ' 140 

Ala His Leu Val Pro Leu Glu Glu Pro Val Pro Ser His Trp Thr Val 
145 150 155 160 

35 Val Pro Asp Glu Asp Phe Val Leu Val Leu Ala Leu Leu His Ser His 
165 170 175 

Leu Gly Ser Glu Met Phe Ala Ala Pro Met Gly Arg Cys Ala Ala Gly 
180 185 190 

40 

Val Met His Leu Phe Tyr Val Arg Ala Gly Val Ser Arg Ala Met Leu 
195 200 205 

Leu Arg Leu Phe Leu Ala Met Glu Lys Gly Arg His Met Glu Tyr Glu 
45 210 215 220 

Cys Pro Tyr Leu Val Tyr Val Pro Val Val Ala Phe Arg Leu Glu Pro 
225 230 235 240 

50 Lys Asp Gly Lys Gly Val Phe Ala Val Asp Gly Glu Leu Met Val Ser 
245 250 255 

Glu Ala Val Gin Gly Gin Val His Pro Asn Tyr Phe Trp Met Val Ser 
260 265 270 

55 

Gly Cys Val Glu Pro Pro Pro Ser Trp Lys Pro Gin Gin Met Pro Pro 
275 280 285 



60 



Pro Glu Glu Pro Leu 
290 
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(2) INFORMATION FOR SEQ ID NO: 314: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC^I : SEQ ID NO: 314: 

Met Pro Leu Glu Gly Phe Cys Leu Val Leu Asp lie Gly Phe Leu Leu 
1 5 10 15 

Val Met Leu He Ser Leu Ala Ser Glu Cys Phe Thr Thr Cys Leu Asp 
20 25 30 

Ser Phe Ser Thr Thr Glu Pro Gly Cys Lys Phe Tyr Lys Leu Leu His 
35 40 45 

Ser Val Ser Leu Leu Asn lie Asn Phe Asn Val Lys Ser Leu Leu Cys 
50 55 60 

Ser His He Xaa 
65 



(2) IJ3F0RMATI0N FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SE(^}ENCE DESCRIPTION: SEQ ID NO: 315: 

Met Pro Leu Gin Leu Ser Gly Gin Tyr Trp He Ser Leu Leu Val Phe 
1 5 * 10 15 

Leu Ser Leu Gin Pro Phe Pro Gin Ala Ala He Pro Cys Ala Leu Thr 
20 25 30 

Asp Val Gly Gly Ser Cys Val He Cys His He Leu Leu Asn Cys Leu 
35 40 45 

Cys He Leu Phe Thr Leu Thr Ala Pro Ser Leu Ser His Val Leu Leu 
50 55 60 

He Lys Met Ser Leu Ser Val Cys Tyr Glu Pro Gly Ala Asp Leu Ser 
65 70 75 80 

Asp Arg Ala Ala Thr Gly Asn Lys Lys Leu Thr Arg Ser Thr Cys Leu 
85 90 95 

Leu Met His Ser Asn Lys Leu Cys Xaa 
100 105 



(2) INFORMATION FOR SEQ ID NO: 316: 
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(i) SEQUENCE CHAKACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 

Met Trp Gly Cys Ser Gly Leu Gly His Arg Thr Val Ser Phe Leu Leu 
15 10 15 

Leu Leu Pro Cys Ser Phe Pro Arg Pro Cys Xaa Leu Phe Gly Leu lie 
20 25 30 

Pro lie Ser Arg Pro Cys Lys Val Glu Ala Pro Arg Leu Ser Val Pro - 
35 40 45 

Xaa Leu Ser Cys Ala Ser His Pro Tyr Cys Asn Cys Pro Met Ser Thr 
50 55 60 

Ser Cys Pro Leu Pro Arg Xaa 
65 70 



(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQtffiNCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 

Met Leu Asn Val Leu Ser Lys Val Gin Gin Leu Val Ser Xaa Leu Gly 
1 . 5 10 15 

Leu Val Thr Phe Leu Leu Asn His Ser Ala Ala Gly Gly Ser Pro Gin 
20 25 30 

His Arg Trp Leu Leu Leu Xaa 
35 



(2) INFORMATICS? FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IaWGTH: 72 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 

Met Lys Ala lie Ala Arg Ala Cys Leu Leu Leu Ser Leu Leu Val Leu 
15 10 15 

Pro His Val Val Ser Glu His Leu Phe Trp His His Asn Pro Arg His 
20 25 30 

Pro Val He Trp Pro Phe Pro Pro Phe His Leu He Ser Cys Ser Val, 
35 40 45 
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Ser Ala Ser Thr Trp His Leu Gly Glu Xaa Leu Leu Leu Leu Val Pro 
50 55 60 

lie Ala Pro Ser Val Trp Ser Xeia 
65 10 



(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 

Met Glu Gin Gly Gly Gly Pro Arg Leu Leu Leu Leu lie Pro Gly Leu 
15 10 15 

Leu His Asn Thr Tyr Leu Ala Arg Pro Gly Asp Phe Pro Ala Gin Gly 
20 . 25 . 30 

Thr Thr Glu Asn Thr Glu Cys Gin Gly Ser Pro Ser Pro He Ser His 
35 40 45 

Leu Gly Lys Val Arg Ser Leu Asp Ser Asn Thr Gin He Xaa 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIOI: SEQ ID NO: 320: 

Met Pro Leu Leu Phe Phe Ser Val Ser Thr Leu Phe Ser Gly Ser Val 
15 10 15 

Thr Leu Gin Gin Arg Gly Met Phe Leu Pro Trp Thr Gly Thr Gly Glu 
20 25 30 



Gin Val Leu Ala Leu Leu Tzp Pro Arg Phe Glu Leu He Leu Glu Met 
35 40 45 

Asn Val Gin Ser Val Arg Ser Thr Asp Pro Gin Arg Leu Gly Gly Leu 
50 55 60 

Asp Thr Arg Pro His Tyr He Thr Arg Arg Tyr Ala Glu Phe Ser, Ser 
65 .70 75 80 

Ala Leu Val Ser He Asn Gin Thr He Pro Asn Glu Arg Thr Met Gin 
85 90 95 



Leu Leu Gly Gin Leu Gin Val Glu Val Glu Asn Phe Val Leu Arg Val 
100 105 110 



Ala Ala Glu Phe Ser Ser Arg Lys Glu Gin Leu Val Phe Leu He Asn 
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115 



120 



125 



Asn Tyr Asp Met Met Leu Gly Val Leu Met Glu Arg Ala Ala Asp Asp 
130 135 140 



Ser Lys Glu Val Glu Ser Phe Gin Gin Leu Leu Asn Ala Arg Thr Gin 
145 150 155 160 

Glu Phe lie Glu Glu Leu Leu Ser Pro Pro Phe Gly Gly Leu Val Ala 
165 170 175 

Phe Val Lys Glu Ala Glu Ala Leu lie Glu Arg Gly Gin Ala Glu Arg 
180 185 190 

Leu Arg Gly Glu Glu Ala Arg Val Thr Gin Leu lie Arg Gly Phe Gly 
195 200 205 



Ser Ser Trp Lys Ser Ser Val Glu Ser Leu Ser Gin Asp Val Met Arg 
210 215 220 

Ser Phe Thr Asn Phe Arg Asn Gly Thr Ser lie He Gin Gly Ala Leu 
225 230 235 240 

Thr Gin Leu He Gin Leu Tyr His Arg Phe His Arg Val Leu Ser Gin 
245 250 255 

Pro Gin Leu Arg Ala Leu Pro Ala Arg Ala Glu Leu He Asn He His 
260 265 270 

His Leu Met Val Glu Leu Lys Lys His Lys Pro Asn Phe Xaa 
275 280 285 



(2) INFORMATION FOR SEQ ID NO: 321: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TVPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 



Met Phe Arg Ala Leu Arg Asp Leu Leu Thr His Tyr Pro Gin Gin He 
15 10 15 

Leu Leu Gin Val Leu Val Val Met Tyr Gin Val Leu Gin Val Trp Glu 
20 25 30 

Leu Pro Trp Pro Glu Leu He His Leu Gin Gly He Val Pro Thr Asp 
35 40 45 

Gin Leu His Leu Lys Gin Xaa 
50 55 



(2) INFORMATION FOR SEQ ID NO: 322: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LEMxTH: 59 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 

Asp Phe Val Pro Val Leu Val Phe Val Leu lie Lys Ala Asn Pro Pro 
15 10 15 

Cys Leu Leu Ser Thr Val Gin Tyr lie Ser Ser Phe Tyr Ala Ser Cys 
20 25 30 

Leu Ser Gly Glu Glu Ser Tyr Trp Trp Met Gin Phe Thr Ala Ala Val 
35 40 45 

Glu Phe lie Lys Thr lie Asp Asp Arg Lys Xaa 
50 55 



(2) JNFOFMATIW FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 
(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»J: SEQ ID NO: 323: 

Met His Pro Ala Arg Lys Leu Leu Ser Leu Leu Phe Leu lie Leu Met 
15 10 15 

Gly Thr Glu Leu Thr Gin Asp Ser Ala Ala Pro Asp Ser Leu Leu Arg 
20 25 30 . 

Ser Ser Lys Gly Ser Thr Arg Gly Ser Leu Ala Ala lie Val He Trp 
35 40 45 

Arg Gly Lys Ser Glu Ser Arg He Ala Lys Thr Pro Gly He Phe Arg 
50 55 60 

Gly Gly Gly Thr Leu Val Leu Pro Pro Thr His Thr Pro Glu Trp Leu 
65 70 75 80 

He Leu Pro Leu Gly He Thr Leu Pro Leu Gly Ala Pro Glu Thr Gly 
85 90 95 

Gly Gly Asp Cys Ala Ala Glu Thr Trp Lys Gly Ser Gin Arg Ala Gly 
100 105 110 

Gin Leu Cys Ala Leu Leu Ala Xaa 
115 120 



(2) INFORMATION FOR SEQ ID NO: 324: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 
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Phe Phe Leu Val Val Phe Ser Leu Ser Phe Xaa Pro Ser Val Leu Thr 
1 5 10 15 

Ser Pro Val His Xaa Pro His Cys Cys Gin Xaa Asp Xaa lie Leu Phe 
20 25 30 

Lys.Asn Thr Leu Xaa Xaa Phe Xaa Ala Lys Tyr Xaa 
35 40 



(2) INFORMATION FOR SEQ ID NO: 325: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 

Met Phe Ser Arg Thr Ser Asn Phe Trp Thr Phe Phe Phe Gin Phe Leu 
15 10 15 

He Phe Lys Val Phe Leu Val Leu Lys Asn Xaa Phe Thr Ser Gin Lys 
20 25 30 

He Xaa Xaa He Xaa Xaa Glu Lys Pro Lys Lys Lys Lys Xaa Arg Gly 
35 40 45 

Gly Arg Ala Pro Ser Pro Gin Gly Gly Pro Xaa 
50 55 



(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: ^18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»J: SEQ ID NO: 326: 

Met Gly Leu Leu He Phe Met Leu Leu He Gly He His Ser Gin Cys 
1 5 10 15 

Ser Xaa 



(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SBQXJEiiJCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 



M^t Val Leu Phe Cys Phe Val Leu Phe Cys Phe Val Phe Glu Met Asp. 
15 10 15 
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Ser Ser Ser Val Thr Gin Ala Gly VaL Gin Trp Cys Asp Leu Gly Ser 
20 25 30 

Leu Gin Ala Pro Pro Pro Gly Phe Ser Pro Phe Ser Cys Leu Ser Leu 
35 40 45 

Pro Ser Ser Trp Asp Tyr Arg Arg Pro Pro Pro Arg Pro Ala Asn Phe 
50 55 60 

Leu Tyr Phe Leu Val Glu Thr Gly Phe His His Val Ser Gin Asp Gly 
65 70 75 80 

Leu Asp Leu Leu Thr Ser Xaa 
85 



(2) INFORMATIOM FOR SEQ ID NO: 328: 

(i) SEQUENCE CHAEy^CTERISTICS: 

(A) LENGTH: 538 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328: . 

Met Ser Thr Lys Lys Leu Cys lie Val Gly Gly He Leu Leu Val Phe 
15 10 15 

Gin He He Ala Phe Leu Val Gly Gly Leu He Ala Pro Gly Pro Thr 
20 25 30 

Thr Ala Val Ser Tyr Met Ser Val Lys Cys Val Asp Ala Arg Lys Asn 
35 40 45 

His His Lys Thr Lys Trp Phe Val Pro Trp Gly Pro Asn His Cys Asp 
50 55 60 

Lys He Arg Asp He Glu Glu Ala He Pro Arg Glu He Glu Ala Asn 
65 70 75 80 

Asp He Val Phe Ser Val His He Pro Leu Pro His Met Glu Met Ser 
85 90 95 

Pro Trp Phe Gin Phe Met Leu Phe He Leu Gin Leu Asp He Ala Phe 
100 105 110 

Lys Leu Asn Asn Gin He Arg Glu Asn Ala Glu Val Ser li&t Asp Val 
115 120 125 

Ser Leu Ala Tyr Arg Asp Asp Ala Phe Ala Glu Trp Thr Glu Met Ala 
130 135 140 

His Glu Arg Val Pro Arg Lys Leu Lys Cys Thr Phe Thr Ser Pro Lys 
145 150 155 160 

Thr Pro Glu His Glu Gly Arg Tyr Tyr Glu Cys Asp Val Leu Pro Phe 
165 170 175 

Met Glu He Gly Ser Val Ala His Lys Phe Tyr Leu Leu Asn He Arg 
180 185 190 
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Leu Pro Val Asn Glu Lys Lys Lys lie Asn Val Gly He Gly Glu He 
195 200 205 

Lys Asp He Arg Leu Val Gly He His Gin Asn Gly Gly Phe Thr Lys 
210 215 220 

Val Trp Phe Ala Met Lys Thr Phe Leu Thr Pro Ser He Phe He He 
225 230 235 240 

Met Val Trp Tyr Trp Arg Arg He Thr Met Met Ser Arg Pro Pro Val 
245 250 255 

Leu Leu Glu Lys Val He Phe Ala Leu Gly He Ser Met Thr Phe He 
260 265 270 

Asn He Pro Val Glu Trp Phe Ser He Gly Phe Asp Trp Thr Trp Met 
275 280 285 

Leu Leu Phe Gly Asp He Arg Gin Gly He Phe Tyr Ala Met Leu Leu 
290 295 300 

Ser Phe Trp. He He Phe Cys Gly Glu His Met Met Asp Gin His Glu 
305 310 315 320 

Arg Asn His He Ala Gly Tyr Trp Lys Gin Val Gly Pro He Ala Val 

325 330 335 

Gly Ser Phe Cys Leu Phe He Phe Asp Met Cys Glu Arg Gly Val Gin 
340 345 350 

Leu Thr Asn Pro Phe Tyr Ser He Trp Thr Thr Asp He Gly Thr Glu 
355 360 365 

Leu Ala Met Ala Phe He He Val Ala Gly He Cys Leu Cys Leu Tyr 
370 375 380 

Phe Leu Phe Leu Cys Phe Met Val Phe Gin Val Phe Arg Asn He Ser 
385 390 395 400 

Gly Lys Gin Ser Ser Leu Pro Ala Met Ser Lys Val Arg Arg Leu His 
405 410 415 

Tyr Glu Gly Leu He Phe Arg Phe Lys Phe Leu Met Leu He Thr Leu 
420 425 430 

Ala Cys Ala Ala Met Thr Val He Phe Phe He Val Ser Gin Val Thr 
435 440 445 

Glu Gly His Trp Lys Trp Gly Gly Val Thr Val Gin Val Asn Ser Ala 
450 455 460 

Phe Phe Thr Gly He Tyr Gly Met Trp Asn Leu Tyr Val Phe Ala Leu 
465 470 475 480 

Met Phe Leu Tyr Ala Pro Ser His Lys Asn Tyr Gly Glu Asp Gin Ser 
485 490 495 



Asn Gly Met Gin Leu Pro Cys Lys Ser Arg Glu Asp Cys Ala Leu Phe 
500 505 510 
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Val Ser Glu Leu Tyr Gin Glu Leu Phe Ser Ala Ser Lys.Tyr Ser Phe 
515 520 525- 

lie Asn Asp Asn Ala Ala Ser Gly lie Xaa 
530 535 



(2) INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 202 amino acids 

(B) TVPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 

Met Gly He Ala Leu Ala Val Leu Gly Trp Leu Ala Val Met Leu Cys 
1 5 10 15 

Cys Ala Leu Pro Met Trp Arg Val Thr Ala Phe He Gly Ser Asn He 
20 25 30 

Val Thr Ser Gin Thr He Trp Glu Gly Leu Trp Met Asn Cys Val Val 
35 40 45 

Gin Ser Thr Gly Gin Met Gin Cys Lys Val Tyr Asp Ser Leu Leu Ala 
50 55 60 

Leu Pro Gin Asp Leu Gin Ala Ala Arg Ala Leu Val He He Ser He 
65 70 75 ■ 80 

He Val Ala Ala Leu Gly Val Leu Leu Ser Val Val Gly Gly Lys Cys 
85 90 95 

Thr Asn Cys Leu Glu Asp Glu Ser Ala Lys Ala Lys Thr Met He Val 
100 105 110 

Ala Gly Val Val Phe Leu Leu Ala Gly Leu Met Val He Val Pro Val 
115 120 125 

Ser Trp Thr Ala His Asn He He Gin Asp Phe Tyr Asn Pro Leu Val 
130 135 140 

Ala Ser Gly Gin Lys Arg Glu Met Gly Ala Ser Leu Tyr Val Gly Trp 
145 150 155 160 

Ala Ala Ser Gly Leu Leu Leu Leu Gly Gly Gly Leu Leu Cys Cys Asn 
165 170 175 

Cys Pro Pro Arg Thr Asp Lys Pro Tyr Ser Ala Lys Tyr Ser Ala Ala 
180 185 190 

Arg Ser Ala Ala Ala Ser Asn Tyr Val Xaa 
195 200 



(2) INFORMATION FOR SEQ ID NO: 330: 



wo 98/54963 PCT/US98/1 1422 

534 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

Met Ala Thr Val Thr Ala Thr Thr Lys Val Pro Glu He Arg Asp Val 
1 5 10 15 

10 Thr Arg He Glu Arg He Gly Ala His Ser His He Arg Gly Leu Gly 
20 25 30 

Leu Asp Asp Ala Leu Glu Pro Arg Gin Ala Ser Gin Gly Met Val Gly 
35 "40 45 

15 

Gin Leu Ala Ala Arg Arg Ala Ala Gly Val Val Leu Glu Met He Arg 
50 55 60 

Glu Gly Lys He Ala Gly Arg Ala Val Leu He Ala Gly Gin Pro Gly 
20 65 70 75 80 

Thr Gly Lys Thr Ala He Ala Met Gly Met Ala Gin Ala Leu Gly Pro 
85 90 95 

25 Asp Thr Pro Phe Thr Ala He Ala Gly Ser Glu He Phe Ser Leu Glu 
100 105 110 

Met Ser Lys Thr Glu Ala Leu Thr Gin Ala Phe Arg Arg Ser He Gly 
115 120 125 

30 

Val Arg He Lys Glu Glu Thr Glu lie He Glu Gly Glu Val Val Glu 
130 135 140 

He Gin He Asp Arg Pro Ala Thr Gly Thr Gly Ser Lys Val Gly Lys 
35 145 150 155 160 

Leu Thr Leu Lys Thr Thr Glu Met Glu Thr lie Tyr Asp Leu Gly Thr 
165 ' * 170 175 

40 Lys Met He Xaa Ser Leu Thr Lys Asp Lys Val Gin Ala Gly Asp Val 
180 185 190 

He Thr He Asp Lys Ala Thr Gly Lys He Ser Lys Leu Gly Arg Ser 
195 200 205 

45 

Phe Thr Arg Ala Arg Glu Leu Arg Arg Tyr Gly Leu Pro Asp Gin Val 
210 215 220 

Arg Ala Val Pro Arg Trp Gly Ala Pro Glu Thr Gin Gly Gly Gly Ala 
50 225 230 235 240 

His Arg Val Pro Ala Arg Asp Arg Arg His Gin Leu Ser His Pro Gly 
245 250 255 



55 Leu Pro Gly Ala Leu Leu Arg 
260 



60 (2) INFORMATION FOR SEQ ID NO: 331: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 260 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 

Met Leu Ala Leu Leu Gly Leu Ser Gin Ala Leu Asn He Leu Leu Gly 
15 10 15 

Leu Lys Gly Leu Ala Pro Ala Glu He Ser Ala Val Cys Glu Lys Gly 
20 25 30 

Asn Phe Asn Val Ala His Gly Leu Ala Trp Ser Tyr Tyr He Gly Tyr 
35 40 45 

Leu T^g Leu He Leu Pro Glu Leu Gin Ala Arg He Arg Thr Tyr Asn 
50 55 60 

Gin His TVr Asn Asn Leu Leu Arg Gly Ala Val Ser Gin Arg Leu Tyr 
^5 70 75 80 

He Leu Leu Pro Leu Asp Cys Gly Val Pro Asp Asn Leu Ser Met Ala 
85 90 95 

Asp Pro Asn He Arg Phe Leu Asp Lys Leu Pro Gin Gin Thr Gly Asp 
100 105 110 

Arg Ala Gly He Lys Asp Arg Val Tyr Ser Asn Ser He Tyr Glu Leu 
U5 120 125 

Leu Glu Asn Gly Gin Arg Ala Gly Thr Cys Val Leu Glu Tyr Ala Thr 
130 135 140 

Pro Leu Gin Thr Leu Phe Ala Met Ser Gin Tyr Ser Gin Ala Gly Phe 
145 150 155 160 

Ser Gly Glu Asp Arg Leu Glu Gin Ala Lys Leu Phe Cys Arg Thr Leu 
165 170 175 

Glu Asp He Leu Ala Asp Ala Pro Glu Ser Gin Asn Asn Cys Arg Leu 
180 185 190 

He Ala Tyr Gin Glu Pro Ala Asp Asp Ser Ser Phe Ser Leu Ser Gin 
195 200 205 

Glu Val Leu Arg His Leu Arg Gin Glu Glu Lys Glu Glu Val Thr Val 
210 215 220 

Gly Ser Leu Lys Thr Ser Ala Val Pro Ser Thr Ser Thr Met Ser Gin 
225 230 235 240 

Glu Pro Glu Leu Leu He Ser Gly Met Glu Lys Pro Leu Pro Leu Arg 
245 250 255 

Thr Asp Phe Ser 
260 
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(2) INFORMATION FOR SEQ ID NO: 332: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 

Met Thr Pro Gin Lys Pro Ala Leu Ala Val Leu Leu Leu Glu Val Pro 
10 1 5 10 15 

Leu Leu Leu Thr Leu Ser Val Leu Lys Lys Arg Cys Leu Val Thr Cys 
20 25 30 

15 Glu Pro Thr Ser Arg Phe Val Ser Cys Asp Leu Pro Leu Ser Val Xaa 
35 .40 45 



20 



(2) INFORMATION FOR SEQ ID NO: 333: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 334 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333: 

30 

Met Ala Ala Ala Ala Trp Leu Gin Val Leu Pro Val lie Leu Leu Leu 
15 10 15 

Leu Gly Ala His Pro Ser Pro Leu Ser Phe Phe Ser Ala Gly Pro Ala 
35 20 25 30 

Thr Val Ala Ala Ala Asp Arg Ser Lys Trp His lie Pro lie Pro Ser 
35 40 45 

40 Gly Lys Asn Tyr Phe Ser Phe Gly Lys lie Leu Phe Arg Asn Thr Thr 
50 55 60 

lie Phe Leu Lys Phe Asp Gly Glu Pro Cys Asp Leu Ser Leu Asn lie 
65 70 75 80 

45 

i Thr Trp Tyr Leu Lys Ser Ala Asp Cys Tyr Asn Glu lie Tyr Asn Phe 
85 90 95 

Lys Ala Glu Glu Val Glu Leu Tyr Leu Glu Lys Leu Lys Glu Lys Arg 
50 100 105 110 

Gly Leu Ser Gly Lys Tyr Gin Thr Ser Ser Lys Leu Phe Gin Asn Cys 
115 120 125 

55 Ser Glu Leu Phe Lys Thr Gin Thr Phe Ser Gly Asp Phe Met His Arg 
130 135 140 

Leu Pro Leu Leu Gly Glu Lys Gin Glu Ala Lys Glu Asn Gly Thr Asn 
145 150 155 160 

60 
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Leu Thr Phe lie Gly Asp Lys Thr Ala Met His Glu Pro Leu Gin Thr 
. 165 170 175 

Trp Gin Asp Ala Pro Tyr lie Phe He Val His He Gly He Ser Ser 
5 180 185 190 

Ser Lys Glu Ser Ser Lys Glu Asn Ser Leu Ser Asn Leu Phe Thr Met 
195 200 205 

10 Thr Val Glu Val Lys Gly Pro Tyr Glu Tyr Leu Thr Leu Glu Asp Tyr 
210 215 220 

Pro Leu Met He Phe Phe Met Val Met Cys He Val Tyr Val Leu Phe 
225 230 235 240 

15 

Gly Val Leu Trp Leu Ala Trp Ser Ala Cys Tyr Trp Arg Asp Leu Leu 
245 250 255 

Arg He Gin Phe Trp He Gly Ala Val He Phe Leu Gly Met Leu Glu 
20 260 265 270 

Lys Ala Val Phe Tyr Ala Glu Phe Gin Asn He Arg Tyr Lys Gly Xaa 
275 280 285 

25 Ser Val Gin Gly Ala Leu He Leu Ala Glu Leu Leu Ser Ala Val Lys 
290 295 300 

Arg Ser Leu Ala Arg Thr Leu Val He He Val Ser Leu Gly Tyr Gly 
305 310 315 320 

30 

He Val Lys Pro Arg Leu Glu Ser Leu Phe He Arg Leu Xaa 
325 330 



35 

(2) INFORMATION FOR SEQ ID NO: 334: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 200 amino acids 
40 (6) TYPE: amino acid 

<D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 334: 

Met Val Leu Xaa Val Val Thr Leu Gly Leu Ala Leu Phe Thr Leu Cys 
45 1 5 10 15 

Gly Lys Phe Lys Arg Trp Lys Leu Asn Gly Ala Phe Leu Leu He Thr 
20 25 30 

50 Ala Phe Leu Ser Val Leu He Trp Val Ala Trp Met Thr Met Tyr Leu 
35' 40 45 

Phe Gly Asn Val Lys Leu Gin Gin Gly Asp Ala Trp Asn Asp Pro Thr 
50 55 60 

55 

Leu Ala He Thr Leu Ala Ala Ser Ala Gly Ser Ser Ser Ser Ser Thr 
65 70 75 80 

Pro Ser Leu Arg Ser Thr Ala Pro Phe Cys Gin Pro Cys Arg Arg Thr 
60 85 90 .95 
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Arg Pro Thr Thr Ser Thr Arg Arg Ser Pro Gly Cys Gly Arg Arg Pro 
100 105 110 

Ser Arg Arg Thr Cys Ser Cys Arg Gly Pro lie Trp Arg Thr Arg Pro 

115 120 125 

Ser Pro Trp Met Asn Thr Met Gin Leu Ser Glu Gin Gin Asp Phe Pro 
130 135 140 

Thr Ala Ala Trp Glu Lys Asp Pro Val Ala Ala Trp Gly Lys Asp Pro 
145 150 155 160 

Ala Leu Arg Leu Glu Ala Thr Cys lie Ser Gin Leu Arg Trp Pro Ser 
165 170 175 

Cys Ser Thr Val Gly Pro Ser Gin Leu Leu Arg Gin Val Thr Gin Glu 
180 185 190 

Xaa Thr Phe Gly Glu Arg Leu Xaa 
195 200 



(2) INFORMATI<:»J FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIOJ: SEQ ID NO: 335: 

Met Leu Leu His His Gin Leu Leu lie Val Thr Leu His Leu Val Leu 
15 10 15 

Leu Leu Ala Thr Leu Leu Val Xaa . 
20 



(2) INFORMATION FOR SEQ ID NO: 336: 

(i) SEQUENCE CHARACTERISTICS: 

(A) I<ENGTH: 143 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE INSCRIPTION: SEQ ID NO: 336: 

Met Thr Lys Ala' Leu Leu lie TVr Leu Val Ser Ser Phe Leu Ala Leu 
15 10 15 

Asn Gin Ala Ser Leu lie Ser Arg Cys Asp Leu'Ala Gin Val Leu Gin 
20 25 30 

Leu Glu Asp Leu Asp Gly Phe Glu Gly Tyr Ser Leu Ser Asp Trp Leu 
35 40 45 

Cys Leu Ala Phe Val Glu Ser Lys Phe Asn lie Ser Lys lie Asn Glu* 
50 55 ' 60 



wo 98/54963 



539 



Asn Ala Asp Gly Ser Phe Asp Tyr Gly Leu Phe Gin lie Asn Ser His 
65 70 75 80 

Tyr Trp Cys Asn Xaa Tyr Lys Ser Tyr Ser Glu Asn Leu Cys His Val 
85 90 95 

Asp Cys Gin Asp Leu Leu Asn Pro Asn Leu Leu Ala Gly lie His Cys 
100 105 • 110 

Ala Lys Arg lie Val Ser Gly Ala Arg Gly Met Asn Asn Trp Val Arg 
115 120 125 

Met Glu Xaa Cys Thr Val Gin Ala Gly His Ser Ser Thr Gly Xaa 
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 337: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 

Met Leu Val lie Ala Gly Gly lie Leu Ala Ala Leu Leu Leu Leu lie 
15 10 15 

Val Val Val Leu Cys Leu Tyr Phe Lys lie His Asn Ala Leu Lys Ala 
20 25 30 

Ala Lys Glu Pro Glu Ala Val Ala Val Lys Asn His Asn Pro Asp Lys 
35 40 45 

Val Trp Trp Ala Lys Asn Ser Gin Ala Lys Thr lie Ala Thr Glu Ser 
50 55 60 

Cys Pro Ala Leu Gin Cys Cys Glu Gly Tyr Arg Met Cys Ala Ser Phe 
65 70 75 80 

Asp Ser Leu Pro Pro Cys Cys Cys Asp He Asn Glu Gly Leu Xaa 
85 90 95 



(2) INFORMATION FOR SEQ ID NO: 338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 

Met Leu I^eu Lys Ser Asn He Leu Met Leu Asn Leu Phe Ala Ala Asn 
1 .5 10 15 

Val Gly Ala Asn Phe Ala Leu Thr Val Glu Lys He Gly Met He Leu 
20 25 30 



Leu Asn Val Ser Gly Xaa 
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35 



(2) INFORMATION FOR SEQ ID NO: 339: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 

Met Leu Val Val Ala Phe Gly Leu Leu Val Leu Tyr lie Leu Leu Ala 
1.5 10 15 

Ser Ser Trp Lys Arg Pro Glu Pro Gly lie Leu Thr Asp Arg Gin Pro 
20 25 30 

Leu Leu His Asp Gly Glu Xaa 
35 



(2) INFORMATION FOR SEQ ID NO: 340: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 

Ser Asp Pro Leu Ala Ser Ala Ser Gin Asn Ala Gly lie Val Ser Val 
15 10 15 

Gly Leu Cys Thr Arg Pro Gly Pro Gin Phe Lys Asn Ala Gin Pro Pro 
20 25 30 

Phe Pro Xaa Gin Lys Ala Pro Arg Cys Leu Trp Glu Asn Gin Pro Pro 
35 40 45 

Pro Trp Arg Lys Ala Trp Asp Leu Pro Ser His Leu Gly Arg Arg Gly 
50 55 60 • 

He Cys Gly Lys Ser Phe Xaa 
65 • 70 



(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

<B) TYPE: amino acid. 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 

Tyr Val Met He Phe Lys Lys Glu Phe Ala Pro Ser Asp Glu Glu Leu 
15 10 15 

Asp Ser Tyr Arg Arg Gly Glu Glu Trp Asp Pro Gin Lys Ala Glu Glu 
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20 25 30 

Lys Arg Asn Xaa Lys Glu Leu Ala Gin Arg Gin Xaa Gly Gly Gly Ser 
35 40 45 

Pro Ala Gly Ala Cys Gly Gly Glu Pro Cys Gin Arg Leu Gin Gly Gin 
50 55 60 

Val Gin Pro Pro His Arg Gin Gly Ser Ser Gin Arg Arg Ser Pro His 
65 70 75 80 

Ala Thr Gly Gin Xaa 
85 



{2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 

Met Trp Asp Trp Asp Trp Ser Ala Pro Trp Ser Trp Pro Leu Trp Leu 
1 5 10 .15 

Ser Leu Ala Leu Val Cys Leu Ser Ala Gly Ala Lys Gly His Arg Ala 
20 25 30 

Ser Glu Ala Gly His Ala Arg Ala Leu Thr Cys Glu Met Gly Ser Glu 
35 40 45 

Phe Xaa Thr Ala Xaa Gly Leu Val Leu Gly Xaa Xaa Xaa Trp Thr Xaa 
50 55 * 60 

Xaa Asn Gly Ser Ala Gly Pro Glu Arg Arg Gly Trp Arg Pro Ala Ala 
65 70 75 80 

Phe Leu Ala Val Phe Leu Leu Gly Asp Xaa 
85 90 



(2) INFORMATION FOR SEQ ID NO: 343: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid ■ 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343: 

Met Phe Gly Pro Thr Phe His Ser Leu Val Leu Val Pro Pro Trp Pro 
15 10 15 

Asn Leu Ser Leu Leu His Phe Thr Ser Pro val Gly Gin His Ser Ser 
20 25 30 



Phe Leu Pro Thr Ser Leu Arg Leu Xaa Lys Lys Lys Lys Lys Lys Lys 
35 40 45 
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(2) INFORMATION FOR SEQ ID NO: 344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LE(KSTH: 56 amino acids 

(B) TVPE: amino acid 
(D) TOPOLOGY: linear 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 



Met Cys Ser Lys Asn Gly Phe Leu 

1 • 5 

Trp Leu Pro Gin Ala Ser Leu Ala 
20 

Ser Asp Leu Val Gly Cys Ser Arg 
35 40 

His Ser Glu Arg Val Gin Gly Xaa 
50 55 



Leu Ala Trp Ser Trp Asn Ser Pro 
10 15 

His Gly Cys Trp Gly Arg Trp Met 
25 30 

Glu Asn Lys Cys Ala Leu Arg Asp 
45 



(2) INFORMATION FOR SEQ ID NO: 345: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

. (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345: 

Ser Pro Leu Xaa Phe Cys Val Val Leu Leu Leu Gin Ala Ala Arg Gly 
1 5 . 10 15 

Tyr Val Val Arg Lys Pro Ala Gin Ser Arg Leu Asp Asp Asp Pro Pro 

20 25 30 

Pro Ser Thr .Leu Leu Lys Asp Tyr Gin Asn Val Pro Gly lie Glu Lys 
35 40 45 



Val Asp Asp Val Val Lys Arg Leu Leu Ser Leu Glu Met Ala Asn Lys 
50 55 60 

Lys Glu Met Leu Lys lie Lys Gin Glu Gin Phe Met Lys Lys lie Val 
65 70 75 80 



Ala Asn Pro Glu Asp Thr Arg Ser Leu Glu Ala Arg lie lie Ala Leu 
85 90 95 

Ser Val Lys lie Arg Ser Tyr Glu Glu His Leu. Glu Lys His Arg Lys 
100 105 110 

Asp Lys Ala His Lys Arg Tyr Leu lieu Met Ser lie Asp Gin Arg Lys- 
115 120 125 
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Lys Met Leu Lys Asn Leu Arg Asn Thr Asn Tyr Asp Val Phe Glu Lys 
130 135 140 

He Cys Trp Gly Leu Gly He Glu Tyr Thr Phe Pro Pro Leu Tyr Tyr 
145 150 155 160 

Arg Arg Ala His Arg Arg Phe Val Thr Lys Lys Ala Leu Cys He Arg 
165 170 175 

Val Phe Gin Glu Thr Gin Lys Leu Lys Lys Arg Arg Arg Ala Leu Lys 
180 185 190 

Ala Ala Ala Ala Ala Gin Lys Gin Ala Lys Arg Arg Asn Pro Asp Ser 
195 200 205 

Pro Ala Lys Ala He Pro Lys Thr Leu Lys Asp Ser Gin Xaa 
210 215 220 . 



(2) INFORMATION FOR SBQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 

Met Gly Ala Pro Ala Ala Ser Leu Leu Leu Leu Leu Leu Leu Phe Ala 
15 10 15 

Cys Cys Trp Ala Pro Gly Gly Ala Asn Leu Ser Gin Asp Asp Ser Gin 
20 ^ 25 30 

Pro Trp Thr Ser Asp Glu Thr Val Val Ala Gly Gly Thr Val Val Leu 
35 40. 45 

Lys Cys Gin Val Lys Asp His Glu Asp Ser Ser Leu Gin Trp Ser Xaa 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 

Met Val Ala Pro Val Tzp Tyr Leu Val Ala Ala Ala Leu Leu Val Gly 
15 10 15 

Phe He Leu Phe Leu Thr Arg Ser Arg Gly Arg Ala Ala Ser Ala Gly 
20 25 30 



Gin Glu Pro Leu His Asn Glu Glu Leu Ala Gly Ala Gly Arg Val Ala 
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35 40 45 

Gin Pro Gly Pro Leu Glu Pro Glu Glu Pro Arg Ala Gly Gly Arg Pro 
50 55 60 

Arg Arg Arg Arg Asp Leu Gly Ser Arg Leu cin Ala Gin Arg Arg Ala 
65 70 75 80 

Gin Arg Val Ala Trp Ala Glu Ala Asp Glu Asn Glu Glu Glu Ala Val 
85 90 95 

He Leu Ala Gin Glu Glu Glu Gly Val Glu Lys Pro Ala Glu Xaa His 
100 105 110 

Leu Ser Gly Lys He Gly Ala Lys Lys Leu Arg Xaa Xaa Glu Glu Lys 
115 120 125 

Gin Ala Arg Lys Ala Gin Xaa Glu Ala Glu Glu Ala Glu Arg Glu Xaa 
130 135 140 

Arg Lys Arg Leu Glu Ser Gin Arg Glu Xaa 
145 150 



(2) INFORMATION FOR SEQ ID NO: 348: 

, (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTICWT: SEQ ID NO: 348: 

Met Gin Lys Cys Met Leu Ser Ala Leu Val Phe His He Gin Trp Ser 
15 10 15 

Xaa 



(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 

Met Leu Val Cys Ser Phe Leu Phe Leu Xaa 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 

Val lie Glu Leu Cys Val Ser Leu Arg Ser Leu Asn Phe Xaa 
15 10 



(2) INFORMATION FOR SEQ ID NO: 351: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 

Met Cys Glu Phe Xaa Xaa Xaa He Met Xaa Leu Ala Gly Tyr Phe Ala 
1 5 . 10 15 

Cys Xaa 



(2) INFORMATION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352: 

Met Val Gly Gly Tyr Val Ser Ser Phe Ser Phe Pro Pro Val Ser Ser 
1 5 10 15 

Ser Leu Leu Leu Pro Ala Ser Phe Ala Phe Pro Phe Leu Pro Gly Thr 
20 25 30 ' 

Pro Cys Pro Phe Leu Tyr Phe Leu Pro Ser Pro Phe Ser Pro Leu Pro 
35 40 45 

Leu Ser Leu Thr Arg Ser Asn Ser Phe Leu Leu Asn Gly Xaa 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 353: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LQIGTH: 33 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 

Glu Lys Lys Ser Met Ser Val Ser Asp He Tyr Ala Leu Glu Ser Leu 
15 10 15 

Gly Arg Ser Leu Phe Thr Leu Asn Ser Met Cys Leu Pro Leu Ser Phe 
20 25 30 ' 



Xaa 
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(2) INFORfilATION FOR SEQ ID NO: 354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 245 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354: 

Met Gly Gly Ala Ser Arg Arg Val Glu Ser Gly Ala Trp Ala Tyr Leu 
15 10 15 

Ser Pro Leu Val Leu Arg Lys Glu Leu Glu Ser Leu Val Glu Asn Glu 
20 25 30 

Gly Ser Glu Val Leu Ala Leu Pro Glu Leu Pro Ser Ala His Pro He 
35 40 45 

He Phe Trp Asn Leu Leu Trp Tyr Phe Gin Arg Leu Arg Leu Pro Ser 
50 55 60 

He Leu Pro Gly Leu Val Leu Ala Ser Cys Asp Gly Pro Ser Xaa Ser 
65 70 75 80 

Gin Ala Pro Ser Pro Trp Leu Thr Pro Asp Pro Ala Ser Val Gin Val 
85 90 95 

Arg Leu Leu Trp Asp Val Leu Thr Pro Asp Pro Asn Ser Cys Pro Pro 
100 105 110 

Leu Tyr Val Leu Trp Arg Val His Ser Gin He Pro Gin Arg Val Val 
115 120 125 

Trp Pro Gly Pro Val Pro Ala Ser Leu Ser Leu Ala Leu Leu Glu Ser 
130 13*5 140 

Val Leu Arg His Val Gly Leu Asn Glu Val His Lys Ala Val Gly Leu 
145 150 155 160 

Leu Leu Glu Thr Leu Gly Pro Pro Pro Thr Gly Leu His Leu Gin Arg 
165 170 175 

Gly He Tyr Arg Glu He Leu Phe Leu Thr IfeC Ala Ala Leu Gly Lys 
180 185 190 

Asp His Val Asp He Val Ala Phe Asp Lys Lys Tyr Lys Ser Ala Phe 
195 200 205 

Asn Lys Leu Ala Ser Ser Met Gly Lys Glu Glu Leu Arg His Arg Arg 
210 215 220 

Ala Gin Met Pro Thr Pro Lys Ala He Asp Cys Arg Lys Cys Phe Gly 
225 230 235 240 



Ala Pro Pro Glu Cys 
245 
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(2) INFORMATION FOR SEQ ID NO: 355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: axnino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTXQtl: SEQ ID NO: 355: 

Met Lys Phe Ser Lea Leu Phe Leu Pro Met Leu Leu lie Leu Lys Pro 
15 10 15 

Asp Leu Phe His lie Ser lie Cys Thr Leu Ala Ala Cys Gly Leu Thr 
20 25. 30 

Phe Pro Xaa 
35 



<2) INFORMATION FOR SEQ ID NO: 356: 

(i) SEQUENCE CHARACTERISTICS: v 

(A) LENGTH: 22 amino acids 

(B) TVPE: axnino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356: 

Met Leu Phe Phe Phe He Leu His Leu Leu Ser He Met Ser Phe Leu 
1 5 10 15 

Ser Pro Asp He Met Xaa 
20 



(2) INFORMATION FOR SEQ ID NO: 357: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357: 

Met Phe Gly Leu Leu Val Glu Ser Gin Thr Leu Leu Glu Glu Asn Ala 
15 10 15 

Val Gin Gly Thr Glu Arg Thr Leu Gly Leu Asn He Ala Pro Phe He 
20 25 30 

Asn Gin Phe Gin Val Pro He Arg Val Phe Leu Asp Leu Ser Ser Leu 
35 40 45 

Pro Cys He Pro Leu Ser Lys Pro Val Glu Leu Leu Arg Leu Asp Leu 
50 55 60 

Met Thr Pro Tyr Leu Asn Thr Ser Asn Arg Glu Val Lys Val Tyr Val 
65 70 75 80 ' 
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Cys Xaa He Trp Glu Asp Leu Thr Ala He Pro Phe Trp Val Ser Tyr 
85 90 95 



Val Pro 



(2) . INFORMATION FOR SEQ ID NO: 358: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LQ^GTH: 78 amino acids 

(B) TYPE: aitdno acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358: 

Met Phe Gly Ala His Arg Xaa Trp Gin Gly Ser Val Leu Leu Phe Leu 
1 5 10 15 

Ser Phe Ala Trp Gly Asn Gly Gly Ser Val Thr Phe Ser Asp Val Pro 
20 25 30 

Arg Val Met Pro Leu Ala Gly Gly Pro Xaa Xaa Gin Val Ser- Ser Thr 
35 40 45 

Pro Pro Pro His Gin Val Thr Ser Ser Pro Gly Leu Glu Ser 
55 60 

He Val Cys Pro Glu Arg Lys Lys Lys Lys Lys Lys 
70 75 



Pro Arg 
50 

Ala His 
65 



(2) INFORMATION FOR SEQ ID NO: 359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:^ 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 

Thr Leu Leu Xaa Phe Leu .Xaa Leu Leu Thr Thr Glu Gly Gly Arg Glu 
15 10 15 

Asn He Phe Xaa Gly Arg He Leu Xaa Leu Gin Xaa Ser Pro Xaa 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 360: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 



Met Leu Ser Phe Phe He Cys Leu Leu He Phe Val His Leu Leu Leu 
1 5 10 15 
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Leu Ser Phe Leu lie Ser Asp Trp Pro Pro Pro Thr Gly Ser Ala Xaa 
20 25 30 

His Lys He Leu Arg Leu Met Val Val Gin Arg Leu Ser Leu Leu Asp 
35 40 45 

Gin Arg Lys Arg Tip Ser Glu Ala Xaa 
50 55 



(2) INFORMATION FOR SEQ ID NO: 361: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 

Lys Tyr Xaa 
1 



(2) INFORMATION FOR SEQ ID NO: 362: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 aitdno acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362: 

Trp Ser Ser Ala Ser Ser Ser Trp Val Thr Thr Pro Glu Arg He Arg 
15 10 15 

Pro Arg Met Asp Thr Leu Pro Val Lys Gly His Phe Leu Ser Met Xaa 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 363: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363: 

Asp He Phe Val Phe Leu Leu Ser Thr Arg Ala Gly Gly Leu Gly He 
1 5 10 15 

Asn Leu Thr Ala Xaa Asp Thr Val His Phe Leu Xaa 
20 25 



(2) INFORMATION FOR SEQ ID NO: 364: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TVPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364: 

Thr Leu Thr Ser Phe Leu Glu Leu Pro Leu Ala Pro Glu Pro Xaa 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 365: 

y 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365: 

Met His Arg Tyr He Thr Phe Phe Lys Cys Phe Arg Ser Val He Leu 
15 10 15 

Asp Leu Leu Phe He Leu Ser Pro Leu Ser Gin Gly Cys Phe He Leu 
20 25 30 

Phe Xaa 



(2) INFORMATiafl FOR SEQ ID NO: 366: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOQf: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366: 

Met Phe Gly Phe He Phe Leu Leu Leu He Phe Cys He Xaa Leu Cys 
1 5 .10 15 

Ser Arg Thr Leu Ser Thr Phe He Pro Lys Leu Val Gly Phe Leu Tyr 
20 25 30 

Trp Lys Phe Ser He Asn Leu Ser Leu Leu Leu Thr Leu He Lys Lys 
35 40 45 

Lys Lys Lys Lys Lys Lys Thr Pro Arg Gly Gly Pro Gly Xaa Gin Ser 
50 55 60 

Pro Pro 
65 



(2) INF0RMATIC»7 FOR SEQ ID NO: 367: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 

Met Pro Gly Leu Gly Arg Pro Arg Gin Ala Arg Trp Thr Leu Met Leu 
1 S 10 15 

Leu Leu Ser Thr Ala Met Tyr Gly Ala His Ala Pro Leu Leu Ala Leu 
20 25 30 

Cys His Val Asp Gly Arg Val Pro Phe Arg Pro Ser Ser Ala Val Leu 
35 40 45 

Leu Thr Glu Leu Thr Lys Leu Leu Leu Cys Ala Phe Ser Leu Leu Val 
50 55 60 

Gly Trp Gin Ala Trp Pro Gin Gly Pro Pro Pro Trp Arg Gin Ala Ala 
65 70 75 80 

Pro Phe Ala Leu Ser Ala Leu Leu Tyr Gly Ala Asn Asn Asn Leu Val 
85 90 95 

lie Tyr Leu Gin Arg Tyr Met Asp Pro Ser Thr Tyr Gin Val Leu Ser 
100 105 110 

Asn Leu Lys lie Gly Ser Thr Ala Val Leu Tyr Cys Leu Cys Leu Arg 
115 120 125 

His Arg Leu Ser Val Arg Gin Gly Leu Ala Leu Leu Leu Leu Met Ala 
130 135 140 

Ala Gly Ala Cys Tyr Ala Ala Gly Gly Leu Gin Val Pro Gly Asn Thr 
145 150 155 160 

Leu Pro Ser Pro Pro Pro Ala Ala Ala Ala Ser Pro Met Pro Leu His 
165 170 175 

He Thr Pro Leu Gly Leu Leu Leu Leu He Leu Tyr Cys Leu He Ser 
180 185 190 

Gly Leu Ser Ser Val Tyr Thr Glu Leu Leu Met Lys Arg Gin Xaa Leu 
195 200 205 

Pro Leu Ala Leu Gin Asn Leu Phe Leu Tyr Thr Phe Gly Val Leu Leu 
210 215 220 

Asn Leu Gly Leu His Ala Gly Gly Gly Ser Gly Pro Gly Leu Leu Glu 
225 230 235 240 

Gly Phe Ser Gly Trp Ala Ala Leu Val Val Leu Ser Gin Ala Leu Asn 
245 250 255 

Gly Leu Leu Met Ser Ala Val Met Lys His Gly Ser Ser He Thr Arg 
260 265 270 

I-eu Phe Val Val Ser Cys Ser Leu Val Val Asn Ala Val Leu Ser Ala 
■ 275 280 285 

Val Leu Leu Arg Leu Gin Leu Thr Ala Ala Phe Phe Leu Ala Thr Leu 
290 295 300 
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Leu He Gly Leu Ala Met Arg Leu Tyr Tyr Gly Ser Arg 
305 310 315 



(2) INFORMATION FOR SEQ ID NO: 368: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TVPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368: 

Met Gly Glu Gin Pro His Phe Ser Leu Cys Val Leu Leu Ala Ala Val 
1. 5 10 15 

Arg Glu Asp Xaa Asp Pro Xaa Val Phe Pro Cys Cys Phe Leu Xaa 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 369: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369: 

Met Ser Phe lie Ala Leu His Pro Leu Leu Pro Glu Ala Ala Leu Gly 
1 5 10 15 

Val Pro Gly Gin Ser Pro His Arg Pro Leu Trp Gin Thr Gin Cys Cys 
20 25 30 

Val Ala Pro Pro Gin Pro Ajag. Ala Glu Phe Xaa 
35 40 



(2) INFORMATION FOR SEQ ID NO: 370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 255 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 

Met Val Thr Ala Leu Thr Leu Leu Ala Phe Pro Leu Leu Leu Leu His 
1 5 10 15 

Ala Glu Arg lie Ser Leu Val Phe Leu Leu Leu Phe Leu Gin Ser Phe 
20 25 30 

Leu Leu Leu His Leu Leu Ala Ala Gly He Pro Val Thr Thr Pro Gly 
35 40 45 

Pro Phe Thr Val Pro Trp Gin Ala Val Ser Ala Trp Ala Leu Met Ala 
50 55 60 
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Thr Gin Thr Phe Tyr Ser Thr Gly His Gin Pro Val Phe Pro Ala He 
65 70 75 80 

His Trp His Ala Ala Phe Val Gly Phe Pro Glu Gly His Gly Ser Cys 
85 90 95 

Thr Trp Leu Pro Ala Leu Leu Val Gly Ala Asn Thr Phe Ala Ser His 
100 105 110 

Leu Leu Phe Ala Val Gly Cys Pro Leu Leu Leu Leu Trp Pro Phe Leu 
115 120 125 

Cys Glu Ser Gin Gly Leu Arg Lys teg Gin Gin Pro Pro Gly Asn Glu 
130 135 140 

Ala Asp Ala Arg Val Arg Pro Glu Glu Glu Glu Glu Pro Leu Met Glu 
145 150 155 160 

Met Arg Leu Arg Asp Ala Pro Gin His Phe Tyr Ala Ala Leu Leu Gin 
165 170 175 

Leu Gly Leu Lys Tyr Leu Phe He Leu Gly He Gin He Leu Ala Cys 
180 185 190 

Ala Leu Ala Ala Ser He Leu Arg Arg His Leu Met Val Trp Lys Val 

195 200 205 

Phe Ala Pro Lys Phe He Phe Glu Ala Val Gly Phe He Val Ser Ser 
210 215 220 

Val Gly Leu Leu Leu Gly He Ala Leu Val Met Arg Val Asp Gly Ala 
225 230 235 240 



Val Ser Ser Trp Phe Arg Gin Leu Phe Leu Ala Gin Gin Arg Xaa 
245 250 255 



(2) INFDRMATICW FOR SEQ ID NO: 371: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»r: SEQ ID NO: 371: 

Met Xaa Gly Pro Trp Gly Glu Glu Ala Leu He Arg Leu Pro Thr Pro 
1 5 10 15 

Ser Gly Leu Xaa 
20 



(2) INFORMATION FOR SEQ ID NO: 372: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 
■ (B) TVPE: amino acid 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTIOI: SEQ ID NO: 372: 

Met Ala Thr Leu Glu Xaa Asn Gin Arg Glu Val Asp Arg Glu He Aro 
5 1 5 10 15 

Ser Leu Leu Leu Trp Phe Leu Leu Cys Glu lie Val Ser Gly Trp Leu 
20 25 30 

10 Cys Pro Glu Gly Pro Trp Phe Ser Gin Gly Cys Gin He Tyr Lys Asn 
35 40 45 

Leu Ser Ser Ser Ser Ser Tyr Asn Leu Ser Phe Leu Leu Ser Leu Xaa 

15 



20 

(2) INFORMATION FOR SEQ ID NO: 373: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373: 

Met He His Ser Gly Cys Thr Ser Gin Cys Leu Glu Gly Phe Phe Leu 
15 10 15 

He Phe Leu Leu Asp Phe Asn Pro Val Leu Ala Leu Asp Leu He Gly 
20 25 30 

35 He Met Arg Lys Ala Ser His Xaa 
35 40 



25 



30 



40 



45 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 374: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 
(D) .TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 

Met Val Phe Ser Ala Arg Val Ser Leu Tyr Thr Arg Phe Lys Val He 
1.5 10 15 

Leu Leu Ser Leu Leu He Met He Leu His Val Cys Trp Val Trp Val 
20 25 30 

He Leu Xaa 
35 



(2) INFORMATION FOR SEQ ID NO: 375: 
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(i) SEQUENCE CffiVRACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 

Gly Leu Leu Tyr He Met Tyr Cys Asn He Xaa 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 376: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 

Met Asn Asn Gly Leu Leu Gin Gin Pro Ser Ala Leu Met Leu Leu Pro 
15 10 15 

Cys Arg Pro Val Leu Thr Ser Val Ala Leu Asn Ala Asn Phe Val Ser 
20 25 30 

Trp Lys Ser Arg Thr Lys Tyr Thr He Thr Pro Val Lys Met Arg Lys 
35 40 45 

Ser Gly Gly Arg Asp His Thr Gly Gly Asn Lys Asp Arg Gly He Xaa 
50 55 60 



(2) INFORMATION FOR SEQ la NO: 377: 

(i) SEQUENCE CffiVRACTERISTICS : 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 

Met Arg Lys Gin Arg Leu Val Pro Met Tyr Leu Gly Leu He Tyr He 
1 5 -10 15 

Leu Leu Xaa 



(2) INFORMATION FOR SEQ ID NO: 378: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 
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Met Arg Gin His Xaa 
1 5 



(2) INFORMATION FOR SEQ ID NO: 379: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379: 

Leu Leu Leu Pro Val Leu Ala Ser Ser Val Pro Ser His Ser Ala Thr 

^ 5 . 10 15 . 

Xaa 



(2) INFORMATION FOR SEQ ID NO: 380: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380: 

Met Leu Pro Leu Leu Leu Phe Thr Tyr Leu Asn Ser Phe Leu His Gin 
^ 5 10 ' 15 

Arg He Pro Gin Ser Val Arg He Leu Gly Ser Leu Val Ala He Leu 
20 25 30 

Leu Val Phe Leu He Thr Ala He Leu Val Lys Val Gin Leu Asp Ala 
35 ; ^ 40 45 

Leu Pro Phe Phe Val He Thr Met He Lys He Val Leu He Asn Ser 
50 55 60 

Phe Gly Ala He Leu Gin Gly Ser Leu Phe Gly Leu Ala Gly Leu Leu 
65 70 75 80 ^ 

Pro Ala Ser Xaa 



(2) INFORMATION FOR SEQ ID NO: 381: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIOI: SEQ ID NO: 381: 

Met Lys Leu Ser Leu Phe Leu He Leu Ser Asp Val Phe Tyr Leu Gly. 
^ 5 10 15 ' 
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Ser Pro Xaa Thr Xaa 
20 



(2) INFORMATION FOR SEQ ID NO: 382: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 

Met Gly Thr Arg Arg Lys Gly Val Ala Trp Leu Ser Leu Ala Pro Leu 
15 10 15 

lie Thr Gly Leu Ala Pro Ala His lie Thr Ala Val Xaa 
20 25 



(2) INFORMATION FOR SEQ ID NO: 383: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
. (D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383: 

Met Lys Asp Leu Leu Gin Arg Asn Pro Trp Lys Asn Ser Leu Leu Leu 
15 10 15 

Leu Gin Val Cys Gin Ala Phe Leu Val Cys Ser Leu Thr Gin Leu Ala 
20 25 30 

Val Xaa 



(2) INFORMATION FOR SEQ ID NO: 3i84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 

Met Ser. Glu Ser His Lys He Trp Trp Cys Tyr Arg His Leu Ala Phe 
1-5 10 15 

Pro Leu Leu Thr Leu He Leu Tyr Pro Ala Thr Leu Gly Arg Ser Val 
20 25 30 

Phe Cys His Asp Cys Lys Phe Pro Glu Ala Ser Pro Ala Met Xaa 
35 4Q . 45 



(2) INFORMATION FOR SEQ ID NO: 385: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TVPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 

Met Leu Asn Arg He Met Val Ala Ser Phe Gly Ala Val Leu Val Gin 
15 10 15 

Val Cys Arg Gly Xaa Gly Gin Gly Xaa 
20 25 



(2) INFORMATION FOR SEQ ID NO: 386: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) iSEQUENCE DESCRIPTION: SEQ ID NO: 386: 

Met Gin Leu Leu Leu Leu Gly Leu He Arg Ser Gin Pro Ser Pro Pro 
1 5 10 15 

Pro Ser Leu Cys Leu Met Leu Cys Pro Cys Leu Pro Cys Leu Arg Tyr 
20 . 25 30 

Ser Pro Phe Val Pro Gin His Pro Cys Pro Leu Pro Leu Asp Leu Cys 
35 40 45 

Leu Ala Gly Cys Ser Ser Leu Ser Val Gin Asp Lys Cys Ser Trp Pro 
50 55 60 

Tyr Pro He Xaa 
65 



(2) INFORMATION FOR SEQ ID NO: 387: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 

Lys Glu Phe Phe Val Phe Leu Phe Val Cys Leu Phe Trp Leu Leu Ser 
1 5 -10 15 

Asn Thr Pro Leu Thr Phe He Ser He He Leu Gin Arg Lys Glu Thr 
20 25 ' 30 

Asn Xaa 



(2) INFORMATION FOR SEQ ID NO: 388: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) ^TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 

Ser Phe Leu Met Val Leu Val He Leu Ala Ala Ser Pro Xaa 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 389: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 



10 



(2) INFORMATION FOR SEQ ID NO: 390: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 

(B) TYPE: amino acid 
(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 

Met Thr Lys Ala Arg Leu Phe Arg Leu Trp Leu Val Leu Gly Ser Val 
1 5 10 15 

Phe Met He Leu Leu He lie Val Tyr Trp Asp Ser Ala Gly Ala Ala 
20- 25 30 

His Phe Tyr Leu His Thr Ser Phe Ser Arg Pro His Thr Gly Pro Pro 
35 40 45 

Leu Pro Thr Pro Gly Pro Asp Arg Asp Arg Glu Leu Thr Ala Asp Ser 
50 55 60 

Asp Val Asp Xaa Phe Leu Asp Xaa Phe Leu Ser Ala Gly Val Lys Gin 
^5 70 75 80 

Ser Asp Xaa Pro Arg Lys Glu Thr Glu Gin Pro Pro Ala Pro Gly Ser 
85 90 95 

Met Glu Glu Ser Val Arg Xaa Tyr Asp Trp Ser Pro Arg Xaa Ala Arg 
100 105 110 

Arg Thr Gin Thr Arg Ala Gly Ser Xaa Arg Xaa Gly Gly Xaa Cys Cys 
115 120 125 

Gly Ala Ser Ala Pro Xaa Pro Ala Ttp Pro Ser Pro Pro Arg Ser Ala 
130 135 140 
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His Ser Thr Thr Ser Pro Thr Arg Ser Xaa 
145 150 



(2) INFORMATION FOR SEQ ID NO: 391: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391: 

Met Val Leu Leu Gly Leu Leu Ser Xaa 
1 5 



(2) INFORMATION FOR SEQ ID NO: 392: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392: 

Met Cys lie His Val Phe Met Xaa Val Leu Trp Val Leu Phe Leu Leu 
15 10 15 



Ash Pro Leu Cys Thr Gly Leu Trp 
20 

Leu Arg His Ala Asp Trp Val Leu 
35 40 



Pro Leu Xaa Asn Cys Phe Ser Val 
25 30 

Gly Ala Asp Tyr Lys Gly Glu Glu 
45 



Leu Asn Arg His Gin Gly . Pro Met Lys Pro Lys Asp Xaa 
50 as. 60 



(2) INFORMATION FOR SEQ ID NO: 393: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3.93: 

Met Leu Leu Gly Leu Leu Met Ala Ala Cys Phe Thr Phe Cys Leu Ser 
1 5 10 ' 15 

His Gin Asn Leu Lys Glu Phe Ala Leu Thr Asn Pro Glu Lys Ser Ser 
20 25 30 

Thr Lys Glu Thr Glu Arg Lys Glu Thr Lys Ala Glu Glu Glu Leu Asp 
35 40 45 

Ala Glu Val Leu Glu Val Phe His Pro Thr His Glu Trp Gin Ala Leu. 
50 55 60 
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Gin Pro Gly Gin Ala Val Pro Ala Gly Ser His Val Arg Leu Asr. leu 
65 70 75 b: 

Gin Thr Gly Glu Arg Glu Ala Lys Leu Gin Tyr Glu Asp Lys Phe Arg 
85 90 2= 

Asn Asn Leu Lys Gly Lys Arg Leu Asp lie Asn Thr Asr. Thr ri.-r Thr 
100 105 110 

Ser Gin Asp Leu Lys Ser Ala Leu Ala Lys Phe Lys Glu Gly Ala 31u 
115 120 125 

Met Glu Ser Ser Lys Glu Asp Lys Ala Arg Gin Ala Glu Val lys Arg 
130 135 140 

Leu Phe Arg Pro lie Glu Glu Leu Lys Lys Asp Phe Asp Glu leu Asn 
145 150 155 153 

Val Val He Glu Thr Asp Met Gin He Met Val Arg Leu He Asr. lys 
165 170 175 

Phe Asn Ser Ser Ser Ser Ser Leu Glu Glu Lys He Ala Ala leu ?r-e 
180 185 190 

Asp Leu Glu Tyr Tyr Val His Gin Met Asp Asn Ala Gin Asp leu leu 
195 200 205 

Ser Phe Gly Gly Leu Gin Val Val He Asn Gly Leu Asn Ser -hr 31u 
210 215 220 

Pro Leu Val Lys Glu Tyr Ala Ala Phe Val Leu Gly Ala Ala P.he Ser 
225 230 235 24D 

Ser Asn Pro Lys Val Gin Val Glu Ala He Glu Gly Gly Ala leu 31n 
245 250 255 

Lys Leu Leu Val He Leu A^a Thr Glu Gin Pro Leu Thr Ala lys lys 
260 265 . 270 

Lys Val Leu Phe Ala Leu Cys Ser Leu Leu Arg His Phe Pro Tyr Ala 
275 280 235 

Gin Arg Gin Phe Leu Lys Leu Gly Gly Leu Gin Val Leu Arg Ihr leu 
290 295 300 . 

Val Gin Glu Lys Gly Thr Glu Val Leu Ala Val Arg Val Val rrjr leu 
305 310 315 320 

Leu Tyr Asp Leu Val Thr Glu Lys Met Phe Ala Glu Glu Glu Ala Glu 
325 330 335 

Leu Thr Gin Glu Mfet Ser Pro Glu Lys Leu Gin Gin Tyr Arg Gin Val 
340 345 350 

His Leu Leu Pro Gly Leu Trp Glu Gin Gly Trp Cys Glu He Thr Ala 
355 360 365 

His Leu Leu Ala Leu Pro Glu His Asp Ala Arg Glu Lys Val Le^ Gin 
370 375 380 
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Thr Leu Gly Val Leu Leu Thr Thr Cys Arg Asp Arg Tyr Arg Gin Asp 
385 390 555 400 

Pro Gin Leu Gly Arg Thr Leu Ala Ser Leu Gin A^a Glu Tyr Gin Val 
405 41C 415 

Leu Ala Ser Leu Glu Leu Gin A-sp Gly Glu Asp Glu Gly Tyr Phe Gin 
420 425 430 

Glu Leu Leu Gly Ser Val Asn Ser Leu Leu Lys Glu Leu Arg Xaa 
435 443 445 



(2) INFORMATIOJ FOR SEQ ID NC: 394: 

(i) SEQUENCE CHARACTEPr S?ICS : 

(A) LENGTH: 24 anino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: iir.sar ' 

<xi) SEQUENCE DESCRIPTi:^!: SEQ ZD 170: 394: 

Met Val lie Ser Tyr Val Thr Phe Thr Pro Val Ser Ala A-sp Cys Phe 
15 10 15 

Phe Asn Val Leu Val Cys Phe Xaa 
20 



(2) INFORMATION FOR SEQ ID NC: 395: 

(i) SEQUENCE CKARACTEFZSTICS: 

(A) LENGTH: 24 2=dno acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTrCJJ: SEQ ID !X}: 395: 

Glu Leu Leu Phe Leu Leu lie He lie Leu Gly Glu Ser Leu Ser Asp 
15 10 15 

Val lie Leu Leu lie Cys Phe liaa 
20 



(2) INFORMATION FOR SEQ ID NO: 396: 

(i) SEQUENCE CHARACTEPZSTICS : 

(A) LENGTH: 35 asdno acids 

(B) TYPE: amir.o acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID IX): 396: 

Met Phe Tyr Trp Gly Gly Leu Ser Phe Tyr Phe Leu Leu Ser Ser Gly 
15 10 15 



Val Gly Phe Tyr Cys Phe Leu Phe Gly Phe Gly Met Glu He Trp He 
20 25 30 
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Ala Ala Xaa 
35 



(2) INFORMATION FOR SEQ ID NO: 397: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397: 

Gly Arg Xaa 
1 



(2) INFORMATION FOR SEQ ID NO: 398: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398: 

Met Lys Leu Ser Leu Leu lie Leu Thr Leu Met Gin Arg Tyr Phe Arg 
15 10 15 

Thr lie Thr Asn Ser Leu Cys Lys Xaa 
20 25 



(2) INFORMATION FOR SEQ ID NO: 399: 

(i) SEQUENCE CHARAeTERISTICS: 

(A) LENGTH: 79 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399: 

Met Pro Ala Val Ser Gly Pro Gly Pro Leu Phe Cys Leu Leu Leu Leu 
1 5 10 15 

Leu Leu Asp Pro His Ser Pro Glu Thr Gly Cys Pro Pro Leu Arg Arg 
20 25 30 

Phe Glu Tyr Lys Leu Ser Phe Lys Gly Pro Arg Leu Ala Leu Pro Gly 
35 40 45 

Ala Gly He Pro Phe Trp Ser His His Gly Gly Glu Gly Gin- Gly Trp 
50 55 60 

Gly Pro Leu Cys Pro Gly Ser Leu Lys Val Leu Glu Gly Leu Xaa 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 400: 
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{i) SEQUENCE CHARACTERISTICS: 

■ (A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(D) TOPOIiOOT: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400: 

Met Lys Val Phe Leu Ser Met Pro Phe Leu Val Leu Phe Gin Ser Leu 
15 10 15 

lie Gin Glu Asp Xaa 
20 



(2) INFORMATICXJ FOR SEQ ID NO: 401: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 257 amino acids . 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401: 

Met Ala Ala Leu Thr Ser His Leu Gin Asn Gin Ser Asn Asn Ser Asn 
1 5 10 15 

Trp Asn Leu Arg Thr Arg Ser Lys Cys Lys Lys Asp Val Phe Met Pro 
20 25 30 

Pro Ser Ser Ser Ser Glu Leu Gin Glu Ser Arg Gly Leu Ser Asn Phe 
35 40 45 

Thr Ser Thr His Leu Leu Leu Lys Glu Asp Glu Gly Val Asp Asp Val 
50 55 60 

Asn Phe Arg Lys Val Arg Lys Pro Lys Gly Lys Val Thr lie Leu Lys 
65 70 . . 75 80 

Gly He Pro He Lys Lys Thr Lys Lys Gly Cys Arg Lys Ser Cys Ser 
85 90 95 

Gly Phe Val Xaa Ser Asp Ser Lys Arg Glu Ser Val Cys Asn Lys Ala 
100 105 110 

Asp Ala Glu Ser Glu Pro Val Ala Gin Lys Ser Gin Leu Asp Arg Thr 

115 120 125 

Val Cys He Ser Asp Ala Gly Ala Cys Gly Glu Thr Leu Ser Val Thr 
130 135 140 

Ser Glu Glu Asn Ser Leu Val Lys Lys Lys Glu Arg Ser Leu Ser Ser 
145 150 155 160 

Gly Ser Asn Phe Cys Ser Glu Gin Lys Thr Ser Gly He He Asn Lys 
165 170 175 

Phe Cys Ser Ala Lys Asp Ser Glu His Asn Glu Lys Tyr Glu Asp Thr 
180 185 190 



Phe Leu Glu Ser Glu Glu He Gly Thr Lys Val Glu Val Val Glu Arg 
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195 200 

Lys Glu His Leu His Thr Asp He 
210 215 

Asn Asn Cys Ser Pro Thr Arg Lys 
225 230 

Arg Asn Thr Asp Arg Lys Lys Glu 
245 

He 



565 

205 

Leu Lys Arg Gly Ser Glu Met Asp 
220 

Asp Phe Thr Glu Asp Thr He Pro 
235 240 

Asn Lys Pro Val Phe Phe Gin Gin 
250 255 



(2) INFORMATION FOR SEQ ID NO: 402: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 424 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402: 

Met Glu Lys Gin Cys Cys Ser His Pro Val He Cys Ser Leu Ser Thr 
1 5 10 . 15 



Mtet Tyr Thr Phe Leu Leu Gly Ala 
20 

Arg He Leu Leu Val Lys Tyr Ser 
35 40 

Tyr Leu Pro Thr Thr Val Asn Val 
50 55 



He Phe He Ala Leu Ser Ser Ser 
25 30 

Ala Asn Glu Glu Asn Lys Tyr Asp 
45 

Cys Ser Glu Leu Val Lys Leu Val 
60 



Phe Cys Val Leu Val Ser Phe- Cys Val He Lys Lys Asp His Gin Ser 
65 70 75 80 

Arg Asn Leu Lys Tyr Ala Ser Trp Lys Glu Phe Ser Asp Phe Met Lys 
85 90 95 

Trp Ser He Pro Ala Phe Leu Tyr Phe Leu Asp Asn Leu He Val Phe 
100 105 110 

Tyr Val Leu Ser Tyr Leu Gin Pro Ala Met Ala Val He Phe Ser Asn 
115 120 125 

Phe Ser He He Thr Thr Ala Leu Leu Phe Arg He Val Leu Lys Xaa 
130 135 140 

Arg Leu Asn Trp He Gin Trp Ala Ser Leu Leu Thr Leu Phe Leu Ser 
145 150 155 160 

He Val Ala Leu Thr Ala Gly Thr Lys Thr Leu Gin His Asn Leu Ala 
165 170 175 

Gly Arg Gly Phe His His Asp Ala Phe Phe Ser Pro Ser Asn Ser Cys' 
180 185 190 
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Leu Leu Phe Arg Asn Glu Cys Pro Arg Lys Asp Asn Cys Thr Ala Lys 
. 195 200 205 

Glu Trp Thr Phe Pro Glu Ala Lys Trp Asn Thr Thr Ala Arg Val Phe 
210 215 220 



10 



Ser His lie Arg Leu Gly Met Gly His Val Leu He He Val Gin Cys 
225 230 235 240 

Phe He Ser Ser Met Ala Asn He Tyr Asn Glu Lys He Leu Lys Glu 
245 250 255 



15 



Gly Asn Gin Leu Thr Glu Xaa He Phe He Gin Asn Ser Lys Leu Tyr 
260 265 270 

Phe Phe Gly He Leu Phe Asn Gly Leu Thr Leu Gly Leu Gin Arg Ser 
275 280 285 



Asn Arg Asp Gin He Lys Asn Cys Gly Phe Phe Tyr Gly His Ser Ala 
20 290 295 300 



25 



Phe Ser. Val Ala Leu He Phe Val Thr Ala Phe Gin Gly Leu Ser Val 
305 310 315 320 

Ala Phe He Leu Lys Phe Leu Asp Asn Met Phe His Val Leu Met Ala 
325 330 335 



30 



Gin Val Thr Thr Val He He Thr Thr Val Ser Val Leu Val Phe Asp 
340 345 350 

Phe Arg Pro Ser Leu Glu Phe Phe Leu Glu Ala Pro Ser Val Leu Leu 
355 360 365 



Ser He Phe He Tyr Asn Ala Ser Lys Pro Gin Val Pro Glu Tyr Ala 
35 370 375 380 



40 



Pro Arg Gin Glu Arg He Aaag Asp Leu Ser Gly Asn Leu Trp Glu Arg 
385 390 395 400 

Ser Ser Gly Asp Gly Glu Glu Leu Glu Arg Leu Thr Lys Pro Lys Ser 
405 410 415 



45 



Asp Glu Ser Asp Glu Asp Thr Phe 
420 



(2) INFORMATION FOR SBQ ID NO: 403: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LESICjTH: 33 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403: 

Mtet Trp Gly Gin Gly Ser Gin Lys Ser His Phe Ser Asp Leu Val Phe 
15 10 15 

Gly Val Arg Glu Leu Cys Ala Gin Pro Ser Asp Pro Gly Ser Pro His 
60 20 25 30 
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Xaa 



(2) INFORMATION FOR SEQ ID NO: 404: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404: 

Met Val Gin His He Gin Pro Ala Ala Leu Ser Leu Leu Ala Gin Trp 
15 10 15 

Ser Thr Leu Val Gin Glu Leu Glu Ala Ala Leu Gin Leu Ala Phe Tyr 
20 25 30 

Pro Asp Ala Val Glu Glu Trp Leu Glu Glu Asn Val His Pro Ser Leu 
.35 40 45 

Gin Arg Leu Gin Xaa Leu Leu Gin Asp Leu Ser Glu Val Ser Ala Pro 
50 . 55 60 

Pro Leu Pro Pro Thr Ser Pro Gly Arg Asp Val Ala Gin Asp Pro Xaa 
65 70 ,75 80 



(2) INFORMATION FOR SEQ ID NO: 405: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405: 

Met Leu Asn Gin Gly Tyr He Arg Lys He He I^eu He He He Leu 
1 5 10 . 15 

Gly Ser Phe Ser Ser Pro Lys Lys Ala He Leu Met Gly Phe Gin Asn 
20 25 30 

Gin Lys Lys Ala Leu Asn Glu Glu Gin Thr Thr Gly Val Pro Met Ser 
35 40 45 

He Ser Gly Lys Leu Arg. Pro Ser Arg Ser Leu Asp Phe Val Gin Pro 
50 55 60 

Pro Arg Phe Gin Ser Gin Gin Pro Ser Ala Val Val Asp Arg Arg Gly 
65 . 70 75 80 



Phe Xaa Xaa Lys Ala Ala Arg Gly Gin Glu Phe Ser Glu Ser Xaa 
85 90 95 
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(2) INFORMATION FOR SEQ ID NO: 406: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 257 amino acids 

{B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406: 

Met Arg Gly Pro Ala Gin Ala Lys Leu Leu Pro Gly Ser Ala lie Gin 
1 5 10 15 

Ala Leu Val Gly Leu Ala Arg Pro Leu Val Leu Ala Leu Leu Leu Val 
20 25 30 

Ser Ala Ala Leu Ser Ser Val Val Ser Arg Thr Asp Ser Pro Ser Pro 
35 40 45 



Thr Val Leu Asn Ser His lie Ser Thr Pro Asn Val Asn Ala Leu Thr 
50 55 60 

His Glu Asn Gin Thr Lys Pro Ser He Ser Gin He Ser Thr Thr Leu 
65 70 75 80 



Pro Pro Thr Thr Ser Thr Lys Lys Ser Gly Gly Ala Ser Val Val Pro 
85 90 95 

His Pro Ser Pro Thr Pro Leu Ser Gin Glu Glu Ala Asp Asn Asn Glu 
100 105 110 

Asp Pro Ser He Glu Glu Glu Asp Leu Leu Met Leu Asn Ser Ser Pro 
115 120 125 



Ser Thr Ala Lys Asp Thr Leu Asp Asn Gly Asp Tyr Gly Glu Pro Asp 
130 135 140 

Tyr Asp Trp Thr Thr Gly Pro Arg Asp Asp Asp Glu Ser Asp Asp Thr 
145 150 155 160 

Leu Glu Glu Asn Arg Gly Tyr Met Glu He Glu Gin Ser Val Lys Ser 
165 170 175 

Phe Lys Met Pro Ser Ser Asn He Glu Glu Glu Asp Ser His Phe Phe 
180 185 190 

Phe His Leu He He Phe Ala Phe Cys He Ala Val Val Tyr He Thr 
195 200 205 

Tyr His Asn Lys Arg Lys He Phe Leu Leu Val Gin Ser Arg Lys Trp 
210 215 220 

Arg Asp Gly Leu Cys Ser Lys Thr Val Glu Tyr His Arg Leu Asp Gin 
225 230 235 240 

Asn Val Asn Glu Ala Met Pro Ser Leu Lys He Thr Asn Asp Tyr He 
245 250 255 



Phe 
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(2) INFORMATIOI FOR SEQ ID NO: 407: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 623 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407: 

Met Phe Met Arg lie Ala Lys Ala Tyr Ala Ala Leu Thr Asp Glu Glu 
1 5 10 15 

15 Ser Arg Lys Asn Trp Glu Glu Phe Gly Asn Pro Asp Gly Pro Gin Ala 
20 25 30 



20 



Thr Ser Phe Gly lie Ala Leu Pro Ala Trp lie Val Asp Gin Lys Asn 
35 40 45 

Ser lie Leu Val Leu Leu Val Tyr Gly Leu Ala Phe Met Val lie Leu 
50 55 60 



Pro Val Val Val Gly Ser Trp Trp Tyr Arg Ser lie Arg Tyr Ser Gly 
25 65 70 75 80 

Asp Gin lie Leu lie Arg Thr Thr Gin lie Tyr Thr Tyr Phe Val Tyr 
85 90 95 

30 Lys Thr Arg Asn Met Asp Met Lys Arg Leu He Met Val Leu Xaa Gly 
100 105 110 

Ala Ser Glu Phe Asp Pro Gin Tyr Asn Lys Asp Ala Thr Ser Arg Pro 

115 120 125 

35 

Thr Asp Asn He Leu He Pro Gin Leu He Arg Glu He Gly Ser He 
130 135. 140 

Asn Leu Lys Lys Asn Glu Pro Pro Leu Thr Cys Pro Tyr Ser Leu Lys 
40 145 150 155 160 

Ala Arg Val Leu Leu Leu Ser His Leu Ala Ax^ Met Lys He Pro Glu 
165 170 175 

45 Thr Leu Glu Glu Asp Gin Gin Phe M&t Leu Lys Lys Cys Pro Ala Leu 
180 185 190 

Leu Gin Glu Met Val Asn Val He Cys Gin Leu He Val Met Ala Arg 
195 200 205 

50 

Asn Arg Glu Glu Arg Glu Phe Arg Ala Pro Thr Leu Ala Ser Leu Glu 
210 215 220 

Asn Cys Met Lys Leu Ser Gin Miet Ala Val Gin Gly Leu Gin Gin Phe 
55 225 230 235 240 

Lys Ser Pro Leu Leu Gin Leu Pro His He Glu Glu Asp Asn Leu Arg 
245 250 255 



60 



Arg Val Ser Asn His Lys Lys Tyr Lys He Lys Thr He Gin Asp Leu 
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260 



265 



270 



Val Ser Leu Lys Glu Ser Asp Arg His Thr Leu Leu His Phe Leu Glu 
275 280 285 

5 

Asp Glu Lys Tyr Glu Glu Val Met Ala Val Leu Gly Ser Phe Pro Tyr 
290 295 300 

Val Thr Met Asp lie Lys Ser Gin Val Leu Asp Asp Glu Asp Ser Asn 
10 305 310 315 320 

Asn lie Thr Val Gly Ser Leu Val Thr Val Leu Val Lys Leu Thr Arg 
325 330 335 

15 Gin Thr Met Ala Glu Val Phe Glu Lys Glu Gin Ser lie Cys Ala Ala 
340 345 350 



20 



Glu Glu Gin Pro Ala Glu Asp Gly Gin Gly Glu Thr Asn Lys Asn Arg 
355 360 365 

Thr Lys Gly Gly Trp Gin Gin Lys Ser Lys Gly Pro Lys Lys Thr Ala 
370 375 380 



Lys Ser Lys Lys Lys Lys Pro Leu Lys Lys Lys Pro Thr Pro Val Leu 
25 385 390 395 400 

Leu Pro Gin Ser Lys Gin Gin Lys Gin Lys Gin Ala Asn Gly Val Val 
405 410 415 

30 Gly Asn Glu Ala Ala Val Lys Glu Asp Glu Glu Glu Val Ser Asp Lys 
420 425 430 



35 



Gly Ser Asp Ser Glu Glu Glu Giu Thr Asn Arg Asp Ser Gin Ser Glu 
435 440 445 

Lys Asp Asp Gly Ser Asp Arg Asp Ser Asp Arg Glu Gin Asp Glu Lys 
450 • 455. 460 



Gin Asn Lys Asp Asp Glu Ala Glu Trp Gin Glu Leu Gin Gin Ser lie 
40 465 470 475 480 

Gin Arg Lys Glu Arg Ala Leu Leu Glu Thr Lys Ser Lys Xle Thr His 
485 490 495 

45 Pro Val Tyr Ser Leu Tyr Phe Pro Glu Glu Lys Gin Glu Trp Trp Trp 
500 505 510 



50 



Leu Tyr He Ala Asp Arg Lys Glu Gin Thr Leu He Ser Met Pro Tyr 
515 520 525 

His Val Cys Thr Leu Lys Asp Thr Glu Glu Val Glu Leu Lys Phe Pro 
530 535 540 



Ala Pro Gly Lys Pro Gly Asn Tyr Gin Tyr Thr Val Phe Leu Arg Ser 
55 545 550 555 560 



Asp Ser Tyr Met Gly Leu Asp Gin He Lys Pro Leu Glu Val Xaa Lys 
565 570 575 

60 Phe Met Arg Leu Lys Pro Val Pro Glu Asn His Pro Gin Trp Asp Thr 
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580 



585 



590 



Ala lie Glu Gly Asp Glu Asp Gin Glu Asp Ser Glu Gly Phe Glu Asp 
595 600 605 



Ser Phe Glu Gly Gly Arg Gly Arg Glu Glu Gly Aorg Trp Trp Thr 
610 615 620 



(2) INFORMATICS^ FOR 5EQ ID NO: 408: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 190 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408: 

Met Lys Ala Ser Gin Cys Cys Cys Cys Leu Ser His Leu Leu Ala Ser 
15 10 15 

Val Leu Leu Leu Leu Leu Leu Pro Glu Leu Ser Gly Xaa Leu Xaa Val 
20 25 30 

Leu Leu Gin Ala Ala Glu Ala Ala Pro Gly Leu Gly Pro Pro Asp Ptq 
35 40 45 

Arg Pro Arg Thr Leu Pro Pro Leu Pro Pro Gly Pro Thr Pro Ala Gin 
50 55 60 

Gin Pro Gly Arg Gly Leu Ala Glu Ala Ala Gly Pro Arg Gly Ser Glu 
65 70 75 80 

Gly Gly Asn Gly Ser Asn Pro Val Ala Gly Leu Glu Thr Asp Asp His 
85 90 95 

Gly Gly Lys Ala Gly Glu G^y Ser Val Gly Gly Gly Leu Ala Val Ser 
100 105 110 

Pro Asn Pro Gly Asp Lys Pro Met Thr Gin Arg Ala Leu Thr Val Leu 

115 120 125 

Met Val Val Ser Gly Ala Val Leu Val Tyr Phe Val Val Arg Thr Val 
130 135 140 

Arg Met Arg Arg Arg Asn Arg Lys Thr Arg Arg Tyr Gly Val Leu Asp 
145 150 155 160 

Thr Asn He Glu Asn Met Glu Leu Thr Pro Leu Glu Gin Asp Asp Glu 
165 170 175 

Asp Asp Asp Asn Thr Leu Phe Asp Ala Asn His Pro Arg Arg 
180 185 190 



(2) INFORMATION FOR SEQ ID NO: 409: 



(i) SEQUENCE CHARACTERISTICS: 

(A) I4ENGTH: 179 amino acids 
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•'3) TCVz.: azdno acid 
fD) linear 
(xi) 5E^UZ:5CS ZESCriyZZrZli SEQ 13 iJO: 409: 

Met Ser Pro Ser Gly Arg Leu C/s leu Leu Thr He Val Gly Leu He 
15 10 15 

Leu Pro Thr Arg Gly Gin Thr Leu lys Asp Thr Thr Ser Ser Ser Ser 
2C 25 • ' 30 

Ala Asp Ser "hr He Me- As? He Glr. Val Pro Thr Arg Ala Pro Asp 
35 40 45 

Ala Val ?yr rhr Glu le-u Gin Pro Thr Ser Pro Thr Pro Thr Trp Pro 
50 55 60 

Ala Asp Glu Thr Pre Gin Pro Qln Tr^ Gin Thr Gin Gin Leu Glu Gly 
65 70 75 80 

Thr Asp Gly Pro Leu Val Thr As? Pro Glu Thr His Lys Ser Thr Lys 
S£ 90 95 

Ala Ala His Pro Thr Asp As? Thr Thr Thr Leu Ser Glu Arg Pro Ser 
ICC 1:5 110 

Pro Ser Thr As? Val Gin Thr As? Pro Gin Thr Leu Lys Pro Ser Gly 
115 120 125 

Phe His Glu As? As? Pro Pha Phe Tvr Asp Glu His Thr Leu Arg Lys 
130 135 140 

Arg Gly Leu Leu Val Ala Aia Val leu Phe lie Thr Gly He He He 
145 150 155 160 

Leu Thr Ser Gly Lys Zrrs Arg Gin Leu Ser Arg Leu Cys Arg Asn His 
16= 170 175 

Cys Arg :<aa 



(2) XMrC^^TICN ?Cr. S3Q ID NC: 413: 

(i) szQczNcz cHP:E^j::::^ssrr:csz 

(A) 1Z25GTH: 14 amino acids 
(3) Ti^Zi a-jno acid 
(D) TCPOLOGi: linear 
(xi) SSCOZrSTE 3ESC?::P^a3: SSX2 3 NO: 410: 

Met Phe Lys Cys Leu Gin Thr Thr Phe Leu Phe He Xaa Xaa 
1 " = 10 



(2) INFORMyriC?; FC?. S3Q TZ NO: 411: 

(i) CHARACTHPJSTTCS : 

(A) L25GTH: 232 p.rrno acids 
13) TVPE: anino acid 
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(D) TOPOLOGY: linear 
(Xi) SEQUENCE DESCRIPTiai: SBQ ID NO: 411: 

Met Leu Ala Gly Lys Leu He Pro Val His Gin Val Arg Gly Leu Lys 
1 5 10 i5 

Glu Lys He Val Arg Ser Phe Glu Val Ser Pro Asp Gly Ser Phe Leu 
20 25 30 

Leu He Asn Gly He Ala Gly Tyr Leu His Leu Leu Ala Met Lys Thr 
35 40 45 

Lys Glu Leu He Gly Ser Mfet Lys He Asn Gly Arg Val Ala Ala Ser 
50 ' 55 . 60 

Thr Phe Ser Ser Asp Ser Lys Lys Val Tyr Ala Ser Ser Gly Asp Gly 
^5 70 75 80 

Glu Val Tyr Val Trp Asp Val Asn Ser Arg Lys Cys Leu Asn Arg Phe 
85 90 95 

Val Asp Glu Gly Ser Leu Tyr Gly Leu Ser He Ala Thr Ser Arg Asn 
100 105 110 

Gly Gin Tyr Val Ala Cys Gly Ser Asn Cys Gly Val Val Asn He Tyr 
115 120 125 

Asn Gin Asp Ser Cys Leu Gin Glu Thr Asn Pro Lys Pro He Lys Ala 
130 135 140 

He Met Asn Leu Val Thr Gly Val Thr Ser Leu Thr Phe Asn Pro Thr 
150 155 160 

Thr Glu He Leu Ala He Ala Ser Glu Lys Met Lys Glu Ala Val Arg 
165 170 175 

Leu Val His Leu Pro Ser Thr Val Phe Ser Asn Phe Pro Val He 
180 185 190 

Lys Asn Lys Asn He Ser His Val His Thr Met Asp Phe Ser Pro Arg 
195 200 205 

Ser Gly Tyr Phe Ala Leu Gly Asn Glu Lys Gly Lys Ala Leu Met Tyr 
210 215 220 

Arg Leu His His Tyr Ser Asp Phe 
225 ■ 230 



(2) INFORMATION FOR SEQ ID NO: 412: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412: 



He Leu Leu Cys Ser Trp Pro Thr Gly Leu Val Gly Gly Arg Asp Pro 
^5 10 15 
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Gly Ser Ser Arg Gly Ser Ser Ala Ser Leu Thr Pro Ser Pro.Gly Arg 
20 25 30 

Gin Pro Cys Ser Arg Arg Arg Gly Tyr Ser Val Gly Arg Arg Ser Ser 
35 40 45 

Pro Pro Asp Gly Ser Xaa 
50 



(2) INFORMATION FOR SEQ ID NO: 413: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 3 3. amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413: 

Met Ser Leu Gin Ser Asn Ala Trp Ser Lys Xaa Leu Phe lie Val Phe 
15 10 15 

Leu Phe Leu Arg Val Leu Phe Lys Thr Gly Val Ser Ser Glu Glu Ser 
20 25 3b 

Xaa 



(2) INFORMATIC»T FOR SEQ ID NO: 414: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414: 

Met Ala Val Val Leu Leu Ala Asn Leu Ala Gin Gly Asp Ser Leu Ala 
1 5 10 15 

Ala Arg Ala lie Ala Val Gin Lys Gly Ser lie Gly Asn Leu Leu Gly 
20 25 30 

Phe Leu Glu Asp Ser Leu Ala Ala Thr Gin Phe. Gin Gin Ser Gin Ala 
35 40 45 

Ser Leu Leu His Met Gin Asn Pro Pro Phe Glu Pro Xaa Ser Val Asp 
50 55 60 

Met Met Arg Arg Ala Ala Arg Ala Leu Leu Ala Leu. Ala Lys Val Asp 
65 70 75 80 

Glu Asn His Ser Glu Phe Thr Leu Tyr Glu Ser Arg Leu Leu Asp He 
85 90 95 

Ser Val Ser Pro Leu Met Asn Ser Xaa Val Ser Gin Val He Cys Asp. 
* 100 105 110 
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Val Leu Phe Leu Xaa Trp Pro Val Met Thr Ala Val Gly His Leu Pro 
115 120 125 

Pro Pro Cys Val Cys Ala Cys Val Glu Asn Leu Glu Thr Asp Cys Cys 
130 135 140 

Pro Leu Phe Met Gin Asn His Leu Arg He Gli\ Phe Thr Leu Cys Cys 
145 150 155 160 

Pro Ala Ser Pro Leu Gly Lys Ser Leu Ser Cys Phe Ser Leu Leu Leu 
165 170 175 

Pro Pro Pro Leu Pro Pro Ser Pro His Ala Phe Leu Phe Leu Val Leu 
' 180 185 190 

Thr Leu Leu Pro Ser Gly Pro Tyr Pro Thr Leii Phe Glu Lys Thr Lys 
195 200 205 

Leu Cys Leu His Arg Arg Leu Phe Leu Phe Xaa 
210 215 



(2) INFORMATION FOR SEQ ID NO: 415: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIOI: SEQ ID NO: 415: 

Met Leu Pro Asp Glu Ser Phe Gly Leu Leu Leu Ser He Pro Ser Leu 
1 5 10 15 

Thr Pro Ser Ala Ala Ala Pro Ser Phe Cys Val His Leu Met Gin Ala 
20 25 30 

Ser Arg Ser Ser Lys Arg Ala Ser His Val Pro Val His Leu Leu Trp 
35 40 45 

Gly Asp Xaa 
50 



(2) INFORMATION FOR SEQ ID NO: 416: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416: 

Met Arg Pro Gly Ser Phe Ser Phe He Ala Phe Leu Ala Thr Glu Val 
15 10 15 

Ser Ser Cys Phe Pro Gly Arg Pro Asp Cys Xaa Thr Gly Met Trp Leu 
20 25 30 

Leu Gin Leu Gin Lys Lys Gin Arg Thr Leu Leu Ala Met Ala Pro Arg 
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35 



40 



45 



Arg Xaa 
50 



(2) INFORMATION FOR SEQ ID NO: 417: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) T^fPE: amino acid . 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417: 

Asp Arg Pro Cys Pro Ser Ser Leu Trp Lys Val Phe Pro Leu Leu Leu 
15 10 15 

Leu Leu Met Arg Leu Phe Pro Leu Pro Val Pro Gly Asn Gin Arg Ala 
20 25 30 

Xaa Leu Pro His Pro Phe Xaa Ala Pro Arg Leu Pro Cys Leu Leu Cys 
35 40 45 

Leu Cys Thr Gin Gin Phe Xaa Val Cys Ser His Tyr Leu Pro Ala Gly 
50 55 60 

Tyr Arg Val Asn Ser Xaa 
65 '70 



(2) INFORMATION FOR SEQ ID NO: 418: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418: 

Met His Glu Lys Ala Trp Asn Leu lie Leu Leu Trp Trp Leu Ser Leu 
15 10 15 

Asp Leu Leu Gly Val Ala Lys Thr Ala Met Trp Ala Gin Trp Cys Gly 
20 25 30 

Leu Asn Asp His Lys Gly Lys Xaa 
35 40 



(2) INFORMATION FOR SEQ ID NO: 419: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419: 

Met Ala Phe Val Leu Leu Xaa Cys Phe Val Xaa Leu Gin Ser Ser Xaa 
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Gly Arg Ala Val Gin Xaa 
20 



(2) INFORMATIC»I FOR SEQ ID ::0: 422; 

10 (i) SEQUENCE CHARACTZ?-3?i:5: 

(A) LENGTH: 3 5 srrir.- acids 
■ (B) r/PE: amino acid 
(D) TOPOLCXPsT: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID JIG: 420: 



15 



25 



30 



35 



40 



Met Phe Ser Leu Leu Trp Leu Val C:,-3 Val Prs Ser Asn Ser Ser Val 
15 10 15 

Ala Asn Val Thr Ala Ser Arg Gly Gly Val Phe Lys Arg Ser Leu Gly 



20 20 



30 



His Glu Gly Phe Ser Xaa 
35 



(2) INFORMATION FOR SEQ ID i30: 421: 

(i) SEQUENCE CHARACIZRISTIC3: 

(A) LENGTH: 35 asrlr.o acic2s 

(B) TVPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRZPTIC:;: BZZ. ID rC: 421: 

Lys Trp Leu Leu Phe He Phe Leu Leu Cys leu Gin Leu Val Asn AJ.a 
15 10 15 

Leu Leu Ser Leu Phe Gin Glu Arg Phe Val rSs Oys Pro Ala Arg Phe 
20 25 ' 3: ' 

Val Ser Xaa 
35 



45 

(2) INFORMATION FOR SEQ ID :X>: 422: 

(i) SEQUENCE CHARACrZRISTICS : 

(A) LENGTH: 32 amino acids 
50 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ IE >3C: 422: 

Met Leu Leu Phe Leu Ser lis Thr Asn Ser Leu Ser Phe He Ser Val 
55 1 5 iQ 

Asp Lys Pro Phe Gly Gin Ser Glu Asp Val Cvs Pro Val He Ser Xaa 
20 . 25 ' 30 



60 
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(2) INFORMATION FOR SEQ ID NO: 423: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423: 

Met Glu Phe Leu Phe Asn Lys Thr Gly Trp Ala Phe Ala Ala Leu Cys 
15 10 15 

Phe Val Leu Ala Met Thr Ser Gly Gin Met Trp Asn His He Arg Gly 
20 25 30 

Pro Pro Tyr Ala His Lys Asn Pro His Thr Gly His Val Asn Tyr He 
35 40 45 

His Gly Ser Ser Gin Ala Gin Phe Val Ala Glu Thr His He Val Leu 
50 55 60 

Leu Phe Asn Gly Gly Val Thr Leu Gly Met Val Leu Leu Cys Glu Ala 
^5 70 75 80 

Ala Thr Ser Asp Met Asp He Gly Lys Arg Lys He Met Cys Val Ala 
85 90 95 . 

Gly He Gly Leu Val Val Leu Phe Phe Ser Trp Met Leu Ser He Phe 
100 105 110 

Arg Ser Lys Tyr His Gly Tyr Pro Tyr Ser Phe Leu Met Ser Xaa 
115 120 125 



(2) INPORMATI(»I FOR SEQ ID NO: 424: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 

Met Thr Trp His Ser Arg Glu Ser Phe Xaa Leu Leu Arg Val Val Ala 
1 5 10 . 15 

Pro Ser Gin Ala Pro Gly Met Gin Val Ser Pro Ser Gin Arg Ala Trp 
20 25 30 

Arg Arg Pro Leu His Arg Cys His Val Ala Ala Pro Arg Pro His His 
35 40 45 

Phe Ala Phe Phe Arg Asn Pro Phe Ser Trp Ser Phe He Lys Leu Leu 
50 55 60 

Tyr Arg Tyr Leu Xaa 
65 
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(2) INFORMATION FOR SEQ ID NO: 425: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425: 

Met Gly Leu Lys Leu Asn Gly Arg Tyr He Ser Leu He Leu Ala Val 
^5 10 15 

Gin He Ala Tyr Leu Val Gin Ala Val Arg Ala Ala Gly Lys Cys Asp 
20 25 30 

Ala Val Phe Lys Gly Phe Ser Asp Cys Leu Leu Lys Leu Gly Asp Thr 
35 40 45 

Trp Pro Thr Thr Arg Ser Leu Gly Arg Gin Asp Glu His Gin Asp Arg 
50 55 60 

Val His He Leu Gly Gly Phe Pro Gin Leu His Gly His Ser Pro Tyr 
^5 70 75 80 

Gly Leu Pro Gly Arg Gly Glu Arg Tyr Val Gly Xaa 
85 90 



(2) INFORMATION FOR SEQ ID NO: 426: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426: 

Met Ala Arg Arg Ser Ala Phe Pro Ala Ala Ala Leu Trp Leu Trp Ser 
. 1 . 5 10 15 

He Leu Leu Cys Leu Leu Ala Leu Arg Ala Glu Ala Gly Pro Pro Gin 
20 25 30 

Glu Glu Ser Leu Tyr Leu Trp He Asp Ala His Gin Ala Arg Val Leu 
35 40 45 

He Gly Phe Glu Glu Asp He Leu He Val Ser Glu Gly Lys Met Ala 
50 55 60 

Pro Phe Thr His Asp Phe Arg Lys Ala Gin Gin Arg Met Pro Ala He 
^5 70 75 80 

Pro val Asn He His Ser Met Asn Phe Thr Trp Gin Ala Ala Gly Gin 
85 90 95 

Ala Glu Tyr Phe Tyr Glu Phe Leu Ser Leu Arg Ser Leu Asp Lys Gly 
100 105 110 
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He Met Ala Asp Pro Thr Val Asn Val Pro Leu Leu Gly Thr Val Pro 
115 120 125 

His Lys Ala Ser Val Val Gin Val Gly Phe Pro Cys Leu Gly Lys Gin 
130 135 140 

Asp Gly Val Ala Ala Phe Glu Val Asp Val He Val Met Asn Ser Glu 
145 150 155 160 

Gly Asn Thr He Leu Gin Thr Pro Gin Asn Ala He Phe Phe Lys Thr 
165 170 175 

Cys Gin Gin Ala Glu Cys Pro Gly Gly Cys Arg Asn Gly Gly Phe Cys 
180 185 190 

Asn Glu Arg Arg He Cys Glu Cys Pro Asp Gly Phe His Gly Pro His 
195 200 205 

Cys Glu Lys Ala Leu Cys Thr Pro Arg Cys Met Asn Gly Gly Leu Cys 
210 215 220 

Val Thr Pro Gly Phe Cys He Cys Pro Pro Gly Phe Tyr Gly Val Asn 
225 230 235 240 

Cys Asp Lys Ala Asn Cys Ser Thr Thr Cys Phe Asn Gly Gly Thr Cys 
245 250 255 

Phe Tyr Pro Gly Lys Cys He Xaa Pro Pro Gly Leu Glu Gly Glu Gin 
260 265 270 

Cys Glu lie Ser Lys Cys Pro Gin Pro Cys Arg Asn Gly Gly Lys Cys 
275 280 285 

He Gly Lys Ser Lys Cys Lys Xaa Ser Lys Gly Tyr Gin Gly Asp Leu 
290 295 300 

Cys Ser Lys Pro Val Cys Glu Pro Gly Cys Gly Ala His Gly Thr Cys 
305 310 315 320 

His Glu Pro Asn Lys Cys Gin Cys Gin Glu Gly Trp His Gly Arg His 
325 330 335 

^Cys Asn Lys Arg Tyr Glu Ala Ser Leu He His Ala Leu Arg Pro Ala 
340 345 350 

Gly Ala Gin Leu Arg Gin His Thr Pro Ser Leu Lys Lys Ala Glu Glu 
355 360 365 



Arg Arg Asp Pro Pro Glu Ser Asn Tyr He Trp Xaa 
370 375 380 



(2) INFORMATION FOR SEQ ID NO: 427: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427: 
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Met Thr Ser Asn Leu Leu Leu Leu Thr leu Leu Leu Lys Asp Thr Leu 
1 5 10 15 

Xaa Leu Ala Lys Xaa Asn Xaa Xaa 
20 



(2) INFORMATION FOR SEQ ID NO: 428: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 andno acids 

(B) TyPE: amino acid 
(D) TOPOLOGV: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NC: 423: 

Met Arg His His Thr Gin Leu Asn Phe He Phe Leu Val Giu Met Val 
1 5 ' 10 15 

Phe Leu His Val Gly Gin Ala Gly Leu Lys Leu Pro Thr Ser Giy Asp 
20 25 30 

Xaa Ala Cys Phe Gly 'Leu Pro Lys Val Leu Gly Leu Gin Aia Zaa 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 429: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TVPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SE5QUENCE DESCRIPTION: SEQ ID NO: 429: 

Met Cys Ser Asp Xaa 
1 5 



(2) INFORMATION FOR SEQ ID NO: 430: 

(i) SEQUENCE CH7\RACTERISTICS : 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»J: SEQ ID NO: 430: 

Leu Leu Ser He Leu Leu Cys Leu Leu Ala Ser Gly Leu Val Val Phe 
1 5 10 15 

Phe Leu Phe Pro His Ser Val Leu Val Asp Asp Asp Gly He Lys Val 
20 25 30 

Val Lys Val Thr Phe Asn Lys Gin Asp Ser Leu Val He Leu Thr He 
35 40 45 

Met Ala Thr Leu Lys He Arg Asn Ser Asn Phe Tyr Thr Val Ala Val 
50 55 oX) 
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Thr Ser Leu Ser Ser Gin He Gin Tyr Met Asn Thr Val Val Asn Phe 
65 70 . 75 80 

Thr Gly Lys Ala Glu Met Gly Gly Pro Phe Ser Tyr Val Tyr Phe Phe 
85 90 95 

Cys Thr Val Pro Glu He Leu Val His Asn He Val He Phe Met Arg 
100 105 110 

Thr Ser Val Lys He Ser Tyr He Gly Leu Met Thr Gin Ser Ser Leu 
115 120 125 

Glu Thr His His Tyr Val Asp Cys Gly Gly Asn Ser Thr Ala He Xaa 
130 135 140 



(2) INFORMATIC»J FOR SEQ ID NO: 431: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) T^E: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 431: 

Met Phe Phe Phe Leu Tyr Val Tyr Ser Val Leu Cys Gly Leu Leu Val 
1 5 10 15 

Tyr Pro Ser Leu Pro Ser His Ser Val Ser Leu Val Thr Ser Leu Val 
20 25 30 

Ala Ser Ala Leu Xaa 
35 



(2) INFORMATION FOR SEQ ID NO: 432: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid . 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432: 

Met Ala Ser He Asn Ala Val Tyr He His Val Phe Leu Gly Val Cys 
15 10 15 

Val Gin Ala Thr Ala Ala Cys Pro Trp Cys Ser Gin Cys Arg Xaa Gly 
20 25 30 

Ser Val Pro Ser Xaa 
35 



(2) INFORMATION FOR SEQ ID NO: 433: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433: 

Met Met Ala Ala Met Val Leu Thr Ser Leu Ser Cys Ser Pro Val Val 
15 10 15 

Gin Ser Pro Pro Gly Thr Glu Ala Asn Phe Ser Ala Ser Arg Ala Ala 
20 25 30 

Cys Asp Pro Trp Lys Glu Ser Gly Asp lie Ser Asp Ser Gly Xaa Ser 
35 40 45 

Thr Thr Ser Gly His Trp Ser Gly Ser Ser Gly Val Ser Thr Pro Ser 
50 55 60 

Pro Pro His Pro Gin Ala Ser Pro Lys Tyr Leu Gly Asp Ala Phe Gly 
65 70 75 80 

Ser Pro Gin Thr Asp His Gly Phe Glu Thr Asp Pro Asp Pro Phe Leu 
85 90 . . 95 

Leu Asp Glu Pro Ala Pro Arg Lys Arg Lys Asn Ser Val Lys Val Met 
100 105 110 

Tyr Lys Cys Leu Trp Pro Asn Cys Gly Lys Val Leu Arg Ser He Val 
115 120 125 

Gly He Lys Arg His Val Lys Ala Leu His Leu Gly Asp Thr Val Asp 
130 135 140 

Ser Asp Gin Phe Lys Arg Glu Glu Asp Phe Tyr Tyr Thr Glu Val Gin 
145 150 155 160 

Leu Lys Glu Glu Ser Ala Afa Ala Ala Ala Ala Ala Ala Ala Asp Pro 
165 170 175 

Gin Ser Leu Gly Leu Pro Pro Pro Ser Gin Leu Pro Pro Pro Ala Xaa 
180 185 190 



(2) INFORMATION FOR SEQ ID NO: 434: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIOI: SEQ ID NO: 434: 

Met Ser Thr Asn Tyr Leu Thr Asp Val Cys Ser Leu Phe Ser Tyr Leu 
1-5 10 .15 

Asn Tyr Leu Tyr Phe His His His Leu Pro Val Pro Asn Thr Xaa 



wo 98/54963 



584 



20 25 30 



(2) INFORMATIC»J FOR SEQ ID NO: 435: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCMPTION : SEQ ID NO: 435: 

Met Gly Phe Phe Phe Val Leu Phe Phe Leu Tyr Leu Ala Leu Ser Arg 
15 10 15 

Asp Trp Ser lie Asn Phe Leu Lys Asp His Arg He Asn Phe Phe Val 
20 25 30 

Ala Thr Ser Tyr Phe Ser Val Tyr Val Arg Gly Xaa Pro Xaa Val Pro 
35 40 45 

Ala Asp Thr Pro Leu Gly Pro Leu Leu Ser Leu Trp Leu His His Asn 
50 55 60 

Ala Phe Phe Ser lie Leu Pro Lys Phe Pro Glu Asn Xaa Xaa Phe Leu 
65 70 75 80 

He Leu Lys Lys Leu Val Val Glu Mfet Gly Trp Asp Leu Phe He Ser 
85 90 95 

Pro Glu Asn Lys Xaa 
100 



(2) INFORMATION FOR SEQ ID NO: 436: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 436: 

Met Ala Arg Tyr Phe He Phe Phe He Leu Val Phe Met Lys Val Ser 
15 10 15 

Leu Asn Thr Thr Trp Pro Ala Pro Arg Pro Ala Thr Leu Arg Thr Ala 
20 25 30 

Asn Lys Ser Lys Xaa 
35 



(2) INFORMATION FOR SEQ ID NO: 437: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437: 

Phe Ser Thr He Arg Ser Gly Leu Thr Asp Arg Ser Val Asn Phe Leu 
1 5 10 15 

Phe Leu Phe Leu Asp Val Pro Asp Cys Arg Leu Val Asn He Glu Leu 
20 25 30 

Mtet Ala Asn Ser Thr Val Thr His Ala Xaa 
35 40 



(2) INFORMATION FOR SEQ ID NO: 438: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438: 

Leu 
1 



(2) INFORMATION FOR SEQ ID NO: 439: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 andno acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIOT: SBQ ID NO: 439: 

Met Pro Trp Arg Arg Ala Gly Leu Met Met Leu Pro He He Thr Gly 
1 5 10 15 

Cys Cys Pro Cys Ser Ala Ser He Xaa 
20 .25 



(2) INFORMATION FOR SEQ ID NO: 440: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIOI: SBQ ID NO: 440: 

Met Tyr Leu Cys Lys Thr Val Lys Val Leu He Cys Tyr Asp Trp He 
1 5 10 • 15 

Leu Gly Leu Val Ser Ser Gly Gin His Trp Val Val Ser Leu Ser Tyr 
20 25 30 

Ser He Arg Val Tyr Pro Ala Met His Phe Thr Leu Cys Val His He 
35 40 45 



Tyr Ser Lys Glu Pro Cys 
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50 



5 (2) INFORMATION FOR SBQ ID NO: 441: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IjENGTH: 42 amino acids 

(B) TYPE: amino acid 
10 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441: 

Met Thr Ala Leu Val Trp Arg Lys Gly Pro Asp Gly Gly Ser Arg Lys 
15 10 15 

Pro lie Leu Leu Leu Phe Phe Phe Leu Pro Leu He Leu Cys Phe His 
20 25 30 

Ser Phe He His Ser Ser Asn He Cys Xaa 
20 35 40 



25 



(2) INFORMATION FOR SEQ ID NO: 442: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 (xi) SEQUENCE DESCRIPTION: SBQ ID NO: 442: 

Met Phe Leu Thr Thr Trp Phe Leu Leu Leu Ser Val Ala .Trp Xaa Ala 
1 5 10 15 

35 Leu Thr Arg Ser Gly Arg Ser Cys Leu Pro Leu Val Gly Arg Pro Arg 
20 25 30 



40 



50 



Glu Gin Ser Pro Arg Thr His Cys Ala Ala Ser Ser Thr Lys Glu Arg 
35 40 45 

Asn Ser Asp Pro Gin Pro Ser Pro Pro Glu Val Val Gly Pro Leu Trp 
50 .55 60 



Ser Xaa 
45 65 



(2) INFORMATION FOR SEQ ID NO: 443: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

55 (xi) SEQUENCE DESCRIPTI<»J: SEQ ID NO: 443: 

Met Lys Ala lie Gly He Glu Pro Ser Leu Ala Thr Tyr His His He 
15 10 15 



60 



He Arg Leu Phe Asp Gin Pro Gly Asp Pro Leu Lys Arg Ser Ser Phe 
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20 



25 



30 



lie lie Tyr Asp lie Met Asn Glu Leu Met Gly Lys Arg Phe Ser Pro 
35 40 45 

5 

Lys Asp Pro Asp Asp Asp Lys Phe Phe Gin Ser Ala Met Ser lie Cys 
50 55 60 

Ser Ser Leu Arg Asp Leu Glu Leu Ala Tyr Gin Val His Gly Leu Leu 
10 65 70 75 80 

Lys Thr Gly Asp Asn Trp Lys Phe lie Gly Pro Asp Gin His Arg Asn 
85 90 95 

15 Phe Tyr Tyr Ser Lys Phe Phe Asp Leu lie Cys Leu Met Glu Gin lie 
100 105 110 



20 



Asp Val Thr Leu Lys Trp Tyr Glu Asp Leu lie Pro Ser Ala Tyr Phe 
115 120 125 

Pro His Ser Gin Thr Met lie His Leu Leu Gin Ala Leu Asp Val Ala 
130 135 140 



Asn Arg Leu Glu Val He Pro Lys He Trp Glu Arg 
25 145 150 155 



(2) INFORMATION FOR SEQ ID NO: 444: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444: 

Met His Phe Leu Phe Arg Phe He Val Phe Phe Tyr Leu Trp Gly Leu 
1 5 10 15 

40 Phe Thr Ala Gin Arg Gin Lys Lys Glu Glu Ser Thr Glu Glu Val Lys 
20 25 30 

He Glu Val Leu His Arg Pro Glu Asn Cys Ser Lys Thr Ser Lys Lys 
35 40 45 

45 

Gly Asp Leu Leu Lys Cys Pro Leu Xaa 
50 55 



50 

(2) INFORMATION FOR SEQ ID NO: 445: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 416 amino acids 
55 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445: 



30 



35 



60 



Met Arg Thr Leu Phe Asn Leu Leu Trp Leu Ala Leu Ala Cys Ser Pro 
15 10 15 
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10 



15 



20 



25 



30 



35 



40 



45 



Val His Thr Thr Leu Ser Lys Ser Asp Ala Lys Lys Ala Ala Ser Lys 
20 25 30 

Thr Leu Leu Glu Lys Ser Gin Phe Ser Asp Lys Pro Val Gin Asp Arg 
35 40 45 

Gly Leu Val Val Thr Asp Leu Lys Ala Glu Ser Val Val Leu Glu His 
50 55 60 

Arg Ser Tyr Cys Ser Ala Lys Ala Arg Asp Arg His Phe Ala Gly Asp 
65 70 75 80 

Val Leu Gly Tyr Val Thr Pro Trp Asn Ser His Gly Tyr Asp Val Thr 
85 90 95 

Lys Val Phe Gly Ser Lys Phe Thr Gin He Ser Pro Val Trp Leu Gin 
100 105 110 

Leu Lys Arg Arg Gly Arg Glu Met Phe Glu Val Thr Gly Leu His Asp 
115 120 125 

Val Asp Gin Gly Trp Met Arg Ala Val Arg Lys His Ala Lys Gly Leu 
130 135 140 

His He Val Pro Arg Leu Leu Phe Glu Asp Trp Thr Tyr Asp Asp Phe 
145 150 155 160 

Arg Asn Val Leu Asp Ser Glu Asp Glu He Glu Glu Leu Ser Lys Thr 
155 170 175 

Val Val Gin Val Ala Lys Asn Gin His Phe Asp Gly Phe Val Val Glu 
180 185 190 

Val Trp Asn Gin Leu Leu Ser Gin Lys Arg Val Gly Leu He His Met 
195 200 205 

Leu Thr His Leu Ala Glu Ala Leu His Gin Ala Arg Leu Leu Ala Leu 
210 215 220 

Leu Val He Pro Pro Ala He Thr Pro Gly Thr Asp Gin Leu Gly Met 
225 230 235 240 

Phe Thr His Lys Glu Phe Glu Gin Leu Ala Pro Val Leu Asp Gly Phe 
245 250 255 

Ser Leu Met Thr Tyr Asp Tyr Ser Thr Ala His Gin Pro Gly Pro Asn 
260 265 270 



50 



55 



Ala Pro Leu Ser Trp Val Arg Ala Cys Veil Gin Val Leu Asp Pro Lys 
275 280 285 

Ser Lys Trp Arg Ser Lys He Leu Leu Gly Leu Asn Phe Tyr Gly Met 
290 295 300 

Asp Tyr Ala Thr Ser .Lys Asp Ala Arg Glu Pro Val Val Gly Ala Arg 
305 310 315 320 



Tyr He Gin Thr Leu Lys Asp His Arg Pro Arg Met Val Trp Asp Ser 
60 325 330 335 
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Gin Xaa Ser Glu His Phe Phe Glu Tyr Lys Lys Ser Arg Ser Gly Arg 
340 345 350 

His Val Val Phe Tyr Pro Thr Leu Lys Ser Leu Gin Val Arg Leu Glu 
355 360 365 

Leu Ala Arg Glu Leu Gly Val Gly Val Ser He Trp Glu Leu Ala Arg 
370 375 380 

Ala Trp Thr Thr Ser Thr Thr Cys Ser Arg Trp Ala Leu Arg Pro Pro 
385 390 395 400 

Arg Trp Thr Cys Ser Phe Leu Ser His Gly Val Ser Glu Gin Val Xaa 
405 410 415 



(2) INFORMATION FOR SEQ ID NO: 446: 

(i) SEQUENCE CHARACTERISTICS: 

(A) I^NGTH: 64 amino acids 

(B) TYPE: aniino acid 
(D) TOPOLOGY: linear 

{xi> SEC3UENCE DESCRIPTION: SBQ ID NO: 446: 

Met Ala Pro Gly Pro Leu Ser Ala Thr Gin Ala Val Val He His Thr 
15 10 15 

Thr His Cys Leu Gin Leu Pro Val Trp Cys Leu Ser Leu Val Ser Glu 
20 25 30. 

Leu Leu Gly Arg Ala Pro Pro His Asn Lys Asp Ala Leu Arg Pro Ser 
35 40 45 

Lys Lys Lys Lys Lys Lys Leu Xaa Gly Gly Pro Val Pro He Pro Pro 
50 55 . '60 



(2) INFORMATICMI FOR SEQ ID NO: 447: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 206 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447: 

Met Leu Gly Ala Lys Pro His Trp Leu Pro Gly Pro Leu His Ser Pro 
1.5 10 15 

Gly Leu Pro Leu Val Leu Val Leu Leu Ala Leu Gly Ala Gly Tip Ala* 
20 25 30 



wo 98/54963 



590 



Gin Glu Gly Ser Glu Pro Val Leu Leu Glu Gly Glu Cys Leu Val Val 
35 40 45 

Cys Glu Pro Gly Arg Ala Ala Ala Gly Gly Pro Gly Gly Ala Ala Leu 
50 55 60 

Gly Glu Ala Pro Pro Gly Arg Val Ala Phe Ala Ala Val Arg Ser Xaa 
65 70 75 80 

His His Glu Pro Ala Gly Glu Thr Gly Asn Gly Thr Xaa Gly Ala lie 
85 90 95 

Tyr Phe Asp Gin Val Leu Val Asn Glu Gly Gly Gly Phe Asp Arg Ala 
100 105 110 

Ser Gly Ser Phe Val Ala Pro Val Arg Gly Val Tyr Ser Phe Arg Phe 
115 120 125 

His Val Val Lys Val Tyr Asn Arg Gin Thr Val Gin Val Ser Leu Met 
130 135 140 

Leu Asn Thr Trp Pro Val He Ser Ala Phe Ala Asn Asp Pro Asp Val 
145 150 155 X60 

Thr Arg Glu Ala Ala Thr Ser Ser Val Leu Leu Pro Leu Asp Pro Gly 
165 170 175 

Asp Arg Val Ser Leu Arg Leu Arg Arg Gly Asn Leu Leu Gly Gly Trp 
180 185 190 

Lys Tyr Ser Ser Phe Ser Gly Phe Leu He Phe Pro Leu Xaa 
195 200 205 



(2) INFORMATIC^J FOR SEQ ID NO; 448: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids . 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»J: SEQ ID NO: 448: 

Met Ser Ser Leu Leu Ser Ala Gly Leu Gin Ala Ser Leu Cys Gly Lys 
15 10 15 

Xaa Leu Trp Ala Ser Thr Trp Tyr Leu Val Cys Cys Leu Leu Pro Phe 
20 25 • 30 

Phe His Gin Gly Cys Cys Asp His Lys Ser Lys Gin Gin Tyr He Pro 
35 40 45 

Asn Leu Lys Ser Tyr Cys Gly Leu Ser Thr He Glu He Xaa 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 449: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 316 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449: 

Met Ser Thr Lys Lys Leu Cys lie Val Gly Gly lie Leu Leu Val Phe 
1 5 • . 10 15 

Gin He He Ala Phe Leu Val Gly Gly Leu He Ala Pro Gly Pro Thr 
20 25 30 

Thr Ala Val Ser Tyr Mfet Ser Val Lys Cys Val Asp Ala Arg Lys Asn 
35 ■ 40 45 

His His Lys Thr Lys Trp Phe Val Pro Trp Gly Pro Asn His Cys Asp 
50 55 60 

Lys He Arg Asp He Glu Glu Ala He Pro Arg Glu He Glu Ala Asn 
65 70 75 80 



Asp He Val Phe Ser Val His He Pro Leu Pro His Met Glu Met Ser 
85 90 95 

Pro Trp Phe Gin Phe Met Xaa Phe He Leu Gin Leu Asp He Ala Phe 
100 105 110 

Lys Leu Asn Asn Gin He Arg Glu Asn. Ala Glu Val Ser Mfet Asp Val 
115 120 125 

Ser Leu Ala Tyr Arg Asp Asp Ala Phe Ala Glu Tip Thr Glu Met Ala 
130 135 140 

His Glu Arg Val Pro Arg Lys Leu Lys Cys Thr Phe Thr Ser Pro Lys 
145 150 155 160 

Thr Pro Glu His Gly Gly Pro Val Thr Met Asn Val Met Ser Phe Leu 
165 170 175 

Ser Trp Lys Leu Gly Leu Trp Pro Met Lys Phe Tyr Leu Leu Asn He 
180 185 190 



Arg Leu Pro Val Asn Glu Lys Lys Lys He Asn Val Gly He Gly Glu 
195 200 205 

He Lys Asp He Arg Leu Val Gly He His Gin Asn Gly Gly Phe Thr 
210 215 220 

Lys Val Trp Phe Ala Met Lys Thr Phe Leu Thr Pro Ser He Phe lie 
225 230 235 - 240 

He Mfet Val Trp Tyr Trp .Arg Arg He Thr Met Met Ser Arg Pro Pro 
245 250 255 

Val Leu Leu Glu Lys Val He Phe Ala Leu Gly He Ser Met Thr Phe 
260 265 270 



He Asn He Pro Val Glu Trp Phe Ser He Gly Phe Asp Trp Thr Trp 
275 280 285 



Met 



Leu Leu Phe Gly Asp He Arg Gin Ala Ser Ser Met Xaa Cys Phe 
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290 



295 



300 



Xaa Pro Ser Gly Ser Ser Ser Val Ala Ser Thr Xaa 
305 310 315 



(2) INFORMATION FOR SEQ ID NO: 450: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTICMJ : SEQ ID NO: 450: 

Met Leu Ala Leu Leu Gly Leu Leu Ala Gly Thr Glu His Pro Pro Gly 
1 5 .10 15 

Pro Gin Gly Pro Gly Pro Ser Xaa 
20 



(2) INFORMATION FOR SEQ ID NO: 451: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEIK3TH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451: 

Met Pro Ser Gly Ala Cys Cys Ser Pro Xaa 
15 10 



(2) INFORMATION FOR SEQ IQ NO: 452: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452: 

Met Leu Pro Ala Leu Ser Thr Val Leu Leu Pro Thr Pro Ser Leu Cys 
15 10 15 

Ser Gly Asn Pro Arg Glu Gly Trp Ala. Xaa 
20 . . 25 



(2) INFORMATION FOR SEQ ID NO: 453: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453: 
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Met Tyr Ser Leu His Ser Trp Val Gly Leu He Ala Val He Cys Tyr 
15 10 15 

Leu Leu Gin Leu Leu Ser Gly Phe Ser Val Phe Leu Leu Pro Trp Ala 
20 25 30 

Pro Leu Ser Leu Arg Ala Phe Leu Met Pro He His Val Tyr Ser Gly 
35 40 45 

He Val He Phe Gly Thr Val lie Ala Thr Ala Leu Met Gly Leu Thr 
50 55 60 

Glu Lys. Leu He Phe Ser Leu Arg Asp Pro Ala Tyr Ser Thr Phe Pro 
65 70 75 80 

Pro Glu Gly Val Phe Val Asn Thr Leu Gly Leu Leu He Leu Val Phe 
85 90 95 



Gly Ala Leu He Phe Trp He Val Thr Arg Pro Gin Trp Lys Arg Pro 
100 105 ' 110 

Lys Glu Pro Asn Ser Thr He Leu His Pro Asn Gly Gly Thr Glu Gin 
115 120 125 

Gly Ala Arg Gly Ser Met Pro Ala Tyr Ser Gly Asn Asn Met Asp Lys 
130 135 140 

Ser Asp Ser Glu Leu Asn Xaa Glu Val Ala Ala Arg Lys Arg Asn Leu 
145 150 155 160 

Ala Leu Asp Glu Ala Gly Gin Arg Ser Thr Met Xaa 
165 170 



(2) INFORMATION FOR SEQ ID NO: 454: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 axnino acids 

(B) TYPE: amino acid 
(D). TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 454: 

Met Phe His Val Leu Met Ala Gin Val Thr Xaa Val He He Thr Thr 
15 10 15 

Val Ser Val Leu Val Phe Asp Phe Arg Pro Ser Leu Glu Phe Phe Leu 
20 25 30 

Glu Ala Xaa Ser Val Xaa Leu Ser He Phe He Tyr Asn Ala Ser Lys 
35 40 45 

Pro Gin Val Pro Glu Tyr Ala Pro Arg Gin Glu Arg He Arg Asp Leu 
50 55 60 

Ser Gly Asn Leu Trp Glu Arg Ser Ser Gly Asp Gly 
65 70 75 



Glu Glu Leu Glu 
80 



Arg Leu Thr Lys 



Pro Lys Ser Asp Glu Ser Asp Glu 
85 90 



Asp Thr Phe Xaa 
95 
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(2) INP0PMATIC»1 FOR SEQ ID NO: 455: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455: 

Met Arg Gly Pro Ala Gin Ala Lys Leu Leu Pro Gly Ser Ala lie Gin 
15 10 15 

Ala Leu Val Gly Leu Ala Arg Pro Leu Val Leu Ala Leu Leu Leu Val 
20 25 30 

Ser Ala Ala Leu Ser Ser Val Val Ser Arg Thr Asp Ser Pro Ser Pro 
35 40 45 

Thr Val Leu Asn Ser His lie Ser Thr Pro Asn Val Asn Ala Leu Thr 
50 55 60 

His Glu Asn Gin Thr Lys Pro Ser lie Ser Gin He Ser Thr Thr Leu 
65 70 75 80 

Pro Pro Thr Thr Ser Thr Lys Lys Ser Gly Gly Ala Ser Val Val Pro 
85 90 95 

His Pro Ser Pro Thr Pro Leu Ser Gin Glu Glu Ala Asp Asn Asn Glu 
100 105 110 

Asp Pro Ser He Glu Glu Glu Asp Leu Leu Met Leu Asn Ser Ser Pro 
115 ^ . 120 125 

Ser Thr Ala Lys Asp Thr Leu Asp Asn Gly Asp Tyr Gly Glu Pro Asp 
130 135 140 

Tyr Asp Trp Thr Thr Gly Pro Arg Asp Asp Asp Glu Ser Asp Xaa His 
145 150 155 160 

Leu Gly Arg Lys Gin Gly Leu His Gly Asn Xaa 
165 170 



(2) INFORMATION FOR SEQ ID NO: 456: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 aznino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456: 



Mfet Lys Ala Ser Gin Cys Cys Cys Cys Leu Ser His Leu Leu Ala Ser. 
15 10 15 
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Val Leu Leu Leu Leu Leu Leu Pro Glu Leu Ser Gly Xaa Leu Xaa Val 
20 25 30 

Leu Leu Gin Ala Ala Glu Ala Ala Pro Gly Xaa Gly Pro Pro Asp Pro 
35 40 45 

Arg Pro Gly His Tyr Arg Arg Cys His Arg Ala Leu Thr Pro Ala Gin 
50 55 60 

Gin Pro Gly Arg Gly Leu Ala Glu Ala Ala Gly Ala Ala Gly Leu Arg 
65 70 75 80 

Gly Arg Gin Trp Gin Gin Pro Cys Gly Arg Ala Xaa 
.85 90 



(2) INFOFMATION FOR SEQ ID NO: 457: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 206 amino acids 

(B> TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457: 

He Ser Val Leu Xaa Tyr Pro His Cys Val Val His Glu Leu Pro Glu 
1 5 10 15 

Leu Thr Ala Glu Ser Leu Glu Ala Gly Asp Ser Asn Gin Phe Cys Trp 
20 25 30 

Arg Asn Leu Phe Ser Cys He Asn Leu Leu Arg He Leu Asn Lys Leu 
35 40 45 . 

Thr Lys Trp Lys His Ser Arg Thr Met Met Leu Val Val Phe Lys Ser 
50 55 60 

Ala Pro He Leu Lys Arg Ala Leu Lys Val Lys Gin Ala Met Met Gin 
65 70 75 80 

Leu Tyr Val Leu Lys Leu Leu Lys Val Gin Thr Lys Tyr Leu Gly Arg 
85 90 95 

Gin Trp Arg Lys Ser Asn Met Lys Thr Met Ser Ala He Tyr Gin Lys 
100 105 110 

Val Arg His Arg Leu Asn Asp Asp Trp Ala Tyr Gly Asn Asp Leu Asp 
115 120 125 

Ala Arg Pro Trp Asp Phe Gin Ala Glu Glu Cys Ala Leu Arg Ala Asn 
130 135 140 

He Glu Arg Phe Asn Ala Arg Arg Tyr Asp Arg Ala His Ser Asn Pro 
145 150 155 160 

Asp Phe Leu Pro Val Asp Asn Cys Leu Gin Ser Val Leu Gly Gin Arg 
165 170 175 



Val Asp Leu Pro Glu Asp Phe Gin Met Asn Tyr Asp Leu Trp Leu Glu 
180 185 190 
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Arg Glu Val Phe Ser Lys Pro lie Ser Trp Glu Glu Leu Leu 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 458: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 317 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQXJENCE DESCRIPTION: SEQ ID NO: 458: 

15 Met Ala Pro Pro Ala Pro Gly Pro Ala Ser Gly Gly Ser Gly Glu Val 
15 10 15 



20 



35 



50 



Asp Glu Leu Phe Asp Val Lys Asn Ala Phe Tyr He Gly Ser Tyr Gin 
20 25 30 

Gin Cys He Asn Glu Ala Xaa Xaa Val Lys Leu Ser Ser Pro Glu Afg 
35 40 45 



Asp Val Glu Arg Asp Val Phe Leu Tyr Arg Ala Tyr Leu Ala Gin Arg 
25 50 55 60 

Lys Phe Gly Val Val Leu Asp Glu He Lys Pro Ser Ser Ala Pro Glu 
65 70 75 80 

30 Leu Gin Ala Val Arg Met Phe Ala Asp Tyr Leu Ala His oiu Ser Arg 

85 90 95 



Arg Asp Ser He Val Ala Glu Leu Asp Arg Glu Met Ser Arg Ser Xaa 
100 105 110 

Asp Val Thr Asn Thr Thr Phe Leu Leu Met Ala Ala Ser He Tyr Leu 
115 120 125 



His Asp Gin Asn Pro Asp Ala Ala Leu Arg Ala Leu His Gin Gly Asp 
40 130 135 140 

Ser Leu Glu Cys Thr Ala Met Thr Val Gin He Leu Leu Lys Leu Asp 
145 150 155 160 

45 Arg Leu Asp Leu Ala Arg Lys Glu Leu Lys Arg Met Gin Asp Leu Asp 
165 170 175 



Glu Asp Ala Thr Leu Thr Gin Leu Ala Thr Ala Trp Val Ser Leu Ala 
180 185 190 

Thr Gly Gly Glu Lys Leu Gin Asp Ala Tyr Tyr He Phe Gin Glu Met 
195 200 205 



Ala Asp Lys Cys Ser Pro Thr Leu Leu Leu Leu Asn Gly Gin Ala Ala 
55 210 215 220 

Cys His Met Ala Gin Gly Arg Trp Glu Ala Ala Glu Gly Leu Leu Gin 
225 230 235 240. 

60 Glu Ala Leu Asp Lys Asp Ser Gly Tyr Pro Glu Thr Leu Val Asn Leu 
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245 250 255 

lie Val Leu Ser Gin His Leu Gly Lys Pro Pro Glu Val Thr Asn Arg 
260 265 270 

Tyr Leu Ser Gin Leu Lys Asp Ala His Arg Ser His Pro Phe He Lys 
275 . 2B0 285 



Glu Tyr Gin Ala Lys Glu Asn Asp Phe Asp Arg Leu Val Leu Gin Tyr 
10 290 295 300 



15 



Ala Pro Ser Ala Glu Ala Gly Pro Glu Leu Ser Gly Pro 
305 310 315 



(2) INFORMATION FOR SEQ ID NO: 459: 



(i) SEQXffiNCE CHARACTERISTICS: 
20 (A) LENGTH: 261 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 459: 

25 Arg Asp Val Glu Arg Asp Val Phe Leu Tyr Arg Ala Tyr Leu Ala Gin 
15 10 15 

Arg Lys Phe Gly Val Val Leu Asp Glu He Lys Pro Ser Ser Ala Pro 
20 25 30 



30 



45 



Glu Leu Gin Ala Val Arg Met Phe Ala Asp Tyr Leu Ala His Glu Ser 
35 40 45 



Arg Arg Asp Ser He Val Ala Glu Leu Asp Arg Glu Met Ser Arg Ser 
35 50 55 60 

Xaa Asp Val Thr Asn Thr Thr Phe Leu Leu Met Ala Ala Ser He Tyr 
65 70 ' 75 80 

40 Leu His Asp Gin Asn Pro Asp Ala Ala Leu Arg Ala Leu His Gin Gly 

85 90 95 



Asp Ser Leu Glu Cys Thr Ala Met Thr Val Gin He Leu Leu Lys Leu 
100 105 110 

Asp Arg Leu Asp Leu Ala Arg Lys Glu Leu Lys Arg Met Gin Asp Leu 
115 120 125 



Asp Glu Asp Ala Thr Leu Thr Gin Leu Ala Thr Ala Trp Val Ser Leu 
50 130 135 140 

Ala Thr Gly Gly Glu Lys Leu Gin Asp Ala Tyr Tyr He Phe Gin Glu 
145 150 155 160 

55 Met Ala Asp Lys Cys Ser Pro Thr Leu Leu Leu Leu Asn Gly Gin Ala 
165 170 175 



60 



Ala Cys His Met Ala Gin Gly Arg Trp Glu Ala Ala Glu Gly Leu Lea 
180 185 190 
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Gin Glu Ala Leu Asp Lys Asp Ser Gly Tyr Pro Glu Thr Leu Val Asn 
195 200 205. 

Leu He Val Leu Ser Gin His Leu Gly Lys Pro Pro Glu Val Thr Asn 
. 210 215 220 

Arg Tyr Leu Ser Gin Leu Lys Asp Ala His Arg Ser His Pro Phe He 
225 230 235 240 

Lys Glu Tyr Gin Ala Lys Glu Asn Asp Phe Asp Arg Leu Val Leu Gin 
245 250 255 

Tyr Ala Pro Ser Ala 
260 



(2) INFORMATION FOR SEQ ID NO: 460: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460: 

Met Lys Ala He Gly He Glu Pro Ser Leu Ala Thr Tyr His His He 
15 10 15 

He Arg Leu Phe Asp Gin Pro Gly Asp Pro Leu Lys Arg Ser Ser Phe 
20 25 30 

He He Tyr Asp He Met Asn Glu Leu Met Gly Lys Arg Phe Ser Pro 
35 40 45 

Lys Asp Pro Asp Asp Asp Lys Phe Phe Gin Ser Ala Met Ser He Cys 
50 55 60 

Ser Ser Leu Arg Asp Leu Glu Leu Ala Tyr Gin Val His Gly Leu Leu 
65 70 75 80 

Lys Thr Gly Asp Asn Trp Lys Phe He Gly Pro Asp Gin His Arg Asn 
85 90 95 

Phe Tyr Tyr Ser Lys Phe Phe Asp Leu He Cys Leu Mtet Glu Gin He 
100 105 110 

Asp Val Thr Leu Lys Trp Tyr Glu Asp Leu He Pro Ser Ala Tyr Phe 
115 120 125 

Pro His Ser Gin Thr Met He His Leu Leu Gin Ala Leu Asp Val Ala 
130 135 140 

Asn Arg Leu Glu Val He Pro Lys He Trp Glu Arg 
145 150 155 



<2) INFORMATION FOR SEQ ID NO: 461: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 176 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 461: 

Lys Asp Ser Lys Glu Tyr Gly His Thr Phe Arg Ser Asp Leu Arg Glu 
15 10 15 

Glu lie Leu Met Leu Met Ala Arg Asp Lys His Pro Pro Glu Leu Gin 
20 25 30 

Val Ala Phe Ala Asp Cys Ala Ala Asp lie Lys Ser Ala Tyr Glu Ser 
35 .40 45 

Gin Pro He Arg Gin Thr Ala Gin Asp Trp Pro Ala Thr Ser Leu Asn 
50 55 60 

Cys He Ala He Leu Phe Leu Arg Ala Gly Arg Thr Gin Glu Ala Trp 
65 70 75 80 

Lys Met Leu Gly Leu Phe Arg Lys His Asn Lys He Pro Arg Ser Glu 
85 90 95 

Leu Leu Asn Glu Leu Met Asp Ser Ala Lys Val Ser Asn Ser Pro Ser 
100 105 110 

Gin Ala He Glu Val Val Glu Leu Ala Ser Ala Phe Ser Leu Pro He 
115 120 125 

Cys Glu Gly Leu Thr Gin Arg Val Met Ser Asp Phe Ala He Asn Gin 
130 135 140 

Glu Gin Lys Glu Ala Leu'Ser Asn Leu Thr Ala Leu Thr Ser Asp Ser 
145 150 155 160 

Asp Thr Asp Ser Ser Ser Asp Ser Asp Ser Asp Thr Ser Glu Gly Lys 
165 170 175 



(2) INFORMATKMJ FOR SEQ ID NO: 462: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 324 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462: 

Met Ser Ser Asp Asn Glu Ser Asp He Glu Asp Glu Asp Leu Lys Leu 
15 10 15 

Glu Leu Arg Arg Leu Arg Asp Lys His Leu Lys Glu He Gin Asp Leu 
20 25 30 



Gin Ser Arg Gin Lys His Glu He Glu Ser Leu Tyr Thr Lys i:»eu. Gly 
35 40 45 
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Lys Val Pro Pro Ala Val He He Pro Pro Ala Ala Pro Leu Ser Gly 
50 55 60 

Arg Arg Arg Arg Pro Thr Lys Ser Lys Gly Ser Lys Ser Ser Arg Ser 
5 65 70 75 80 

Ser Ser Leu Gly Asn Lys Ser Pro Gin Leu Ser Gly Asn Leu Ser Gly 
85 90 95 

10 Gin Ser Ala Ala Ser Val Leu His Pro Gin Gin Thr Leu His Pro Pro 
100 105 110 

Gly Asn He Pro Glu Ser Gly Gin Asn Gin Leu Leu Gin Pro Leu Lys 
115 120 125 

15 

Pro Ser Pro Ser Ser Asp Asn Leu Tyr Ser Ala Phe Thr Ser Asp Gly 
130 135 140 

Ala He Ser Val Pro Ser Leu Ser Ala Pro Gly Gin Gly Thr Ser Ser 
20 145 150 155 160 

Thr Asn Thr Val Gly Ala Thr Val Asn Ser Gin Ala Ala Gin Ala Gin 
165 . 170 175 

25 Pro Pro Ala Met Thr Ser Ser Arg Lys Gly Thr Phe Thr Asp Asp Leu 
180 185 190 

His Lys Leu Val Asp Asn Trp Ala Arg Asp Ala Met Asn Leu Ser Gly 

195 200 205 

30 

Arg Arg Gly Ser Lys Gly His Met Asn Tyr Glu Gly Pro Gly Met Ala 
210 215 220 

Arg Lys Phe Ser Ala Pro Gly Gin Leu Cys He Ser Met Thr Ser Asn 
35 225 230 235 240 

Leu Gly Gly Ser Ala Pro He Ser Ala Ala Ser Ala Thr Ser Leu Gly 
245 250 .255 

40 His Phe Thr Lys Ser Met Cys Pro Pro Gin Gin Tyr Gly Phe Pro Ala 
260 265 270 

Thr Pro Phe Gly Ala Gin Trp Ser Gly Thr Gly Gly Pro Ala Pro Gin 
275 280 285 

45 . - 

Pro Leu Gly Gin Phe Gin Pro Val Gly Thr Ala Ser Leu Gin Asn Phe 
290 295 300 

Asn He Ser Asn Leu. Gin Lys Ser He Ser Asn Pro Pro Gly Ser Asn 
50 305 310 315 320 

Leu Arg Thr Thr 

55 

(2) INFORMATION FOR SEQ ID NO: 463: 



60 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463: 

He Gin Asp Leu Gin Ser Arg Gin Lys His Glu lie Glu Ser Leu Tyr 
15 10 15 

Thr Lys Leu Gly Lys Val Pro Pro Ala Val He He Pro Pro Ala Ala 
20 25 30 

Pro Leu Ser Gly Arg Arg Arg Arg Pro Thr Lys Ser Lys Gly Ser Lys 
35 40 45 

Ser Ser Arg Ser Ser Ser Leu Gly Asn Lys Ser Pro Gin Leu Ser Gly 
50 55 60 

Asn Leu Ser Gly Gin Ser Ala Ala Ser Val Leu His Pro Gin Gin Thr 
65 70 75 80 

Leu His Pro Pro Gly Asn He Pro Glu Ser Gly Gin Asn Gin Leu Leu 
85 90 95 

Gin Pro Leu Lys Pro Ser Pro Ser Ser Asp Asn Leu Tyr Ser Ala Phe 
100 105 110 

Thr Ser Asp Gly Ala He Ser Val Pro Ser Leu Ser Ala Pro Gly Gin 
115 120 125 

Gly Thr Ser Ser Thr 
130 



(2) INFORMATION FOR SEQ ID NO: 464: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: ^53 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»3 : SEQ ID NO: 464: 

Thr Ser Asp Gly Ala He Ser Val Pro Ser Leu Ser Ala Pro Gly Gin 
15 10 15 

Gly Thr Ser Ser Thr Asn Thr Val Gly Ala Thr Val Asn Ser Gin Ala 
20 25 30 

Ala Gin Ala Gin Pro Pro Ala Met Thr Ser Ser Arg Lys Gly Thr Phe 
35 40 45 

Thr Asp Asp Leu His 
50 



(2) INFORMATION FOR SEQ ID NO: 465: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465: 

Lys Gly His Met Asn Tyr Glu Gly Pro Gly Met Ala Arg Lys Phe Ser 
5 1 5 10 15 

Ala Pro Gly Gin Leu Cys He Ser Met Thr Ser Asn Leu Gly Gly Ser 
20 25 30 

10 Ala Pro He Ser Ala Ala Ser Ala Thr Ser Leu Gly His Phe Thr Lys 
35 40 45 



15 



(2) INFORMATION FOR SEQ ID NO: 4.66: 

20 (i) SeIsOENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466: 

25 

Gin Pro Leu Lys Pro Ser Pro Ser Ser Asp Asn Leu Tyr Ser Ala Phe 
1 5 10 15 

Thr Ser Asp Gly Ala He Ser Val Pro Ser Leu Ser Ala Pro Gly 
30 20 25 30 



35 



50 



(2) INFORMATION FOR SEQ ID NO: 467: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: ^57 amino acids • 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467: 

Val Arg Val Ala Ala Ala Glu Ser Met Xaa Leu Leu Leu Glu Cys Ala 
1 5 10 15 

45 Xaa Val Arg Gly Pro Glu Tyr Leu Thr Gin Met Trp His Phe Met Cys 
20 25 30 

ASP Ala Leu He Lys Ala He Gly Thr Glu Pro Asp Ser Asp Val Leu 
35 . 40 45 . 



Ser Glu He Met His Ser Phe Ala Lys 
50 55 



55 

(2) INFORMATION FOR SEQ ID NO: 468: 



60 



(i) SEQUENCE CHAEIACTERISTICS : 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468: 

Met Glu lie Asn Asn Gin Asn Cys Phe lie Val He Asp Leu Val Arg 
15 10 15 

Thr Val Met Glu Asn Gly Val Glu Gly Leu Leu He Phe Gly Ala Phe 
20 25 30 

Leu Pro Glu Ser Trp Leu He Gly Val Arg Cys Ser Ser Glu Pro Pro 
35 40 45 

Lys Ala Leu Leu Leu He Leu Ala His Ser Gin Lys Arg Arg Leu Asp 
50 55 60 

Gly Trp Ser Phe He Arg His Leu Arg Val His Tyr Cys Val Ser Leu 
65 70 75 80 

Thr He His Phe Ser 
85 



(2) INFORMATIOSI FOR SEQ ID NO: 469: 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469: 

Gin Asp Lys His Ala Glu Glu Val Arg Lys Asn Lys Glu Leu Lys Glu 
1 5 10 15 

Glu Ala Ser Arg 
20 



(2) INFORMATION FOR SEQ ID NO: 470: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470: 

Gin Gin Asp Leu Ser Pro Trp Ala Ala Pro Val Gly Cys Pro Leu Xaa 
15 10 15 

Xaa Ala Ser Xaa Thr Cys His Xaa Leu Pro Leu Ser Gly Cys Leu Arg 
20 25 30 

Arg Gin Ser Xaa Ser Leu Pro Val Val Ala Xaa Leu Cys Phe Trp Phe 
35 40 45 

Ser Cys Pro hen Ala Ser Leu Phe Val Pro Gly Gin Pro Cys Val Thr 
50 55 60 

Cys Pro Phe Pro Ser Leu Pro Phe Gin Asp Lys His Ala Glu Glu Val 
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70 



75 



80 



Arg Zr/s >-sr. l*.-3 Glu Leu Lys Glu Glu Ala Ser Arg 
35 90 



:2) =!3C?2-3Lri::: FCR SEQ id NO: 471: 

:i; sz:;"Jz::cE characteristics: 

(A) LE^JGTH; 37 amino acids 
(3) TY?E: amino acid 
O) TOPOLOGY: linear 
y.Li SZCC-ZTSCE DESCRIPTION: SEQ ID NO: 471: 

?ro rhr Arg Cys Cys Thr Thr Gin Pro Cys Arg Ser Ser Ala Arg Arg 
1 ' *5 10 15 

Pro Cys rrp Val Pro Met Val Pro Ser Pro Glu Gly Arg Glu Xaa Gin 
23 25 30 

Pro Tnr Cys Pro Ser 



?2) ZITCPZ-GTIC:: ?CR SEQ id NO: 472: 

'i) ss:;-"i::cs characteristics: 

(A) LENGTH: 363 amino acids 
(3) TY?5: amino acid 
(D) TOPOLOGY: linear 
'xi; S3QuE2;C3 DESCRIPTION: SEQ ID NO: 472: 

!-:er Lys Arg Ser Leu Asn Glu Asn Ser Ala Arg Ser Thr Ala Gly Cys 
15 10 15 



leu Pre Val Pro Leu Phe Asn Gin Lys Lys Arg Asn Arg Gin Pro Leu 
2: 25 30 

Thr Ser Asr. Pro Leu Lys Asp Asp Ser Gly lie Ser Thr Pro Ser Asp 
35 * 40 45 

Asn rvr Asp Phe Pro Pro Leu Pro Thr Asp Trp Ala Trp Glu Ala Val 
50 ' 55 60 

AST. Pro 31u Xaa Ala Pro Val Met Lys Thr Val Asp Thr Gly Gin He 
55 70 75 80 

Pro His Ser Val Ser Arg Pro Leu Arg Ser Gin Asp Ser Val Phe Asn 
85 90 95 

Ser lie Gin Ser Asn Thr Gly Arg Ser Gin Gly Gly Trp Ser Tyr Arg 
i:0 105 110 

Asp Gly Asr- Lys Asn Thr Ser Leu Lys Thr Trp Xaa Lys Asn Asp Phe 
115 " 120 125 



Lys Pro Gin Cys Lys Arg Thr Asn Leu Val Ala Asn Asp Gly Lys Asn 
130 135 140 
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Ser Cys Pro Met Ser Ser Gly Ala Gin Gin Gin Lys Gin Leu Arg Thr 
145 150 155 160 

Pro Glu Pro Pro Asn Leu Ser Arg Asn Lys Glu Thr Glu Leu Leu Arg 
165 170 175 

Gin Thr His Ser Ser Lys He Ser Gly Cys Thr Met Arg Gly Leu Asp 
180 185 190 

Lys Asn Ser Ala Leu Gin Thr Leu Lys Pro Asn Phe Gin Gin Asn Gin 
195 200 205 

Tyr Lys Xaa Gin Met Leu Asp Asp He* Pro Glu Asp Asn Thr Leu Lys 
210 215 220 

Glu Thr Ser Leu Tyr Gin Leu Gin Phe Lys Glu Lys Ala Ser Ser Leu 
225 230 235 240 

Arg He He Ser Ala Val He Glu Ser Met Lys Tyr Trp Arg Glu His 
245 250 255 

Ala Gin Lys Thr Val Leu Leu Phe Glu Val Leu Ala Val Leu Asp Ser 
260 265 270 

Ala Val Thr Pro Gly Pro Tyr Tyr Ser Lys Thr Phe Leu Met Arg Asp 
275 280 285 

Gly Lys Asn Thr Leu Pro Cys Val Phe Tyr Glu He Asp Arg Glu Leu 
290 295 300 

Pro Arg Leu He Arg Gly Arg Val His Arg Cys Val Gly Asn Tyr Asp 
305 310 315 320 

Gin Lys Lys Asn He Phe Gin Cys Val Ser Val Arg Pro Ala Ser Val 
325 330 335 

Ser . Glu Gin Lys Thr Phe Gin Ala Phe Val Lys He Ala Asp Val Glu 
340 345 350 



Met Gin Tyr Tyr He Asn Val Met Asn Glu Thr 
355 360 



(2) INFORMATION FOR SEQ ID NO: 473: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTICN: SEQ ID NO: 473: 

ser Gin Asp Ser Val Phe Asn Ser He Gin Ser Asn Thr Gly Arg Ser 
1 5 10 15 

Gin Gly Gly Trp Ser Tyr Arg Asp Gly Asn Lys Asn Thr Ser Leu Lys 
20 25 30 



Thr Trp Xaa Lys Asn Asp Phe Lys Pro Gin Cys Lys Arg 
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35 40 45 



(2) INFORMATION FOR SEQ ID NO: 474: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474: 

Asn Lys Glu Thr Glu Leu Leu Arg Gin Thr His Ser Ser Lys He Ser 
1 5 10 15 

Gly Cys Thr Met Arg Gly Leu Asp Lys Asn Ser Ala Leu Gin Thr Leu 
20 25 30 

Lys Pro Asn Phe 
35 



(2) INFORMATION FOR SEQ ID NO: 475: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475: 

Ser Ser Leu Arg He He Ser Ala Val He Glu Ser Met Lys Tyr Trp 
1 5 10 15 

Arg Glu His Ala Gin Lys Thr Val Leu Leu Phe Glu Val Leu Ala Val 
20 25 30 

Leu Asp Ser * Ala Val Thr Pro Gly Pro Tyr Tyr Ser Lys Thr Phe Leu 
35 40 45 

Met 



(2) INFORMATION FOR SEQ ID NO: 476: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 476: 

Pro Arg Leu He Arg Gly Arg Val His Arg Cys Val Gly Asn Tyr Asp 
15 10 15 

Gin Lys Lys Asn He Phe Gin Cys Val Ser Val Arg Pro Ala Ser Val 
20 25 30 



Ser Glu Gin Lys Thr Phe Gin Ala Phe Val 
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35 



40 



(2) INFORMATION FOR SEQ ID NO: 477: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 370 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 477: 

Gly Val Phe Arg Pro Cys Val Cys Gly Arg Pro Ala Ser Leu Thr Cys 
1 5 10 15 

Ser Pro Leu Asp Pro Glu Val Gly Pro Tyr Cys Asp Thr Pro Thr Met 
20 25 30 

Arg Thr Leu Phe Asn Leu Leu Trp Leu Ala Leu Ala Cys Ser Pro Val 
35 40 45 

His Thr Thr Leu Ser Lys Ser Asp Ala Lys Lys Ala Ala Ser Lys Thr 
50 55 60 

Leu Leu Glu Lys Ser Gin Phe Ser Asp Lys Pro Val Gin Asp Arg Gly 
65 70 75 80 

Leu Val Val Thr Asp Leu Lys Ala Glu Ser Val Val Leu Glu His Arg 
85 90 95 

Ser Tyr Cys Ser Ala Lys Ala Arg Asp Arg His Phe Ala Gly Asp Val 
100 105 110 

Leu Gly Tyr Val Thr Pro Trp Asn Ser His Gly Tyr Asp Val Thr Lys 
115 120 125 

Val Phe Gly Ser Lys Phe Thr Gin lie Ser Pro Val Trp Leu Gin Leu 
130 135 140 

Lys Arg Arg Gly Arg Glu Met Phe Glu Val Thr Gly Leu His Asp Val ' 
145 150 155 160 

Asp Gin Gly Trp Met Arg Ala Val Arg Lys His Ala Lys Gly Leu His 
165 170 175 

He Val Pro Arg Leu Leu Phe Glu Asp Trp Thr Tyr Asp Asp Phe Arg 
180 185 190 

Asn Val Leu Asp Ser Glu Asp Glu He Glu Glu Leu Ser Lys Thr Val 
195 200 205 

Val Gin Val Ala Lys Asn Gin His Phe Asp Gly Phe Val val Glu Val 
210 215 220 

Trp Asn Gin Leu Leu Ser Gin Lys Arg Val Gly Leu He His Met Leu 
225 230 235 240 

Thr His Leu Ala Glu Ala Leu His Gin Ala Arg Leu Leu Ala Leu Leu 
245 250 255 
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Val lie Pro Pro Ala lie Thr Pro Gly Thr Asp Gin Leu Gly Met Phe 
260 265 270 

Thr His Lys Glu Phe Glu Gin Leu Ala Pro Val Leu Asp Gly Phe Ser 

275 280 285 

Leu Met Thr Tyr Asp Tyr Ser Thr Ala His Gin Pro Gly Pro Asn Ala 
290 295 300 

Pro Leu Ser Trp Val Arg Ala Cys Val Gin Val Leu Asp Pro Lys Xaa 
305 310 315 , 320 

Lys Trp Arg Thr Lys Ser Ser Trp Gly Ser Thr Ser Met Xaa Trp Thr 
325 330 335 

Xaa Arg Xaa Pro xaa Asp Ala Arg Xaa Pro Val Val Gly Xaa Arg Xaa 
340 345 350 

He Gin Xaa Leu Lys Asp His Xaa Pro Arg Met Val Leu Asp Ser Lys 
355 360 365 

Pro Gin 
370 



(2) INFORMATION FOR SEQ ID NO: 478: 

(i) SEQUENCE CHARACTERISTICS: 

(A) I.ENGTH: 39 amino acids 

<B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478: 

Thr Cys Ser Pro Leu Asp Pro . Glu Val Gly Pro Tyr Cys Asp Thr Pro 
1 5 10 . 15 

Thr Met Arg Thr Leu Phe Asn Leu Leu Trp Leu Ala Leu Ala Cys Ser 
20 25 30 

Pro Val His Thr Thr Leu Ser 
35 



(2) INFORMATION FOR SEQ ID NO: 479: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479: 

Leu Val Val Thr Asp Leu Lys Ala Glu Ser Val Val Leu Glu His Arg 
15 10 15 

Ser Tyr Cys Ser Ala Lys Ala Arg Asp Arg His Phe Ala Gly Asp Val 
. 20 25 30 

Leu Gly Tyr Val Thr Pro Trp Asn Ser His Gly Tyr Asp Val Thr Lys 
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35 



Val Phe Gly Ser Lys Phe 
50 



40 



45 



(2) INFORMATION FOR SEQ ID NO: 480: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 480: 

Arg Glu Met Phe Glu Val Thr Gly Leu His Asp Val Asp Gin Gly Trp 
15 10 15 

Met Arg Ala Val Arg Lys His Ala Lys Gly Leu His lie Val Pro Arg 
20 25 30 

Leu Leu Phe Glu Asp Trp Thr Tyr Asp Asp Phe Arg Asn Val Leu Asp 
35 40 45 

Ser Glu Asp Glu . 
50 



(2) INFORMATION FOR SEQ ID NO: 481: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481: 

His Phe Asp Gly Phe Val Val Glu Val Trp Asn Gin Leu Leu Ser Gin 
15 10 15 

Lys Arg Val Gly Leu lie His Met Leu Thr His Leu Ala Glu Ala Leu 
20 25 30 

His Gin Ala Arg Leu Leu Ala Leu Leu Val He Pro Pro Ala He Thr 
35 40 - 45 

Pro Gly Thr Asp Gin Leu Gly Met 
50 55 



(2) INFORMATION FOR SEQ ID NO: 482: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

<B) TYPE: amino acid 

(D) TOPOLOGY: linear 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482: 

Asp Gly Phe Ser Leu Met Thr Tyr Asp Tyr Ser Thr Ala His Gin Pro 
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1 5 10 15 

Gly Pro Asn Ala Pro Leu Ser Trp Val Arg Ala Cys Val Gin Val Leu 
20 25 30 

Asp Pro Lys Xaa Lys Trp Arg Thr Lys Ser Ser Trp Gly Ser Thr 
35 "40 45 



(2) INFORMATION FOR SEQ ID NO: 483: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483: 

Glu Arg Gly Val Ser lie Asn Gin Phe Cys Lys Glu Phe Asn Glu Arg 
1 5 10 • 15 . 

Thr Lys Asp He Lys Glu Gly He Pro Leu Pro Thr Lys He Leu Val 
20 . 25' 30 

Lys Pro Asp Arg Thr Phe Glu He Lys He Gly Gin Pro Thr Val Ser 
35 40 45 

Tyr Phe Leu Lys Ala Ala Ala Gly He Glu Lys Gly Ala Arg Gin Thr 
50 55 60 

Gly Lys Glu Val Ala Gly Leu Val Thr Leu Lys His Val Tyr Glu He 
65 70 75 80 

Ala Arg He Lys Ala Gin Asp Glu Ala Phe Ala Leu Gin Asp Val Pro 
85 90 95 

Leu Ser Ser Val Val Arg Ser He He Gly Ser Ala Arg Ser Leu Gly 
100 "105 , 110 

He Arg Val Val Lys Asp Leu Ser Ser Glu Glu Leu Ala Ala Phe Gin 
115 120 125 

Lys Glu Arg Ala He Phe Leu Ala Ala Gin Lys Glu Ala Asp Leu Ala 
130 135 140 

Ala Gin Glu Glu Ala Ala Lys Lys 
145 150 



(2) INFORMATION FOR SEQ ID NO: 484: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 484: 



Ala Val Tyr Thr Tyr His Glu Lys Lys Lys Asp Thr Ala Ala Ser Gly 
1 5 10 15 
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Tyr Gly Thr Gin Asn lie Arg Leu Ser Arg Asp Ala Val Lys Asp Phe 
20 25 30 

5 Asp Cys Cys Cys Leu Ser Leu Gin Pro Cys His Asp Pro Val Val Thr 
35 40 45 



10 



25 



40 



50 



Pro Asp Gly Tyr Leu Tyr Glu Arg Glu Ala He Leu Glu Tyr He Leu 
50 55 60 

His Gin Lys Lys Glu He Ala Arg Gin Met Lys Ala Tyr Glu Lys Gin 
65 70 -75 80 



Arg Gly Thr Arg Arg Glu Glu Gin Lys Glu Leu Gin Arg Ala Ala Ser 
15 85 • 90 95 

Gin Asp His Val Arg Gly Phe Leu Glu Lys Glu Ser Ala He Val Ser 
100 105 110 

20 Arg Pro Leu Asn Pro Phe Thr Ala Lys Ala Leu Ser Gly Thr Ser Pro 
115 120 125 



Asp Asp val Gin Pro Gly Pro Ser Val Gly Pro Pro Ser Lys Asp Lys 
130 135 140 

Asp Lys Val Leu Pro Ser Phe Trp He Pro Ser Leu Thr Pro Glu Ala 
145 150 155 160 



Lys Ala Thr Lys Leu Glu Lys Pro Ser Arg Thr Val Thr Cys Pro Met 
30 165 170 175 

Ser Gly Lys Pro Leu Arg Met Ser Asp Leu Thr Pro Val His Phe Thr 
180 185 190 

35 Pro Leu Asp Ser Ser Val Asp Arg Val Gly Leu He Thr Arg Ser Glu 
195 200 205 



Arg Tyr Val Cys Ala Val Thr Arg Asp Ser Leu Ser Asn Ala Thr Pro 
210 215 220 

Cys Ala Val Leu Arg Pro Ser Gly Ala Val Val Thr Leu Glu Cys Val 
225 230 235 240 



Glu Lys Leu He Arg Lys Asp Met Val Asp Pro Val Thr Gly Asp Lys 
45 245 250 255 



Leu Thr Asp Arg Asp He He Val Leu Gin Arg Gly Gly Thr 
260 265 270 



(2) INFORMATION FOR SEQ ID NO: 485: 



(i) SEQUENCE CHAi?ACTERISTICS : 
55 (A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485: 

60 Tyr Leu Tyr Glu Arg Glu Ala He Leu Glu Tyr He Leu His Gin Lys 
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5 



1 5 10 15 

Lys Glu He Ala Arg Gin Met Lys Ala Tyr Glu Lys Gin Arg Gly Thr 
20 25 30 

Arg Arg Glu Glu Gin Lys Glu Leu Gin Arg Ala Ala Ser Gin Asp His 
35 40 45 



Val Arg Gly Phe Leu Glu 
10 50 



15 



(2) INFORMATION FOR SEQ ID NO: 486: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SBQ ID NO: 486: 

Phe Thr Ala Lys Ala Leu Ser Gly Thr Ser Pro Asp Asp Val Gin Pro 
1 5 10. 15 

25 Gly Pro Ser Val Gly Pro Pro Ser Lys Asp Lys Asp Lys Val Leu Pro 
20 25 30 



30 



35 



40 



Ser Phe Trp He Pro Ser Leu Thr Pro Glu Ala Lys Ala Thr Lys Leu 
35 40 45 

Glu Lys Pro Ser Arg Thr Val Thr Cys Pro Met Ser Gly Lys Pro Leu 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 487: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 487: 

Val His Phe Thr Pro Leu Asp Ser Ser Val Asp Arg Val Gly Leu He 
1 5 . 10 15 

50 Thr Arg Ser Glu Arg Tyr Val Cys Ala Val Thr Arg Asp Ser Leu Ser 
20 25 30 

Asn Ala Thr Pro Cys Ala Val Leu Arg Pro Ser Gly Ala Val Val Thr 
35 40 45 

55 

Leu Glu Cys Val Glu Lys Leu He 
50 55 



60 
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(2) INFORMATION FOR SBQ ID NO: 488: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 567 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488: 

Met Asp Thr Ser Glu Asn Arg Pro Glu Asn Asp Val Pro Glu Pro Pro 
1 5 10 15 

Met Pro He Ala Asp Gin Val Ser Asn Asp Asp Arg Pro Glu Gly Ser 
20 25 30 

Val Glu Asp Glu Glu Lys Lys Glu Ser Ser Leu Pro Lys Ser Phe Lys 
35 40 45 

Arg Lys He Ser Val Val Ser Ala Thr Lys Gly Val Pro Ala Gly Asn 
50 55 60 

Ser Asp Thr Glu Gly Gly Gin Pro Gly Arg Lys Arg Arg Trp Gly Ala 
65 70 75 80 

Ser Thr Ala Thr Thr Gin Lys Lys Pro Ser He Ser He Thr Thr Glu 
85 90 95 

Ser Leu Lys Ser Leu He Pro Asp He Lys Pro Leu Ala Gly Gin Glu 
100 105 110 

Ala Val Val Asp Leu His Ala Asp Asp Ser Arg He Ser Glu Asp Glu 
115 120 125 

Thr Glu Arg Asn Gly Asp Asp Gly Thr His Asp Lys Gly Leu Lys He 
130 135 140 

Cys Arg Thr Val Thr Gin Val Val Pro Ala Glu Gly Gin Glu Asn Gly 
145 150 155 160 

Gin Arg Glu Glu Glu Glu Glu Glu Lys Glu Pro Glu Ala Glu Pro Pro 
165 170 175 

Val Pro Pro Gin Val Ser Val Glu Val Ala Leu Pro Pro Pro Ala Glu 
180 185 ^ 190 . 

His Glu Val Lys Lys Val Thr Leu Gly Asp Thr Leu Thr Arg Arg Ser 
195 200 205 

He Ser Gin Gin Lys Ser Gly Val Ser He Thr He Asp Asp Pro Val 
210 215 220 

Arg Thr Ala Gin Val Pro Ser Pro Pro Arg Gly Lys He Ser Asn He 
225 230 235 240 

Val His He Ser Asn Leu Val Arg Pro Phe Thr Leu Gly Gin Leu Lys 
245 250 255 

Glu Leu Leu Gly Arg Thr Gly Thr Leu Val Glu Glu Ala Phe Trp He 
260 265 270 

Asp Lys He Lys Ser His Cys Phe Val Thr Tyr Ser Thr Val Glu Glu 
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275 280 285 

Ala Val Ala Thr Arg Thr Ala Leu His Gly Val Lys Trp Pro Gin Ser 
290 295 300 

5 

Asn Pro Lys Phe Leu Cys Ala Asp Tyr Ala Glu Gin Asp Glu Leu Asp 
305 310 315 320 

Tyr His Arg Gly Leu Leu Val Asp Arg Pro Ser Glu Thr Lys The Glu 
10 325 330 335 

Glu Gin Gly lie Pro Arg Pro Leu His Pro Pro Pro Pro Pro Pro Val 
340 345 350 

15 Gin Pro Pro Gin His Pro Arg Ala Glu Gin Arg Glu Gin Glu Arg Ala 
355 360 365 



20 



35 



Val Arg Glu Gin Trp Ala Glu Arg Glu Arg Glu Met Glu Arg Arg Glu 
370 375 380 

Arg Thr Arg Ser Glu Arg Glu Trp Asp Arg Asp Lys Val Arg Glu Gly 
385 390 395 400 



Pro Arg Ser Arg Ser Arg Ser Arg Xaa Arg Arg Arg Lys Glu Arg Ala 
25 405 410 415 

Lys Ser Lys Glu Lys Lys Ser Glu Lys Lys Glu Lys Ala Gin Glu Glu 
420 425 430 

30 Pro Pro Ala Lys Leu Leu Asp Asp Leu Phe Arg Lys Thr Lys Ala Ala 
435 440 445 



Pro Cys lie Tyr Trp Leu Pro Leu Thr Asp Ser Gin lie Val Gin Lys 
450 455 460 

Glu Ala Glu Arg Ala Glu Arg Ala Lys Glu Arg Glu Lys Arg Arg Lys 
465 470 475 480 



Glu Gin Glu Glu Glu Glu Gin Lys Glu Arg Glu Lys Glu Ala Glu Arg 
40 485 490 495 

Glu Arg Asn Arg Gin Leu Glu Arg Glu Lys Arg Arg Glu His Ser Arg 
500 505 510 

45 Glu Arg Asp Arg Glu Arg Glu Arg Glu Arg Glu Arg Asp Arg Gly Asp 
515 520 525 

Arg Asp Arg Asp Arg Glu Arg Asp Arg Glu Arg Gly Arg Glu Arg Asp 
530 535 540 

50 

Arg Arg Asp Thr Lys Arg His Ser Arg Ser Arg Ser Arg Ser Thr Pro 
545 550 555 560 

Val Arg Asp Asrg Gly Gly Arg 
55 565 



(2) INFORMATICMl FOR SEQ ID NO: 489: 

60 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LEtXTTH: 51 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489: 

Gly Cys Asp Ser Cys Pro Pro His Leu Pro Arg Glu Ala Phe Ala Gin 
15 10 15 

Asp Thr Gin Ala Glu Gly Glu Cys Ser Ser Arg Ala Glu Arg Ala Asp 
20 25 30 

Met Cys Pro Asp Ala Pro Pro Ser Gin Glu Val Pro Glu Gly Pro Gly 
35 40 45 

Ala Ala Pro 
50 



(2) INFORMATION FOR SEQ ID NO: 490: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear. 

(xi) SEQUENCE DESCRIPTIC»J: SEQ ID NO: 490: 

Pro Gin Leu Pro Ser Cys Gly Arg Pro Trp Pro Gly Thr Ala Ser Val 
15 10 15 

Phe Gin Ser His Thr Gin Gly Pro Arg Glu Asp Pro Asp Pro Cys Arg 
20 25 30 

Ala Gin Gly Ser Ala Gly Thr His Cys Pro lie Ser Leu Ser Pro Pro 
35 40 45 

Arg Gin 
50 



(2) INFORMATION FOR SEQ ID NO: 491: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 491: 

Pro Gly Phe Arg Gly Pro Ser Gly Ser Leu Gly Cys Ser Phe Phe Pro 
15 10 15 

Arg Ser Leu Gly Arg Val Leu Pro Pro Gly Cys Gin Aig Pro Gly Ala 
20 25 30 

His Ala Asp Ser Ser Pro Pro Pro Thr Pro 
35 . 40 
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(2) INFORMATION FOR SEQ ID NO: 492: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 
(D> TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 492: 

Glu Asp Leu Lys Lys Pro Asp Pro Ala Ser Leu Arg Ala Ala Ser Cys 
15 10 15 

Gly Glu Gly Lys Lys Arg Lys Ala Cys Lys Asn Cys Thr .Cys Gly Leu 
20 25 30 • 

Ala Glu Glu Leu Glu Lys Glu Lys Ser Arg Glu Gin Met Ser Ser Gin 
35 40 45 

Pro Lys Ser Ala Cys Gly Asn Cys Tyr Leu Gly Asp Ala Phe Arg Cys 
50 55 60 

Ala Ser Cys Pro Tyr Leu Gly Met Pro Ala Phe Lys Pro Gly Glu Lys 
65 . 70 .75 80 

Val Leu Leu Ser 



(2) INFORMATION FOR SEQ ID NO: 493: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493: 

Glu Asp Leu Lys Lys Pro Asp Pro Ala Ser Leu Arg Ala Ala Ser Cys 
1 5 10 15 

Gly Glu Gly Lys Lys Arg Lys Ala Cys Lys Asn Cys Thr Cys Gly Leu 
20 25 30 

Ala Glu Glu Leu Glu Lys Glu Lys Ser Arg Glu Gin Met Ser Ser Gin 
35 40 45 

Pro Lys Ser Ala Cys Gly Asn Cys Tyr Leu Gly Asp Ala Phe Arg Cys 
50 55 ^0 . 

Ala Ser Cys Pro Tyr Leu Gly Met Pro Ala Phe Lys Pro Gly Glu Lys 
65 70 75 80 

Val Leu Leu Ser Asp Ser Asn Leu His Asp 
85 90 



(2) INFORMATION FOR SEQ ID NO: 494: 
(1) SEQUENCE CHARACTERISTICS: 
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- (A) LENGTH: 34 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494: 

Cys Gly Asn Cys Tyr Leu Gly Asp Ala Phe Arg Cys Ala Ser Cys Pro 
1 5 ' 10 15 

Tyr Leu Gly Met Pro Ala Phe Lys Pro Gly Glu Lys Val Leu Leu Ser 
20 25 30 . 

Asp Ser 



(2) INFORMATION FOR SEQ ID NO: 495: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 495: 

Ser Cys Gly Glu Gly Lys Lys Arg Lys Ala Cys Lys Asn Cys Thr Cys 
15 10 15 

Gly Leu Ala Glu Glu Leu Glu Lys Glu 
20 25 



(2) INFORMATION FOR SEQ ID NO: 496: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496: 

Ser Gin Pro Lys Ser Ala Cys Gly Asn Cys Tyr Leu Gly Asp Ala Phe 
1 5 10 15 

Arg Cys Ala Ser Cys 
20 



(2) INFORMATIC»« FOR SEQ ID NO: 497: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

{3d) SEQUENCE DESCRIPTION: SEQ ID NO: 497: 

Arg Glu Ala Gly Gin Asn Ser Glu Arg Gin Tyr Val Ser Leu Ser Arg 
1 5 10 15 . 



Asp 
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(2) INFORMATION FOR SEQ ID NO: 498: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498: 

Glu Ser Ser Gly Gin Ala Arg Thr Leu Ala Asp Pro Gly Pro Gly Trp 
15 10 15 

Pro Arg Gin Gin Gly Met Cys Phe Gly Ser Leu Thr Gly Leu Ser Thr 
20 25 30 

Thr Pro His Gly Phe Leu Thr Val Ser Ala Glu Ala Asp Pro Arg Leu 
35 40 45 

lie Glu Ser Leu Ser Gin Met Leu Ser Met Gly Phe Ser Asp Glu Gly 
50 55 60 

Gly Trp Leu Thr Arg Leu Leu Gin Thr Lys Asn Tyr Asp lie Gly Ala 
65 70 75 80 

Ala Leu Asp Thr lie Gin Tyr Ser Lys His 
85 90 



(2) INFORMATION FOR SEQ ID NO: 499: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499: 

Gin Glu Gly Ser Glu Pro Val Leu Leu Glu Gly Glu Cys Leu val Val 
15 10 15 

Cys Glu Pro Gly Arg Ala Ala Ala Gly Gly Pro Gly Gly Ala Ala Leu 
20 25 30 

Gly Glu Ala Pro Pro Gly Arg Val Ala Phe Xaa Ala Val Arg Ser His 
35 40 45 

His His Glu Pro Ala Gly Glu Thr Gly Asn Gly Thr Ser Gly Ala He 
50 55 60 

Tyr Phe Asp Gin Val Leu Val Asn Glu Gly Gly Gly Phe Asp Arg Ala 
65 70 75 80 

Ser Gly Ser Phe Val Ala Pro Val Arg Gly Val Tyr Ser Phe Arg Phe 
85 90 95 

His Val Val Lys Val Tyr Asn Arg Gin Thr Val Gin Val Ser Leu Met 
100 105 110 
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Leu Asn Thr Trp Pro Val lie Ser Ala Phe Ala Asn Asp Pro Asp Val 
115 120 125 

Thr Arg Glu Ala Ala Thr Ser Ser Val Leu Leu Pro Leu Asp Pro Gly 
130 135 140 

Asp Arg Val Ser. Leu Arg Leu Arg Arg Gly Xaa Ser Thr Gly Trp 
145 150 155 



(2) INFOroiATION FOR SEQ ID NO: 500: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500: 

Pro Arg Ser Arg Pro Ala Leu Arg Pro Gly Arg Gin Arg Pro Pro Ser 

1 5 . 10 15 

His Ser Ala Thr Ser Gly Val Leu Arg Pro Arg Lys Lys Pro Asp Pro 
20 25 30 



<2) INFORMATION FOR SEQ ID NO: 501: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501: 

Met Thr Leu He Thr Pro Ser Xaa Lys Leu Thr Phe Xaa Lys Gly Asn 
15 10 15 

Lys Ser Trp Ser Ser Arg Ala Cys Ser Ser Thr Leu Val Asp Pro 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 502: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY:, linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502: 

Gly His Pro Ser Pro Ala Leu Ser He Ala Pro Ser Asp Gly Ser Gin 
15 10 15 



Leu Pro Cys Asp Glu Val Pro Tyr Gly Glu Ala His Val Thr Arg Tyr 
20 25 , 30 
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Cys Lys Lys Pro Leu Thr Asn Ser His Leu Glu Thr Glu Ala Gin Ser 
35 40 45 

Ser Ser Leu 
50 



10 (2) INFOKMATION FOR SEQ ID NO: 503: • 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»I: SBQ ID NO: 503: 

20 GCTTCGTCTC CAACCCTCTT GCCCTTCGCC TGTGTGCCTG GAGCCAGTCC CACCACGCTC 60 

GCGTTTCCTC CTOrACTCCT CACAGGTCCC AGCACCGATG GCATTCCCTT TGCCCTGAGT 120 

CTCCAGCGGG TCCCTTTTGT GCTTCCTTCC CCTCAGGTAG CCTCTCTCCC CCTGGGCCAC 180 

TCCCGGGGGT GAGGGGGTTA CCCCTTCCCA GTGTTTTTTA TTCCTGTGGG GCTCACCCCA 240 

AAGTATTAAA AGTAGCTTTG TAA 263 



25 



30 



(2) INFORMATIC»r FOR SBQ ID NO: 504: 



35 ' (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504: 
GCTTCGTCTC CAACCCTCTT GCCCTTCGCC TGTGTGCCTG GAGCCAGTCC CACCACGCTC 60 
45 GCGTTTCCTC CTOTAGTGCT CACAGGTCCC AGCACCGATG GCATTCCCTT TGCCCTGAGT 120 
CTCCAGCGGG TCCCTTTTGT GCTTCCTTCC CCTCAGGTAG CCTCTCTCCC CCTGGGCCAC 180 
TCCCGGGGGT GAGGGGGTTA CCCCTTCCCA GTGTTTTTTA TTCCTGTGGG GCTCACCCCA 240 

50 

AAGTATTAAA AGTAGCTTTG TAA 263 



55 

(2) INFORMATION FOR SEQ ID NO: 505: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 base pairs 
60 (B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 505: 
GCrrCGTCTC CAACCCTCTT GCCCTTCGCC TGTGTGCCTG GAGCCAGTCC CACCACGCTC 60 
GCGTTTCCTC CTGTAGTGCT CACAC3GTCCC AGCACCGATG GCATTCCCTT TGCCCTGAGT 120 
10 CTCCAGCGC3G TCCCmTGT GCTTCCTTCC CCTCAGGTAG CCTCTCTCCC CCTGGGCCAC 180 
TCCCGGC3C3GT GAGGGGGTTA OCCCTTCCCA GTGTTTTTTA TTCCTGTGGG GCTCACCCCA 240 
AAGTATTAAA AGTAGCTTTG TAA 

15 



253 



20 



30 



35 



45 



(2) INFORMATION FOR SEQ ID NO: 506: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: dovible 
25 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506: 

TGGCTCACTG TCTTACAATC ACTCCTGTGG AATCATGATA CCACTTTTAG CTCTTTGCAT 60 

CTTCCTTCAG TOTATTTTTG TTTTTCAAGA GGAAGTAGAT TTTAACTGGA CAACTTTGAG 120 

TACTCACATC ATTGATAAAT AAACTGGCTT GTGGTTTCAA 160 



(2) INFORMATION FOR SEQ ID NO: 507: 



40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 292 amino acids 

(B) TYPE: amino acid 
(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507: 



Leu Asp Glu Leu Met Ala His Leu Thr Glu Met Gin Ala Lys Val Ala 
1 5 10 15 



Val Arg Ala Asp Ala Gly Lys Lys His Leu Pro Asp Lys Gin Asp His 
50 20 25 30 

Lys Ala Ser Leu Asp Ser Met Leu Gly Gly Leu Glu Gin Glu Leu Gin 
35 40 45 

55 Asp Leu Gly lie Ala Thr Val Pro Lys Gly His Cys Ala Ser Cys Gin 
50 55 60 



60 



Lys Pro He Ala Gly Lys Val He His Ala Leu Gly Gin Ser Trp His 
65 70 75 80 
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Pro Glu His Phe Val Cys Thr His Cys Lys Glu Glu lie Gly Ser Ser 
85 90 95 

Fro Phe Phe Glu Arg Ser Gly Leu Xaa Tyr Cys Pro Asn Asp Tyr His 
5 100 105 110 

Gin Leu Phe Ser Pro Arg Cys Ala Tyr Cys Ala Ala Pro lie Leu Asp 
115 120 125 

10 Lys Val Leu Thr Ala Met Asn Gin Thr Trp His Pro Glu His Phe Phe 
130 135 140 



15 



30 



55 



Cys Ser His Cys Gly Glu Val Phe Gly Ala Glu Gly Phe His Glu Lys 
145 150 155 160 

Asp Lys Lys Pro Tyr Cys Arg Lys Asp Phe Leu Ala Met Phe Ser Pro 
165 170 175 



Lys Cys Gly Gly Cys Asn Arg Pro Val Leu Glu Asn Tyr Leu Ser Ala 
20 180 185 190 

Met Asp Thr Val Trp His Pro Glu Cys Phe Val Cys Gly Asp Cys Phe 
195 200 205 

25 Thr Ser Phe Ser Thr Gly Ser Phe Phe Glu Leu Asp Gly Arg Pro Phe 
210 215 220 



Cys Glu Leu His Tyr His His Arg Arg Gly Thr Leu Cys His Gly Cys 
225 230 235 240 

Gly Gin Pro lie Thr Gly Arg Cys lie Ser Ala Met Gly Tyr Lys Phe 
245 250 255 



His Pro Glu His Phe Val Cys Ala Phe Cys Leu Thr Gin Leu Ser Lys 
35 260 265 270 

Gly He Phe Arg Glu Gin Asn Asp Lys Thr Tyr Cys Gin Pro Cys Phe 
275 280 285 

40 Asn Lys Leu Phe 
290 



45 (2) INFORMATION FOR SEQ ID NO: 508: 

(i) SEQUENCE CHARACTEEUSTICS : 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 
50 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508: 

Lys Ala Ser Leu Asp Ser Met Leu Gly Gly Leu Glu Gin Glu Leu Gin 
15 10 15 

Asp Leu Gly He Ala Thr Val Pro Lys Gly His Cys Ala Ser Cys Gin 
20 25 30 



Lys Pro He Ala Gly Lys Val He His Ala Leu 
60 35 40 
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(2) INFORMATION FOR SBQ ID NO: 509: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: aniino acid 
(D) TOPOLOGY: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509: 

Cys Pro Asn Asp Tyr His Gin Leu Phe Ser Pro Arg Cys Ala Tyr Cys 
15 10 15 

15 Ala Ala Pro lie Leu Asp Lys Val Leu Thr Ala Met Asn Gin Thr Trp 
20 25 30 



20 



25 



45 



50 



His Pro Glu His Phe Phe Cys Ser His Cys Gly Glu Val Phe Gly Ala 
35 40 45 



Glu Gly 
50 



(2) INFORMATION FOR SEQ ID NO: 510: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 amino acids 
30 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510: 

Asp Lys Lys Pro Tyr Cys Arg Lys Asp Phe Leu Ala Met Phe Ser Pro 
35 1 5 10 15 

Lys Cys Gly Gly Cys Asn Arg Pro Val Leu Glu Asn Tyr Leu Ser Ala 

20 ' " 25 30 

40 Met Asp Thr Val Trp His Pro Glu Cys Phe Val Cys Gly Asp Cys Phe 
35 40 45 



Thr Ser Phe Ser Thr Gly Ser Phe Phe Glu Leu Asp Gly Arg Pro Phe 
50 55 60 

Cys Glu Leu 
65 



(2) INFORMATION FOR SEQ ID NO:^ 511: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 
55 <B) TYPE: amino acid 

<D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 511: 

Cys Gly Gin Pro lie Thr Gly Arg Cys lie Ser Ala Met Gly Tyr Lys 
60 1 5 10 15 
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Phe His Pro Glu His Phe Val Cys Ala Phe Cys Leu Thr Gin Leu Ser 
20 25 30 



5 Lys Gly lie Phe Arg Glu Gin Asn Asp Lys Thr Tyr Cys Gin 
35 40 45 



10 (2) INFORMATION FOR SEQ ID NO: 512: 

(i) SEQUENCE CHARACTERISTICS: 

(A) 'LENGTH: 452 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512: 



20 



35 



50 



Met Gly Ser Ser Gin Ser Val Glu He Pro Gly Gly Gly Thr Glu Gly 
1 5 10 15 

Tyr His Val Leu Arg Val Gin Glu Asn Ser Pro Gly His Arg Ala Gly 
20 25 30 



Leu Glu Pro Phe Phe Asp Phe He Val Ser He Asn Gly Ser Arg Leu 
25 35 40 45 

Asn Lys Asp Asn Asp Thr Leu Lys Asp Leu Leu Lys Xaa Asn Val Glu 
50 55 60 

30 Lys Pro Val Lys Met Leu He Tyr Ser Ser Lys Thr Leu Glu Leu Arg 

65 70 75 80 . 



Glu Thr Ser Val Thr Pro Ser Asn Leu Trp Gly Gly Gin Gly Leu Leu 
85 90 95 

Gly val Ser He Arg Phe Cys Ser Phe Asp Gly Ala Asn Glu Asn Val 
100 105 110 



Trp His Val Leu Glu Val Glu Ser Asn Ser Pro Ala Ala Leu Ala Gly 
40 115 120 125 

Leu Arg Pro His Ser Asp Tyr He He Gly Ala Asp Thr Val Met Asn 
130 135 140 

45 Glu Ser Glu Asp Leu Phe Ser Leu He Glu Thr His Glu Ala Lys Pro 
145 150 155 160 



Leu Lys Leu Tyr Val Tyr Asn Thr Asp Thr Asp Asn Cys Arg Glu Val 
165 170 175 

He He Thr Pro Asn Ser Ala Trp Gly Gly Glu Gly Ser Leu Gly Cys 
180 185 190 



Gly He Gly Tyr Gly Tyr Leu His Arg He Pro Thr Arg Pro Phe Glu 
55 195 200 205 

Glu Gly Lys Lys He Ser Leu Pro Gly Gin Met Ala Gly Thr Pro He 
210 215 220 

60 Thr Pro Leu Lys Asp Gly. Phe Thr Glu Val Gin Leu Ser Ser Val Asn 
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225 



230 



235 



240 



Pro Pro Ser Leu Ser Pro Pro Gly thr Thr Gly He Glu Gin Ser Leu 
245 250 255 

Thr Gly Leu Ser He Ser Ser Thr Pro Pro Ala Val Ser Ser Val Leu 
260 265 270 

Ser Thr Gly Val Pro Thr Val Pro Leu Leu Pro Pro Gin Val Asn Gin 
275 280 • 285 

Ser Leu Thr Ser Val Pro Pro Met Asn Pro Ala Thr Thr Leu Pro Gly 
290 295 300 

Leu Met Pro Leu Pro Ala Gly Leu Pro Asn Leu Pro Asn Leu Asn Leu 
305 310 315 320 

Asn Leu Pro Ala Pro His He Met Pro Gly Val Gly Leu Pro Glu Leu 
325 330 335 

Val Asn Pro Gly Leu Pro Pro Leu Pro Ser Met Pro Pro Arg Asn Leu 
340 345 350 

Pro Gly He Ala Pro Leu Pro Leu Pro Ser Glu Phe Leu Pro Ser Phe 
355 360 365 

Pro Leu Val Pro Glu Ser Ser Ser Ala Ala Ser Ser Gly Glu Leu Leu 
370 375 380 

Ser Ser Leu Pro Pro Thr Ser Asn Ala Pro Ser Asp Pro Ala Thr Thr 
385 390 395 400 

Thr Ala Lys Ala Asp Ala Ala Ser Ser Leu Thr Val Asp Val Thr Pro 
405 410 415 

Pro Thr Ala Lys Ala Pro Thr Thr Val Glu Asp Arg Val Gly Asp Ser 
420 425 430 

Thr Pro Val Ser Glu Lys Pro Val Ser Ala Ala Val Asp Ala Asn Ala 
435 440 445 



Ser Glu Ser Pro 
450 



(2) INFORMATION FOR SEQ ID NO: 513: 

(i) SEQUENCE CHARACTERISTICS: 

. (A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513: 

Ser Val Glu He Pro Gly Gly Gly Thr Glu Gly Tyr His Val Leu Arg 
1 5 ■ 10 15 



Val Gin Glu Asn Ser Pro Gly His Arg Ala Gly Leu Glu Pro Phe Phe. 
20 25 30 
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Asp Phe He Val Ser lie Asn Gly Ser Arg Leu Asn Lys Asp Asn Asp 
35 40 • 45 

Thr Leu Lys Asp Leu Leu Lys Xaa Asn Val Glu Lys Pro Val Lys Met 
50 55 60 

Leu He Tyr Ser Ser Lys Thr Leu Glu Leu Arg Glu Thr Ser Val Thr 
65 70 75 80 

Pro Ser Asn Leu Trp Gly Gly Gin Gly Leu Leu Gly Val Ser He Arg 
85 90 95 

Phe Cys Ser Phe Asp Gly Ala Asn Glu Asn Val Trp His 
100 105 



(2) INFORMATION FOR SEQ ID NO: 514: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acic3s 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE. DESCRIPTION: SEQ ID NO: 514: 

Glu Ser Asn Ser Pro Ala Ala Leu Ala Gly Leu Arg Pro His Ser Asp 
15 10 15 

Tyr He He Gly Ala Asp Thr Val Met Asn Glu Ser Glu Asp Leu Phe 
20 .25 30 

Ser Leu He Glu Thr His Glu Ala Lys Pro Leu Lys Leu Tyr Val Tyr 
35 40 45 

Asn Thr Asp Thr Asp Asn Cys Arg Glu Val He He Thr Pro Asn Ser 
50 55. 60 

Ala Trp Gly Gly Glu Gly Ser Leu Gly Cys Gly He Gly Tyr Gly Tyr 
65 .70 75 80 

Leu His Arg He Pro Thr Arg Pro Phe Glu Glu Gly Lys Lys He Ser 
85 90 95 

Leu Pro Gly Gin Met Ala Gly Thr Pro He Thr Pro Leu Lys Asp Gly 
100 105 110 

Phe Thr Glu Val Gin Leu Ser Ser Val Asn Pro Pro Ser Leu Ser Pro 
115 120 125 

Pro Gly Thr Thr Gly He Glu Gin Ser Leu Thr Gly Leu Ser He Ser 
130 135 140 

Ser 
145 



(2) INFORMATION FOR SEQ ID NO: 515: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 145 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515: 

Glu Ser Asn Ser Pro Ala Ala Leu Ala Gly Leu Arg Pro His Ser Asp 

15 10 15 

Tyr lie He Gly Ala Asp Thr Val Met Asn Glu Ser Glu Asp Leu Phe 
20 25 30 

Ser Leu He Glu Thr His Glu Ala Lys Pro Leu Lys Leu Tyr Val Tyr 
35 40 45 

Asn Thr Asp Thr Asp Asn Cys Arg Glu Val He He Thr Pro Asn Ser 
50 . 55 60 

Ala Trp Gly Gly Glu Gly Ser Leu Gly Cys Gly He Gly Tyr Gly Tyr 
65 70 75 80 

Leu His Arg He Pro Thr Arg Pro Phe Glu Glu Gly Lys Lys He Ser 
85 90 95 

Leu Pro Gly Gin Met Ala Gly Thr Pro He Thr Pro Leu Lys Asp Gly 
100 105 110 

Phe Thr Glu Val Gin Leu Ser Ser Val Asn Pro Pro Ser. Leu Ser Pro 
115 120 125 

Pro Gly Thr Thr Gly He Glu Gin Ser Leu Thr Gly Leu Ser He Ser 
130 135 140 

Ser 
145 



(2) INFORMATION FOR SEQ ID NO: 516: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516: 

Arg He Pro Thr Arg Pro Phe Glu Glu Gly Lys Lys He Ser Leu Pro 
15 10 15 

Gly Gin Met Ala Gly Thr Pro He Thr Pro Leu Lys Asp Gly Phe Thr 
20 25 30 

Glu Val Gin Leu Ser Ser Val Asn Pro Pro Ser Leu Ser Pro Pro Gly 
35 40 45 

Thr Thr Gly He Glu Gin Ser Leu Thr Gly Leu Ser He Ser Ser Thr 
50 55 60 



Pro Pro iUa Val Ser Ser Val Leu Ser Thr Gly Val Pro Thr Val Pro. 
65 70 75 80 
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Leu Leu Pro Pro Gin Val Asn Gin Ser Leu Thr Ser Val Pro Pro Met 
85 90 95 

^Asn Pro Ala Thr Thr Leu Pro Gly Leu Met Pro Leu Pro Ala Gly Leu 
100 105 110 

Pro Asn Leu Pro Asn Leu Asn Leu Asn Leu Pro Ala Pro His lie Met 
115 120 125 

Pro Gly Val Gly Leu Pro Glu Leu Val Asn Pro Gly Leu Pro Pro Leu 
130 135 . 140 



Pro Ser Met Pro Pro Arg Asn 
145 150 



(2) INFORMATION FOR SEQ ID NO: 517: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517: 

Pro Gly Leu Pro Pro Leu Pro Ser Met Pro Pro Arg Asn Leu Pro Gly 
15 10 15 

lie Ala Pro Leu Pro Leu Pro Ser Glu Phe Leu Pro Ser Phe Pro Leu 
20 25 30 

Val Pro Glu Ser Ser Ser Ala Ala Ser Ser Gly Glu Leu Leu Ser Ser 
35 40 45 

Leu Pro Pro Thr Ser Asn Ala Pro Ser Asp Pro Ala Thr Thr Thr Ala 
50 55 60 

Lys Ala Asp Ala Ala Ser Ser Leu Thr Val Asp Val Thr Pro Pro Thr 
65 70 75 80 

■ Ala Lys Ala Pro Thr Thr Val Glu Asp Arg Val Gly Asp Ser Thr Pro 
85 90 95 

Val Ser Glu Lys Pro Val Ser Ala Ala Val Asp Ala Asn 
100 105 



(2) INFORMATION FOR SEQ ID NO: 518: 

(i) SEQUENCE CHARACTEEaSTICS : 

(A) LS3GTK: 93 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518: 

lie Tyr Lys Val Phe Arg His Thr Ala Gly Leu Lys Pro Glu Val Ser 
1 5 10 15 ^ 

Cys Phe Glu Asn He Arg Ser Cys Ala Arg Xaa Xaa Xaa Xaa Xaa Xaa 
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20 25 30 

Xaa Xaa Xaa Xaa Xaa Xaa Trp lie Phe Gly Val Leu His Val Val His 
35 40 45 

5 

Ala Ser Val Val Thr Ala Tyr Leu Phe Thr Val Ser Asn Ala Phe Gin 
50 55 60 

Gly Met Phe lie Phe Leu Phe Leu Cys Val Leu Ser Arg Lys lie Gin 
10 65 70 75 80 

Glu Glu Tyr Tyr Arg Leu Phe Lys Asn Val Pro Cys Cys 
85 90 



15 



(2) INFORMATION FOR SEQ ID NO: 519: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519: 

25 Trp He Phe Gly Val Leu His Val Val His Ala Ser Val Val Thr Ala 
15 10 15 

Tyr Leu Phe Thr Val Ser Asn Ala Phe Gin Gly Met Phe He Phe Leu 
20 25 30 

30 

Phe Leu Cys Val Leu Ser Arg Lys He Gin Glu Glu Tyr Tyr Arg Leu 
35 40 45 

Phe Lys Asn Val Pro Cys Cys 
35 50 55 



(2) INFORMATION FOR SEQ ID NO: 520: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520: 

Ala Leu Thr Arg He Pro Pro Gly Asp Trp Val He Asn Val Thr Ala 
1 5 10 15 

SO Val Ser Phe Ala Gly Lys Thr Thr Ala Arg Phe Phe Xaa His Ser Ser 
20 25 30 

Pro Pro Ser Leu Gly Asp Gin Ala Arg Thr Asp Pro Gly His Gin Arg 
35 40 45 

55 

Arg Asp 
50 



60 
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(2) INFORMATION FOR SEQ ID NO: 521: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521: 

Leu Gin Glu Val Asn He Thr Leu Pro Glu Asn Ser Val Trp Tyr Glu 
15 10 15 

Arg Tyr Lys Phe Asp He Pro Val Phe His Leu 
20 25 



(2) INFORMATION FOR SEQ ID NO: 522: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

<B) TYPE: amino acid 

(D) TOPOU)GY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522: 

Met Gin Gly Ser Gly Ser Gin Phe Arg Ala Cys Leu Leu Cys Leu Cys 
15 10 15 

Phe Ser Cys Pro Cys Ser Pro Gly Gly Pro Arg Trp Asn Ser Arg Gin 
20 25 30 

Gly Gly Arg Arg Phe Pro Lys Thr Cys Arg Ala He Ser Gin Asn Leu 
35 40 45 

Val Phe Lys Tyr Lys Thr Phe Cys Pro Val Arg Tyr Met Gin Pro His 
50 55 . 60 

Arg Ser Ser Leu Cys Leu His Phe Thr Ser Tyr Val Phe He Leu Ser 
65 70 * 75 80 

Thr Trp Gly Ser Leu Arg Thr Tyr Ser Thr Asp Leu Lys Lys Lys Lys 
85 90 95 

Lys Asn Ser Arg Gly Gly Pro Val Pro He Arg Pro Lys Ser 
100 105 110 



(2) INFORMATION FOR SEQ ID NO: 523: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523: 
TAGCATGTAG CCAGTCGAAT AACNIATAAG GACAAAGTGG AGTCCACX3CG 
TAGACTAGTG GATCCCCCX3G CTGCAGGATT CGGCACGAG 
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(2) INFORMATION FOR SEQ ID NO: 524: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51. amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524: 

Met Gin Gly Ser Gly Ser Gin Phe Arg Ala Cys Leu Leu Cys Leu Cys 
15 10 15 

Phe Ser Cys Pro Cys Ser Pro Gly Gly Pro Arg Trp Asn Ser Arg Gin 
20 25 30 

Gly Gly Arg Arg Phe Pro Lys Thr Cys Arg Ala He Ser Gin Asn Leu 
35 40 45 

Val Phe Lys 
50 



(2) INFORMATION FOR SEQ ID NO: 525: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acicSs 

(B) TYPE: amino acid 
(D) T0P0LCX3Y: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525: 

Pro Val Arg Tyr Met Gin Pro His Arg Ser Ser Leu Cys Leu His Phe 
15 10 15 

Thr Ser Tyr Val Phe He Leu Ser Thr Trp Gly Ser Leu Arg Thr Tyr 
20 25 30 

Ser Thr Asp Leu Lys Lys Lys Lys Lys Asn Ser Arg Gly Gly Pro Val 
35 40 45 

Pro He Arg Pro Lys Ser 
50 



(2) INFORMATi™ FOR SEQ ID NO: 526: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid " 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526: 

Gly Glu Glu Gin Arg Asp Cys Ser Leu Gly Trp Arg Gly Val Gly Met 
1 5 10 15 ' 



Arg Ala Thr His Cys Gin Ala Ala Arg Met Phe Val Leu Phe Ser Leu 
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20 25 30 



Pro Lys Tyr Ala Gly Leu 
35 



(2) INPORMATIW FOR SEQ ID ^K): 527: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527: 

Met Pro Arg Lys Thr Ser Lys Cys Arg Gin Leu Leu Cys Ser Gly Ala 
15 10 15 

Ser Arg Asn Ala Asp Thr Ala Ala Arg Gin Ser Thr Cys Ser Ser His 
20 25 30 

Arg Pro Pro Gly Lys He Pro Ser Leu Gly Pro Arg Arg Xaa Pro Gly 
35 40 45 

Cys Xaa Ser Val Pro Ser Ser Arg Gly Glu Gin Ser Thr Gly Ser Pro 
50 55 60 

Ala Ala Pro Arg Cys Gly Arg Arg Asp Ala His Arg Gly Leu Pro Gly 
65 70 75 80 

Gly Ala Ala Met Thr Pro Gly Asp Thr Trp Ala Ser Phe Asn Pro Arg 
85 90 95 

Ala Gly His Ser Lys Ser Gin Gly Glu Gly Gin Glu Ser Ser Gly Ala 
100 105 110 

Ser Arg Gin Asp Arg His Pro Val Ser His Trp Val Glu Arg Gin Arg 
115 * * 120 125 

Glu Ala Trp Gly Ala Pro Arg Ser Ser Ser Ala Gly Gly Val Lys Val 
130 135 140 

Ala Ala Thr Thr Glu Arg Glu Pro Glu Phe Lys He Lys Thr Gly Lys 
145 150 155 160 

Ala 



(2) INFORMATION FOR SEQ ID NO: 528: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528: 



Cys Ser Gly Ala Ser Arg Asn Ala Asp Thr Ala Ala Arg Gin Ser Thr 
15 10 15 
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Cys Ser Ser His Arg Pro Pro Gly Lys lie Pro Ser Leu Gly Pro Arg 
20 25 30 

Arg Xaa Pro Gly Cys Xaa Ser Val Pro Ser Ser Arg Gly Glu Gin Ser 
35 40 45 

Thr Gly Ser Pro Ala Ala Pro Arg Cys Gly Arg Arg Asp Ala His Arg 
50 55 60 

Gly Leu Pro Gly Gly Ala Ala Met Thr Pro Gly Asp Thr Trp Ala Ser 
65 70 75 80 

Phe Asn Pro Arg Ala Gly His Ser 
85 



(2) INFOFMATIGN FOR SEQ ID NO:. 529: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529: 

Gin Gly Glu Gly Gin Glu Ser Ser Gly Ala Ser Arg Gin Asp Arg His 
1 5 10 15 

Pro Val Ser His Trp Val Glu Arg Gin Arg Glu Ala Trp Gly Ala Pro 
20 25 30 

Arg Ser Ser Ser Ala Gly Gly Val Lys Val Ala Ala Thr Thr Glu Arg 
35 40 45 

Glu Pro Glu Phe Lys lie Lys Thr Gly Lys Ala 
50 55 



(2) INFORMATION FOR SEQ ID NO: 530: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 530: 

Met Ser Pro Arg Tyr Pro Gly Gly Pro Arg Pro Pro Leu Arg He Pro 
1 5 10 15 

Asn Gin Ala Leu Gly Gly Val Pro Gly Ser Gin Pro Leu Leu Pro Ser 
20 25 30 

Gly Met Asp Pro Thr Arg Gin Gin Gly His Pro Asn Met Gly Gly Pro 
35 40 45 

Met Gin Arg Met Thr Pro Pro Arg Gly Met Val Pro Leu Gly Pro Gin. 
50 55 60 
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Asn Tyr Gly Gly Ala Met Arg Pro Pro Leu Asn Ala Leu Gly Gly Pro 
65 70 75 80 

Gly Met Pro Gly Met Asn Met Gly Pro Gly Gly Gly Arg Pro Trp Pro 
5 85 90 95 

Asn Pro Thr Asn Ala Asn Ser He Pro Tyr Ser Ser Ala Ser Pro Gly 
100 105 110 

10 Asn Tyr Val Gly Pro Pro Gly Gly Gly Gly Pro Pro Gly Thr Pro He 
115 120 125 

Met Pro Ser Pro Ala Asp Ser Thr Asn Ser Gly Asp Asn Met Tyr Thr 
130 135 140 

15 

Leu Met Asn Ala Val Pro Pro Gly Pro Asn Arg Pro Asn Phe Pro Met 
145 150 155 160 

Gly Pro Gly Ser Asp Gly Pro Met Gly Gly Leu Gly Gly Met Glu Ser 
20 165 170- 175 

His His Met Asn Gly Ser Leu Gly Ser Gly Asp Met Asp Ser He Ser 
180 185 190 

25 Lys Asn Ser Pro Asn Asn Met Ser Leu Ser Asn Gin Pro Gly Thr Pro 
195 200 205 

Arg Asp Asp Gly Glu Met Gly Gly Asn Phe Leu Asn Pro Phe Gin Ser 
210 215 220 

30 

Glu Ser Tyr Ser Pro Ser Met Thr Met Ser Val 
225 230 235 



35 

(2) INFORMATION FOR SEQ ID NO: 531: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 
40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531: 

Met Ser Pro Arg 4)ft. Pro Gly Gly Pro Arg Pro Pro Leu Arg He Pro 
45 1 5 10 15 

Asn Gin Ala Leu Gly Gly Val Pro Gly Ser Gin Pro Leu Leu Pro Ser 
20 25 30 

50 Gly Met Asp Pro Thr Arg Gin Gin Gly His Pro Asn Met Gly Gly Pro 
35 40 45 

Met Gin Arg Met Thr Pro Pro Arg Gly Met Val Pro Leu Gly Pro Gin 
50 55 60 

55 

Asn Tyr Gly Gly Ala Met Arg Pro Pro Leu Asn Ala Leu Gly Gly Pro 
65 70 75 .80 

Gly Met Pro Gly Met Asn Met Gly Pro Gly Gly Gly Arg Pro Trp Pro 
60 85 90 95 
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Asn Pro Thr Asn Ala Asn Ser lie Pro Tyr Ser Ser Ala Ser Pro Gly 
100 105 110 



Asn Tyr 



(2) INFORMATION FOR SEQ ID NO: 532: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTTH: 81 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 532: 

Leu Asn Ala Leu Gly Gly Pro Gly Mfet Pro Gly Met Asn Met Gly Pro 
15 10 15 

Gly Gly Gly Arg Pro Trp Pro Asn Pro Thr Asn Ala Asn Ser He Pro 
20 25 30 

Tyr Ser Ser Ala Ser Pro Gly Asn Tyr Val Gly Pro Pro Gly Gly Gly 
35 40 45 

Gly Pro Pro Gly Thr Pro He Met Pro Ser Pro Ala Asp Ser Thr Asn 
50 55 60 

Ser Gly Asp Asn Met Tyr Thr Leu Met Asn Ala Val Pro Pro Gly Pro 
65 70 75 80 

Asn 



(2) INFORMATION FOR SEQ ID NO: 533: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533: 

Gly Pro Met Gly Gly Leu Gly Gly Met Glu .Ser His His Mfet Asn Gly 
1 5 . 10 15 

Ser Leu Gly Ser Gly Asp Met Asp Ser He Ser Lys Asn Ser Pro Asn 
20 25 30 

Asn Met Ser Leu Ser Asn Gin Pro Gly Thr Pro Arg Asp Asp Gly Glu 
35 40 45 

Met Gly Gly Asn Phe Leu Asn Pro Phe Gin Ser Glu Ser Tyr Ser Pro 
50 55 60 



Ser Met Thr Met Ser Val 
65 70 
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(2) INFORMATION FOR SEQ ID NO: 534; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LE^3GTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534: 

Thr Cys Glu His Ser Ser Glu Ala Lys Ala Phe His Asp Tyr 
15 10 



(2) INFORMATION FOR SEQ ID NO: 535: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 535: 

Gin Ala Phe Val Leu Leu Ser Asp Leu Leu Leu He Phe Ser Pro Gin 
1.5 10 15 

Met He Val Gly Gly Arg Asp Phe Leu Arg Pro Leu Val Phe Phe Pro 
20 25 • 30 

Glu Ala Thr Leu Gin Ser Glu Leu Ala Ser Phe Leu Met Asp His Val 
35 40 45 

Phe He Gin Pro Gly Asp Leu Gly Ser Gly Ala 
50 55 



(2) INFORMATION FOR SEQ ID NO: 536: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 536: 

Ala Cys Ser Tyr Leu Leu Cys Asn Pto Glu Phe Thr Phe Phe Ser Arg 
1 5 10 15 • 

Ala Asp Phe Ala Arg Ser Gin Leu Val Asp Leu Leu Thr Asp Arg Phe 
20 25 30 

Gin Gin Glu. Leu Glu Glu Leu Leu Gin Val Gly 
35 40 . 



(2) INFORMATION FOR SEQ ID NO: 537: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537: 

Gin Lys Gin Leu Ser Ser Leu Arg Asp Arg Met Val Ala Phe Cys Glu 
1 5 10 15 

Leu Cys Gin Ser Cys Leu Ser Asp Val Asp Thr Glu lie Gin Glu Gin 
20 25 30 

Val Ser Thr 
35 



(2) INFORMATICaJ FOR SEQ ID NO: 538: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LE23GTH: 27 amino acids 

{B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 538: 

Gin Val lie Leu Pro Ala Leu Thr Leu Val Tyr Phe Ser lie Leu Trp 
15 10 15 

Thr Leu Thr His He Ser Lys Ser Asp Ala Ser 
20 25 



(2) INFORMATICS^ FOR SEQ ID NO: 539: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»J : SEQ ID NO: 539: 

Ser Thr His Asp Leu Thr Arg Trp Glu Leu Tyr Glu Pro Cys Cys Gin 
15 10 15 

Leu Leu Gin Lys Ala Val Asp Thr Gly Xaa Val Pro His Gin Val 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 540: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC2N: SEQ ID NO: 540: 

Leu Ala val Ser Thr Ser Phe He Cys Cys Ala Asp He Ser Thr Ala 
15 10 15 



Leu Pro Leu Gly Ser Ser Arg Pro Ala Pro Ala Pro Arg His Arg Glu 
20 25 30 
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His Glu His Gly His Gin Ala Arg Pro Pro Arg Leu Leu Xaa Thr Ser 
35 40 45 

Leu Met Pro Leu Ser Thr Pro Ala Ala Ala Gin Leu Leu Trp Thr Gin 
50 55 60 

Leu Thr Pro Met Gly Gly Arg Pro Gly Gly Arg His Ser Pro Pro Thr 
65 70 75 80 

Leu His Thr Gly Pro Arg Ala Leu Pro Pro Gly Pro Pro His Pro Ser 
85 90 95 

Leu His Val Ala Ala Leu Ser Leu Leu Arg 
100 105 



(2) INFORMATION FOR SEQ ID NO: 541: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: . SEQ ID NO: 541: 

Glu Gin Val. Leu Ala Leu Leu Trp Pro Arg Phe Glu Leu lie Leu Glu 
1 5 10 15 

Met Asn Val Gin Ser Val Arg Ser Thr Asp Pro Gin Arg Leu Gly Gly 
20 25 30 

Leu Asp Thr Arg Pro His Tyr lie Thr Arg Arg Tyr Ala Glu Phe Ser 
35 40 45 

Ser Ala Leu Val Ser lie Asn Gin Thr lie Pro Asn Glu Arg Thr Met 
50 55 60 

Gin Leu Leu Gly Gin Leu Gin Val Glu Val Glu Asn Phe Val Leu Arg 
65 70 75 80 

Val Ala Ala Glu Phe Ser Ser Arg Lys Glu Gin Leu Val Phe Leu lie 
85 90 95 

Asn Asn Tyr Asp Met Met Leu Gly Val Leu Met Glu Arg Ala Ala Asp 
100 105 110 

Asp Ser. Lys Glu Val Glu Ser Phe Gin Gin Leu Leu Asn Ala Arg Thr 
115 120 125 

Gin Glu Phe lie Glu Glu Leu Leu Ser Pro Pro Phe Gly Gly Leu Val 
130 135 140 

Ala Phe Val Lys Glu Ala Glu Ala Leu lie Glu Arg Gly Gin Ala Glu 
145 150 155 160 

Arg Leu Arg Gly Glu Glu Ala Arg Val Thr Gin Leu lie Arg Gly Phe 
165 170 175 



Gly Ser Ser Trp Lys Ser Ser Val Glu Ser Leu Ser Gin Asp Val Met 
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180 185 190 

Arg Ser Phe Thr Asn Phe Arg Asn Gly Thr Ser lie lie Gin Gly 
195 200 205 

5 



(2) INFORMATION FOR SEQ ID NO: 542: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542: 

15 

Ala Leu Leu Lys Tyr Arg Phe Phe Tyr Gin Phe Leu Leu Gly Asn Glu 

1 5 10 . 15 

Arg Ala Thr Ala Lys Glu lie Arg Asp Glu Tyr Val Glu Thr Leu Ser 
20 .20 25 . 30 

Lys lie Tyr Leu Ser Tyr Tyr Arg Ser Tyr Leu Gly Arg Leu Met Lys 
35 40 45 

25 I Val Gin Tyr Glu Glu Val Ala Glu Lys Asp Asp Leu Met Gly Val Glu 
50 55 60 



30 



Asp Thr Ala Lys Lys Gly Phe Xaa Ser Lys Pro Ser Leu Arg Ser Arg 
65 70 75 80 

Asn Thr lie Phe Thr Leu Gly Thr Arg Gly Ser Val lie Ser Pro Thr 
85 90 95 



Glu Leu Glu Ala Pro lie Leu Val Pro His Thr Ala Gin Arg 
35 100 105 110 



40 



(2) INFORMATICaJ FOR SEQ ID NO: 543: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543: 

Glu Gin Arg Tyr Pro Phe Glu Ala Leu Phe Arg Ser Gin His Tyr Xaa 
15 10 15 

50 Leu Leu Asp Asn Ser Cys Arg Glu Tyr Leu Phe He Cys Glu Phe Phe 
20 25 30 

Val Val Ser Gly Pro Xaa Ala His Asp Leu Phe His Ala Val Met Gly 
35 40 45 

55 

Arg Thr Leu Ser Met Thr Leu Lys His Leu Asp Ser Tyr Leu Ala Asp 
50 55 60 ■ 

Cys Tyr Asp Ala He Ala Val Phe Leu Cys He His He Val Leu Arg 
60 65 70 75 80 
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Phe Arg Asn lie Ala Ala Lys Arg Asp Val Pro Ala Leu Asp Arg Tyr 
85 90 95 



Trp 



(2) INFORMATION FOR SEQ ID NO: 544: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: jamino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544: 

Gly Gly Leu Asp Thr Arg Pro His Tyr He Thr Arg Arg Tyr Ala Glu 
1 5 10 15 

Phe Ser Ser Ala Leu Val Ser He Asn Gin 
20 25 



(2) INFORMATION FOR SEQ ID NO: 545: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 545: 

Ser Arg Lys Glu Gin Leu Val Phe Leu He Asn Asn Tyr Asp Met Met 
1 5 10 15 

Leu Gly Val Leu 
20 



(2) INFORMATION FOR SEQ ID NO: 546: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 546: 

Ala Leu Leu Lys Tyr Arg Phe Phe Tyr Gin Phe Leu Leu Gly Asn Glu 
1 5 10 IS 

Arg Ala Thr Ala Lys Glu He Arg Asp Glu Tyr Val Glu Thr Leu Ser 
20 25 30 

Lys He Tyr Leu Ser Tyr Tyr Arg Ser Tyr Leu Gly Arg Leu Met Lys 
35 40 45 

Val Gin Tyr Glu Glu Val Ala Glu Lys Asp Asp Leu Met Gly Val Glu 
50 55 60 
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Asp Thr Ala Lys Lys Gly Phe Xaa Ser Lys Pre Ser Le*- Arg Ser Arg 
65 70 75 5: 

5 Asn Thr lie Phe Thr Leu Gly Thr Arg Gly Ser Val He Ser ?rs Tr^ 
85 90 r5 



10 



25 



40 



55 



Glu Leu Glu Ala Pro He Leu Val Pro His Thr Ala Glr. Arg Zaa 31*- 
100 105 liO 

Gin Arg Tyr Pro Phe Glu Ala Leu Phe Arg Ser Gin His Tyr Zaa I^u 
115 120 125 



Leu Asp Asn Ser Cys Arg Glu Tyr Leu Phe He Cys Gl- Phe ?r.e Val 
15 130 135 140 

Val Ser Gly Pro Xaa Ala His Asp Leu Phe His Ala Val Ket Gly A^g 
145 150 155 is: 

20 Thr Leu Ser Met Thr Leu Lys His Leu Asp Ser T^-r Le- A-la As? Cys 
165 170 1'5 



Tyr Asp Ala He Ala Val Phe Leu Cys He His He Val 1-eu Arg Phe 
180 185 130 

Arg Asn He Ala Ala Lys Arg Asp Val Pro Ala Lstjl Asp Arg Z'rr Trp 
195 200 2C5 



Glu Gin Val Leu Ala Leu Leu Trp Pro Arg Phe Glu Leu He Lsu 31'- 
30 210 215 ' 220 

Met Asn Val Gin Ser Val Arg Ser Thr Asp Pre Gin Arg l«u Gly Sly 
225 230 235 241 

35 Leu Asp Thr Arg Pro His Tyr He Thr Arg Arg Tyr Ala Glu P'r.s Ser 
245 250 2=5 



Ser Ala Leu Val Ser He Asn Gin Thr He Pro Asn Glu Arg rrjr Y.ez 
260 265 170 

Gin Leu Leu Gly Gin Leu Gin Val Glu Val Glu Asn Phe Val Leu Arg 
275 280 235 



Val Ala Ala Glu Phe Ser Ser Arg Lys Glu Gin Leu Val Phe Leu He 
45 290 295 330 

Asn Asn Tyr Asp Met Met Leu Gly Val Leu Mec Glu Arg A_la Ala Asp 
305 310 315 320 

50 Asp Ser Lys Glu Val Glu Ser Phe Gin Gin Leu Leu Asn Ala Arg llsr 
325 330 325 



Gin Glu Phe He Glu Glu Leu Leu Ser Pro Pro Phe Gly Gly Leu Val 
340 345 350 

Ala Phe Val Lys Glu Ala Glu Ala Leu He Glu Arg Gly Gin Ala Glu 
355 360 365 



Arg Leu Arg Gly Glu Glu Ala Arg Val Thr Gin Leu He Arg Gly Phe 
60 370 375 330 
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Gly Ser Ser Trp Lys Ser Ser Val Glu Ser Leu Ser Gin Asp Val Met 
385 390 395 400 

Arg Ser Phe Thr Asn Phe Arg Asn Gly Thr Ser 
405 ' 410 



(2) INFORMATION FOR SEQ ID NO: 547: 

(i) SEQUENCE CHARACTERISTICS: • 

(A) LENGTH: 303 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547: 

Tyr Glu Gly Lys Glu Phe Asp Tyr Val Phe Ser lie Asp Val Asn Glu 
1 5 • 10 15 

Gly Gly Pro Ser Tyr Lys Leu Pro Tyr Asn Thr Ser Asp Asp Pro Trp 
20 25 30 

Leu Thr Ala Tyr Asn Phe Leu Gin Lys Asn Asp Leu Asn Pro Met Phe 
35 40 45 

Leu Asp Gin Val Ala Lys Phe He He Asp Asn Thr Lys Gly Gin Met 
50 55 60 . 

Leu Gly Leu Gly Asn Pro Ser Phe Ser Asp Pro Phe Thr Gly Gly Gly 

65 70 - 75 . 80 

Arg Tyr Val Pro Gly Ser Ser Gly Ser Ser Asn Thr Leu Pro Thr Ala 
85 90 95 

Asp Pro Phe Thr Gly Ala Gly Arg Tyr Val Pro Gly Ser Ala Ser Met 
100 ^ . 105 110 

Gly Thr Thr Met Ala Gly Val Asp Pro Phe Thr Gly Asn Ser Ala Tyr 
115 120 125 

Arg Ser Ala Ala Ser Lys Thr Met Asn He Tyr Phe Pro Lys Lys Glu 
130 135 140 

Ala Val Thr Phe Asp Gin Ala Asn Pro Thr Gin He Leu Gly Lys Leu 
145 150 155 160 

Lys Glu Leu Asn Gly Thr Ala Pro Glu Glu Lys Lys Leu Thr Glu Asp 
165 170 175 

Asp Leu He Leu Leu Glu Lys He Leu Ser Leu He Cys Asn Ser Ser 
180 185 190 

Ser Glu Lys Pro Thr Val Gin Gin Leu Gin He Leu Trp Lys Ala He 
195 200 205 

Asn cys Pro Glu Asp He Val Phe Pro Ala Leu Asp He Leu Arg Leu 
210 215 220 



Ser He Lys His Pro Ser Val Asn Glu Asn Phe Cys Asn Glu Lys Glu 
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225 230 235 240 

Gly Ala Gin Phe Ser Ser His Leu lie Asn Leu Leu Asn Pro Lys Gly 
245 250 255 

Lys Pro Ala Asn Gin Leu Leu Ala Leu Arg Thr Phe Cys Asn Cys Phe 

260 265 270 

Val Gly Gin Ala Gly Gin Lys Leu Met Met Ser Gin Arg Glu Ser Leu 
275 280 285 

Met Ser His Ala lie Glu Leu Lys Ser Gly Ser Asn Lys Asn He 
290 295 300 



(2) INFORMATION FOR SEQ ID NO: 548: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»J : SEQ ID NO: 548: 

His He Ala Leu Ala Thr Leu Ala Leu Asn Tyr Ser Val Cys Phe His 
15 10 15 

Lys Asp 



(2) INFORMATION FOR SEQ ID NO: 549: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549: 

His Asn He Glu Gly Lys Ala Gin Cys Leu Ser Leu He Ser Thr He 
1 5 10 15 

Leu Glu Val Val Gin Asp Leu Glu Ala Thr Phe Arg Leu Leu Val Ala 
20 25 30 

Leu Gly Thr Leu He Ser Asp Asp Ser Asn Ala Val Gin Leu Ala Lys 
35 40 45 

Ser 



(2) INFORMATIC»i FOR SEQ ID NO: 550: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550: 

Leu Gly Val Asp Ser Gin He Lys Lys Tyr Ser Ser Val Ser Glu Pro 
15 10 15 

Ala* Lys Val Ser Glu Cys Cys Arg Phe He Leu Asn Leu Leu 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 551: 

y 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551: 

Tyr Glu Gly Lys Glu Phe Asp Tyr Val Phe Ser He Asp Val Asn Glu 
15 10 15 

Gly Gly Pro Ser Tyr Lys Leu Pro Tyr Asn Thr Ser Asp Asp Pro Trp 
20 25 30 

Leu Thr Ala Tyr Asn Phe Leu Gin Lys Asn Asp Leu Asn Pro Mfet Phe 
35 40 45 

Leu Asp Gin Val Ala Lys Phe He He Asp Asn Thr Lys Gly Gin Mfet 
50 55 60 

Leu Gly Leu Gly Asn Pro Ser Phe Ser Asp Pro Phe Thr Gly Gly Gly 
65 70 75 80 

Arg Tyr Val Pro Gly Ser Ser Gly Ser Ser Asn Thr Leu Pro Thr Ala 
85 90 95 

Asp Pro Phe Thr Gly Ala G^y Arg Tyr Val Pro Gly Ser Ala Ser Met 
100 105 110 

Gly Thr Thr Met Ala Gly Val Asp Pro Phe Thr Gly Asn Ser Ala Tyr 
115 120 125 

Arg Ser Ala Ala Ser Lys Thr Met Asn He Tyr Phe Pro Lys Lys Glu 
130 135 140 

Ala Val Thr Phe Asp Gin Ala Asn Pro Thr Gin He Leu Gly Lys Leu 
145 150 155 160 

Lys Glu Leu Asn Gly Thr Ala Pro Glu Glu Lys Lys Leu Thr Glu Asp 
165 170 175 

Asp Leu He Leu Leu Glu Lys He Leu Ser Leu He Cys Asn Ser Ser 
180 185 190 

Ser Glu Lys Pro Thr Val Gin Gin Leu Gin He Leu Trp Lys Ala He 
195 200 205 

Asn Cys Pro Glu Asp He Val Phe Pro Ala Leu Asp He Leu Arg Leu 
210 215 220 
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Ser He Lys His Pro Ser Val Asn Glu Asn Phe Cys Asn Glu Lys Glu 
225 230 235 240 

Gly Ala Gin Phe Ser Ser His Leu He Asn Leu Leu Asn Pro Lys Gly 
5 245 250 255 

Lys Pro Ala Asn Gin Leu Leu Ala Leu Arg Thr Phe Cys Asn Cys Phe 
260 265 270 

10 Val Gly Gin Ala Gly Gin Lys Leu Met Met Ser Gin Arg Glu Ser Leu 
275 280 285 

Met Ser His Ala He Glu Leu Lys Ser Gly Ser Asn Lys Asn He His 
290 295 300 

15 

He Ala Leu Ala Thr Leu Ala Leu Asn Tyr Ser Val Cys Phe His Lys 
305 310 315 320 

Asp His Asn He Glu Gly Lys Ala Gin Cys Leu Ser Leu He Ser Thr 
20 325 330 335 

He Leu Glu Val Val Gin Asp Leu Glu Ala Thr Phe Arg Leu Leu Val 
340 345 350 

25 Ala Leu Gly Thr Leu He Ser Asp Asp Ser Asn Ala Val Gin Leu Ala 
355 360 365 

Lys Ser Leu Gly Val Asp Ser Gin He Lys Lys Tyr Ser Ser Val Ser 
370 375 380 

30 

Glu Pro Ala Lys Val Ser Glu Cys Cys Arg 'Phe He Leu Asn Leu Leu 
385 390 395 400 



35 



(2) INFORMATION FOR SBQ ID NO: 552: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 552: 

Tyr Pro Asn Gin Asp Gly Asp He Leu Arg Asp Gin Val Leu His Glu 
1 5 10 15 

50 His He Gin Arg Leu Ser Lys Val Val Thr Ala Asn His Arg Ala Leu 
20 25 30 

Gin He Pro Glu Val Tyr Leu Arg Glu Ala Pro Trp Pro Ser Ala Gin 
35 40 45 

55 

Ser Glu He Arg Thr He Ser Ala Tyr Lys Thr Pro Arg Asp Lys Val 
50 55 60 

Gin Cys He Leu Arg Mfet Cys Ser Thr He Met Asn Leu Leu Ser Leii 
60 65 70 75 80 
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Ala Asn Glu Asp Ser Val Pro Gly Ala Asp Asp Phe Val Pro Val Leu 
85 90 95 

5 Val Phe Val Leu lie Lys Ala Asn Pro Pro Cys Leu Leu Ser Thr Val 
100 105 110 

Gin Tyr He Ser Ser Phe Tyr Ala Ser Cys Leu Ser Gly Glu Glu Ser 
115 120 125 

10 

Tyr Trp Trp Met Gin Phe Thr Ala Ala Val Glu 
130 135 



15 

(2) INFORMATION FOR SEQ ID NO: 553: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTICW : SEQ ID NO: 553: 

Tyr Pro Asn Gin Asp Gly Asp He Leu Arg Asp Gin Val Leu His Glu 
25 1 5 10 15 

His He Gin Arg Leu Ser Lys Val Val Thr Ala Asn His Arg Ala Leu 
20 25 30 

30 Gin He Pro Glu Val Tyr Leu Arg Glu Ala Pro Trp Pro Ser Ala Gin 
35 40 45 

Ser Glu He Arg Thr He Ser Ala Tyr Lys Thr Pro Arg Asp Lys Val 
50 55 60 

35 

Gin Cys He Leu Arg Met Cys Ser Thr He Met Asn Leu Leu Ser Leu 
65 70 75 80 

Ala Asn Glu Asp Ser Vad Pro Gly Ala Asp Asp Phe Val Pro Val Leu 
40 85 90 95 

Val Phe Val Leu He Lys Ala Asn Pro Pro Cys Leu Leu Ser Thr Val 
100 105 110 

45 Gin Tyr He Ser Ser Phe Tyr Ala Ser Cys Leu Ser Gly Glu Glu Ser 
115 120 125 

Tyr Trp Trp Met Gin Phe Thr Ala Ala Val Glu Phe He Lys Thr He 
130 135 140 

50 



55 

(2) INFORMATION FOR SEQ ID NO: 554: 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 554: 

Tyr Pro Asn Gin Asp Gly Asp He Leu Arg Asp Gin Val Leu 
15 10 



(2) INFOFMATION FOR SEQ ID NO: 555: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555: 

Glu Ala Pro Trp Pro Ser Ala Gin Ser Glu He 
15 10 



(2) INFORMATION FOR SEQ ID NO: 556: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 556: 

Ser Gly Glu Glu Ser Tyr Trp Trp Met Gin Phe Thr Ala Ala Val Glu 
1 5 10 15 

Phe He Lys Thr He 
20 



(2) INFORMATION FOR SEQ ID NO: 557: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 557: 

Ala Asp Asp Phe Val Pro Val Leu Val Phe Val Leu He Lys Ala Asn 
1 5 10 15 

Pro Pro 



(2) INFORMATION FOR SEQ ID NO: 558: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 558: 
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Tyr Lys Thr Pro Arg Asp Lys Val Gin Cys He Leu 
15 10 

5 

(2) INFORMATION FOR SEQ ID NO: 559: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559: 

15 Gly Ala Asp Asp Phe Val Pro Val Leu Val Phe Val Leu He Lys 
1^5 10 15 



20 (2) INFORMATION FOR SEQ ID NO: 560: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
25 <D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560: 

Pro Val Leu Val Phe Val Leu He Lys Ala Asn Pro 
15 10 

30 



(2) INFORMATION FOR SEQ ID NO: 561: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: a©ino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIOI : SEQ ID NO: 561: 



40 



45 



Ser Ala Arg Ala Ser Thr Gin Pro Pro Ala Gly Gin His Pro Gly Pro 
15 10 15 

Cys 



(2) INFORMATION FOR SEQ ID NO: 562: 

50 

(1) SEQUENCE CHARACTERISTICS: 

(A) LQKjTH: 33 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 
55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562: 

Met Pro Gly Arg Trp Arg Trp Gin Arg Asp Met His Pro Ala Arg Lys 
1 5 10 15 . 

60 Leu Leu Ser Leu Leu Phe Leu He Leu Met Gly Thr Glu Leu Thr Gin 
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Asp 
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25 



30 



(2) INFORMATION FOR SEQ ID NO: 563: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
{D) TOPOLOGy: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 563: n 

Ser Ala Ala Pro Asp Ser Leu Leu Arg Ser Ser Lys Gly Ser Thr Arg 
1 5 10 15 

Gly Ser Leu 



(2) INFORMATION FOR SEQ ID NO: 564: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 564: 

Ala Ala He Val He Trp Arg Gly Lys Ser Glu Ser Arg He Ala Lys 
15 10 15 

Thr Pro Gly He 
20 



(2) INFORMATION FOR SEQ ID NO: 565: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 565: 

Pro Leu Gly He Thr Leu Pro Leu Gly Ala Pro Glu Thr Gly Gly Gly 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 566: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 566: 

Cys Ala Ala Glu Thr Trp Lys Gly Ser Glxi Arg Ala Gly Gin Leu Cys 
15 10 15 

Ala Leu Xjeu Ala 
20 



(2) INFOREIATION FOR SEQ ID NO: 567: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 567: 

Phe Arg Gly Gly Gly Thr Leu Val Leu Pro Pro Thr His Thr Pro Glu 
1 5 10 15 

Trp Leu lie Leu 
20 



(2) INFORMATION FOR SEQ ID NO: 568: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 568: 

Met Arg Ser Ala Arg Pro Ser Leu Gly Cys Leu Pro Ser Trp Ala Phe 
15 10 15 

Ser Gin Ala Leu Asn lie 
20 



(2) INFORMATION FOR SEQ ID NO: 569: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 569: 

Leu Leu Gly Leu Lys Gly Leu Ala Pro Ala Glu He Ser Ala Val Cys 
' 1 5 10 15 

Glu Lys Gly Asn Phe Asn 
20 



(2) INFORMATION FOR SEQ ID NO: 570: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TTPE: amino acid 
(D) TOPOLOGEY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 570: 

Val Ala His Gly Leu Ala Trp Ser Tyr Tyr He Gly Tyr Leu Arg Leu 
15 10 15 

He Leu Pro Glu Leu Gin Ala Arg He Arg 
20 25 



(2) INFORMATION FOR SEQ ID NO: 571: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 571: 

Thr Tyr Asn Gin His Tyr Asn Asn Leu Leu Arg Gly Ala Val Ser Gin 
15 10 15 

Arg Cys 



(2) INPORMATIOI FOR SEQ ID NO: 572: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 572: 

He Leu Leu Pro Leu Asp Cys Gly Val Pro Asp Asn Leu Ser Met Ala 

1 .5 10 15 

Asp Pro Asn He Arg Phe Leu Asp Lys Leu Pro Gin Gin Thr Gly Asp 
20 25 30 

Arg Ala Gly He Lys Asp Arg Val Tyr Ser Asn 
35 40 



(2) INFORMATION FOR SEQ ID NO: 573: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID m: 573: 



Ser He Tyr Glu Leu Leu Glu Asn Gly Gin Arg Ala Gly Thr Cys Val 
15 10 15 
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Leu Glu Tyr Ala Thr Pro Leu Gin Thr Leu Phe Ala Met Ser Gin Tyr 
20 ' 25 30 

Ser Gin Ala Gly Phe Ser Gly Glu Asp Arg Leu Glu Gin 
35 40 - 45 



(2) INFORMATION FOR SEQ ID NO: 574: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEC^GTH: 92 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 574: 

Ala Lys Leu Phe Cys Arg Thr Leu Glu Asp lie Leu Ala Asp Ala Pro 
15 10 15 

Glu Ser Gin Asa Asn Cys Arg Leu lie Ala Tyr Gin Glu Pro Ala Asp 
20 25 30 

Asp Ser Ser Phe Ser Leu Ser Gin Glu Val Leu Arg His Leu Arg Gin 
35 40 45 

Glti Glu Lys Glu Glu Val Thr Val Gly Ser Leu Lys Thr Ser Ala Val 
50 55 60 

Pro Ser Thr Ser Thr Met Ser Gin Glu Pro Glu Leu Leu lie Ser Gly 
65 70 75 80 

Met Glu Lys Pro Leu Pro Leu Arg Thr Asp Phe Ser 
85 90 



(2) INFORMATION FOR SEQ ID NO: 575: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 575: 

Leu Leu Gly Leu Lys Gly Leu Ala Pro Ala Glu lie Ser Ala Val Cys 
1 5 10 15 

Glu Lys Gly Asn Phe Asn Val Ala His Gly Leu Ala Trp Ser Tyr Tyr 
20 25 30 

lie Gly Tyr Leu Arg Leu lie Leu Pro Glu Leu 
35 40 



(2) INFORMATION FOR SEQ ID NO: 576: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 
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■3) T/?E: andno acid 
(D) TOPOLOGY: linear 
(xi) S3Q^XE>XS DESCRIPTION: SEQ ID NO: 576: 

Thr Xsc :.ys Lau Leu Lys Leu Arg Arg Asn He Val Lys Leu Ser Leu 
1.5 10 15 

Tyr Arg His ?r.s Thr Asn 
2Z 



(2) ZlTFCPlG^ZZZi: FOR SEQ ID NO: 577: 

(i) SEQUENCE C-iARACTERISTICS: 

(A) LZiXSTH: 22 amino acids 

(3) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 577: 

Thr Leu lie leu Ala Val Ala Ala Ser He Val Phe He He Trp Thr 
15 10 15 

Thr Kez Lys Phe Arg He 
2 Z 



(2) i:3C?MATIC^: FOR SEQ ID NO: 578: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LE^X?^H: 28 amino acids 

B) TYPE: amino acid 

Zt TOPOLOGY: linear 
{yd) SEQUEfJCS DESCRIPTI<»I : SEQ ID NO: 578: 

Val Tr-r Cys Glr. Ser Asp Trp Arg Glu Leu Trp Val Asp Asp Ala He 
1 5 ^ ' 10 15 

Trp Arg Leu Leu Phe Ser Met He Leu Phe Val He 
2C 25 



(2) riFCRMATXCN FOR SEQ ID NO: 579: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 27 amino acids 

(3) TYPE: amino acid 

(D) TOPOLOGY: linear 
(3Ci) SEQUENCE DESCRIPTION: SEQ ID NO: 579: 

Met Val Leu Trp Arg Pro Ser Ala Asn Asn Gin Arg Phe Ala Phe Ser 
1 5 10 15 

Pro Leu Ser Glu Glu Glu Glu Glu Asp Glu Gin 
20 25 
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(2) I^3F0HMATI0N FOR SEQ ID NO: 580: 

(i) SEQUENCE CHARACTERISTICS: 

, (A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 580: 

Met Val Leu Trp Arg Pro Ser Ala Asn Asn Gin Arg Phe Ala Phe Ser 
15 10 15 

Pro Leu Ser Glu Glu Glu Glu Glu Asp Glu Gin 
20 25 



(2) INFORMATICS FOR SEQ ID NO: 581: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(6) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIFTICN: SEQ ID NO: 581: 

Lys Glu Pro Met Leu Lys Glu Ser Phe Glu Gly Met Lys Met Arg Ser 
1 5 10 15 

Thr Lys Gin Glu Pro Asn Gly Asn Ser Lys Val Asn Lys Ala Gin Glu 
20 25 30 

Asp Asp Leu 
35 



(21 INFORMATION FOR SEQ ID NO: 582: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 582: 

Lys Trp Val Glu Glu Asn Val Pro Ser Ser Val Thr Asp Val Ala Leu 
15 10 15 

Pro Ala' Leu Leu Asp Ser Asp Glu Glu Arg Met lie Thr His Phe Glu 
20 25 30 

Arg Ser Lys Met Glu 
35 



(2) INFORMATION FOR SEQ ID NO: 583: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(Xi) S33UE:JCZ ZZBZRZyZZy.:: SZZ 1:3 :*Z: Er2; 

Asp Pro Arg Val Arg l«u .--sn Ser Leu Thr Cys Lys 3i 
1 5 1: 

Ser Leu Thr Gin 
23 



(2) INFORJGTICN ?C?. ZD yXDz =24: 

(i) SHC-JENCZ CH?J^JCTZ?-STZrS: 

(A) LE:5:jr:-:: 13 s=^-- acids 

(3) TiTE: 2rir-0 arid 
(xi) SEQUSvCi 3-E5IRI?r-12r: £2Q n ITZ: ziiz 
Tyr Glu Pro Sfet Asp ?he Mer Ala Ler^ lie T>t Asp 



(2) INFORliAnCiM FC?. =3Q ID =23: 

(i) SS^UENCZ CHArACTS?::STiri: 

(B) Tr'rZ: anino acid 
(D) 7CP::10T/: lir.ear 
(xi) SSQUErrCE DESrRZPri:::: — rO: : = £: 

He Arg His Glu Leu rhr Val Iisu Arg Asp Trjr Arg ?ro Ala 2ys P^la 
1 5 1: ' 13 



(2) INFORMATION EC?. SZQ ZD :TDi zsz: 

- (1) SEQUEtiCE rr^iPACTZPJIcriCS: 

(A) 1^3Cn:~:: id a:i^-c acics 

(B) TiFEi anino acid 
(D) CCPOLCGlf: linear 

(xi) S3QUe:x:e i:Esr3c?Ti37: £22 Hi 

Met Asp Phe Xaa Kec Ala leu He r%-r An: 
1-5 1: 



(2) INFORMATICN FCR S3Q ID 230: =37: 

(i) SEQUENCE C3!A?ACrZRISjICS: 

(A) ISJCnH: 14 arinc acics 

(B) TiPE: aaiicio acid 
(D) TC90LCGY: lir^sar 

'(xi) S3QUE3CE DESISCPriaii: S3Q ID 2K): =3" 
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Met Gln GIu Met Met Arg Asn Gin Asp Arg Ala Leu Ser Asn Leu Glu 
1.5 10 15 

Ser He Pro Gly Gly Tyr Asn Ala 
20 



(2) INFORMATION FOR SEQ ID NO: 588: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 588: 

Leu Arg Arg Met Tyr Thr Asp He Gin Glu Pro Met Leu Ser Ala Ala 
15 10 15 

Gin Glu Gin Phe Gly Gly Asn Pro Phe 
20 25 



(2) INFORMATION FOR SEQ ID NO: 589: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEE9GTH: 32 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 589: 

Ala Ser Leu Val Ser Asn Thr Ser Ser Gly Glu Gly Ser Gin Pro Ser 
1 5 10 15 

Arg Thr Glu Asn Arg Asp Pro Leu Pro Asn Pro Trp Ala Pro Gin Thr 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 590: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 590: 

Ser Gin Ser Ser Ser Ala Ser Ser Gly Thr Ala Ser Thr Val Gly Gly 
15 10 15 

Thr Thr Gly Ser Thr Ala Ser Gly Thr Ser Gly Gin Ser Thr Thr Ala 
20 25 30 

Pro Asn Leu Val Pro Gly Val Gly Ala Ser Met Phe Asn Thr Pro Gly 
35 40 45 
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Met Gin Ser Leu Leu Gin Gin lie Thr Glu Asn Pro Gin Leu Met Gin 
50 55 60 

Asn Met Leu Ser Ala Pro Tyr 
65 .70 . 



(2) INFOFMATION FOR SBQ ID NO: 591: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591: 

Met Arg Ser Met Met Gin Ser Leu Ser Gin Asn Pro Asp Leu Ala Ala 
15 10 15 

Gin Met Met Leu Asn Asn Pro Leu Phe Ala Gly Asn Pro Gin Leu Gin 
20 25 30 

Glu Gin Met Arg Gin Gin Leu Pro Thr Phe Leu Gin Gin 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 592: 

(i) SEQUENCE CHARACTEEaSTICS : 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 592: 

Met Gin Asn Pro Asp Thr Leu Ser Ala Met Ser Asn Pro Arg Ala Met 
X 5 10 15 

Gin Ala Leu Leu Gin lie Gin Gin Gly Leu Gin Thr Leu Ala Thr Glu 
20 25 30 

Ala Pro Gly Leu He Pro Gly Phe Thr Pro Gly Leu Gly Ala Leu Gly 
35 40 45 

Ser Thr Gly Gly Ser Ser Gly Thr Asn Gly Ser Asn Ala Thr Pro Ser 
50 55 60 

Glu Asn Thr Ser Pro Thr Ala Gly Thr 
65 70 



(2) INFORMATION FOR SEQ ID NO: 593: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEXKTTH: 72 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 593: 
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3*u ^3 Gly His Gin Gin Phe He Gin Gin Met Leu Gin Ala Leu 
"1 ■ " 5 10 15 

1-v Val Asn Pro Gin Leu Gin Asn Pro Glu Val Arg Phe Gin Gin 
5 20 25 30 

-Sir. Glu Gin Leu Ser Ala Met Gly Phe Leu Asn Arg Glu Ala Asn 
35 40 45 

10 Z^Ji 31n Ala Leu He Ala Thr Gly Gly Asp He Asn Ala Ala He Glu 
50 55 60 



15 



Arg Leu Gly Ser Gin Pro Ser 
55 70 



{2) H^'CPiGiTION FOR SEQ ID NO: '594; 



20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 594: 



25 



-r- P-o Ala Met Met Gin Glu Met Met Arg Asn Gin Asp Arg Ala 
' ' ' 5 10 15 



Asn Leu Glu Ser He Pro Gly Gly Tyr Asn Ala Leu Arg Arg 
30 ' " 20 25 30 

Ke:: 7-/r Thr Asp He Gin Glu Pro Met Leu Ser Ala Ala 
35 40 45 



35 



(2) Z::FCril€?LT10ti FOR SEQ ID NO: 595: 



(i) SEQUENCE CHARACTERISTICS: 
40 ' (A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 595: 

45 GTy Asn Pro Phe Ala Ser Leu Val Ser Asn Thr Ser Ser 
1 5 10 



50 (2) I>IF0R:-IATI0N for SEQ id NO: 596: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
55 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 596: 

Glu Asn Arg Asp Pro Leu Pro Asn Pro Trp Ala 
1 5 10 
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(2) INFORMATION FOR SEQ ID NO: 597: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTI<»I: SEQ ID NO: 597: 

Gly Lys He Leu Lys Asp Gin Asp Thr Leu Ser Gin His Gly lie His 
15 10 15 

Asp 



(2) INFORMATION FOR SEQ ID NO: 598: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 598: 

Gly Leu Thr Val His Leu Val lie Lys Thr Gin Asn Arg Pro 
15 10 



(2) INFORMATION FOR SEQ ID NO: 599: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 599: 

Ser Glu Leu Gin Ser Gin Met Gin Arg Gin Leu Leu Ser Asn Pro Glu 
15 10 15 

Met Met 



(2) INFORMATION FOR SEQ ID NO: 600: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 600: 



Pro Glu He Ser His Met Leu Asn Asn Pro Asp He Met Arg 

1 " 5 . 10 . 
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(2) INrORI4ATICM FOR SEQ ZO NO: ^01: 

(i) SEQUHIXE OiAPACTSlISTICS: 

(A) LG3GTH: 18 aczlrio acids 

(3) TYPE: amino acid 

(D) TOPOLCGY: lir^ear 
(xi) SEQUENCE DESCSIPTICii: : SEQ ID ^;C: 601: 

Arg Gin Leu lie Met Ala Asn Pro Gin Mec Glr. Gin Leu lie Gin Arg 

15 i: 15 

Asn Pro 



(2) INFORMATION FOR SEQ ID NO: 502: 

(i) SEQUEIXilE CiAPJOEUSTICS: 

(A) LENGTH: 27 acino aci^ 

(3) TYPE: amino SLcid 

(D) TOPOLOGY: linsar 
(xi) SEQUENCE DESCRIPTICII: SEQ ID 130: 602: 

Asn Leu Cys .His Val Asp Cys Gin Asp Leu Leu Asn Pro Asn Leu Leu 
1 5 10 15 

Ala Gly He His Cys Ala Lys Arg He Val Ser 
20 25 



(2) INFORMATION FOR SEQ ID NO: 603: 

(i) SEQUE^7CE CHAPACTERISTICS: 

(A) LE^K3TH: 23 ardno acids 

(3) TYPE: aoiino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTICN: SEQ ID IZ): 503: 

Leu Aso Gly Phe Glu Gly Tyr Ser Leu Ser As? Irp Leu Cys Leu Ala 
1 ^ 5 10 15 

Phe Val Glu Ser Lys Phe Asn 
20 



(2) INFORMATICai FOR SEQ ID NO: 604: 

(i) SEQUraCE CHARACTERISTICS: 

(A) LESGTH: 22 aaino acids ■ 

(3) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTICN: SEQ ID 130 i 604: 

Asn Glu Asn Ala Asp Gly Ser Phe Asp Tyr Gly Leu Phe Gin He Asn 
1 5 10 15 



Ser His Tyr Trp Cys Ajsn 
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20 



(2) INFORMATIOJ FOR SEQ ID NO: 605: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 605: 

Asn Leu Cys His Val Asp Cys Gin Asp Leu Leu Asn Pro Asn Leu Leu 
15 10 15 

Ala Gly He His Cys Ala Lys Arg lie Val Ser 
20 25 



(2) INFORMATIOI FOR SEQ ID NO: 606: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 606: 

He Arg Glu Val Asn Glu Val He Gin Asn Pro Ala Thr 
15 10 



(2) INFORMATION FOR SEQ ID NO: 607: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:- 30 amino acids 

<B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 607: 

He Thr Arg He Leu Leu Ser His Phe Asn Trp Asp Lys Glu Lys Leu 
' 1 5 10 15 

Mfet Glu Arg Tyr Phe Asp Gly Asn Leu Glu Lys Leu Phe Ala 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 608: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 608: 

Asn Thr Arg Ser Ser Ala Gin Asp Met Pro Cys Gin He Cys Tyr LeU 
15 10 15 
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5 



25 



35 



Asn Tyr Pro Asn Ser Tyr Phe 
20 



(2) INFORMATION FOR SEQ ID NO: 609: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 
10 (B) T^E: amino acid 

(D) TOPOLOGY:- linear 
(xi) SEQUENCE DESCRIPTICXfl : SEQ ID NO: 609: 

Cys Asp lie Leu Val Asp Asp Asn Thr Val Met Arg Leu lie Thr Asp 
15 1 5 10 15 

Ser Lys Val Lys Leu Lys Tyr Gin His Leu He Thr Asn Ser Phe Val 
20 25 30 

20 Glu Cys Asn Arg Leu Leu Lys Trp Cys Pro Ala Pro Asp Cys His His 
35 40 45 



Val Val Lys Val Gin Tyr Pro Asp Ala Lys Pro Val 
50 55. 60 



(2) INFORMATION FOR SEQ ID NO: 610: 



30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid ^ 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 610: 



Cys Asp He Leu Val Asp Asp Asn Thr Val Met Arg Leu He Thr Asp 
1 .5 " 10 .15 



Ser Lys Val Lys Leu Lys Tyr Gin His Leu lie Thr Asn Ser Phe Val 
40 20 25 30 

Glu Cys Asn Arg Leu Leu Lys Trp Cys Pro Ala Pro Asp Cys His His 
35 40 45 

45 Val Val Lys Val 
50 



50 (2) INFORMATION FOR SEQ ID NO: 611: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 
55 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 611: 

Gly Cys Asn His Met Val Cys Arg Asn Gin Asn Cys Lys Ala Glu Phe 
15 10 15 
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cys Trp Val Cys Leu Gly Pro Trp Glu Pro His Gly Ser Ala Trp Tyr 
20 25 30 • 

Asn Cys Asn Arg Tyr Asn Glu Asp Asp Ala Lys Ala Ala Arg Asp Ala 
35 40 45 

Gin Glu Arg Ser Arg Ala Ala Leu Gin Arg Tyr Leu 
50 55 60 



(•2) INFORMATION FOR SEQ ID NO: 612: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 612: 

Phe Tyr Cys Asn Arg Tyr Met Asn His Met Gin Ser Leu Arg Phe Glu 
1.5 10 15 

His Lys Leu Tyr Ala Gin Val Lys Gin Lys Met Glu Glu Met Gin Gin 
20 25 "30 

His Asn Met Ser Trp lie Glu Val Gin Phe Leu Lys Lys Ala Val Asp 
35 40 45 

Val Leu Cys Gin Cys Arg Ala Thr Leu Mfet Tyr Thr 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 613: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: ^60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 613: 

Tyr Val Phe Ala Phe Tyr Leu Lys Lys Asn Asn Gin Ser He He Phe 
1 5 10 15 

Glu Asn Asn Gin Ala Asp Leu Glu Asn Ala Thr Glu Val Leu Ser Gly 
20 25 30 

Tyr Leu Glu Arg Asp He Ser Gin Asp Ser Leu Gin Asp He Lys Gin 
35 40 45 

Lys Val Gin Asp Lys Tyr Arg Tyr Cys Glu Ser Arg 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 614: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOSY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 614: 

Thr Gly Leu Glu Cys Gly His Lys Phe Cys Met Gin Cys Trp Ser Glu 
1 5 . 10 15 

Tyr Leu Thr Thr Lys He Met Glu Glu Gly Met Gly Gin Thr He Ser 
20 25 30 

Cys Pro Ala His Gly 
35 



(2) INFORMATI(»J FOR SEQ ID NO: 615: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 ainino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615: 

Met Trp Gly Tyr Leu Phe Val Asp Ala Ala Trp Asn Phe Leu Gly Cys 
1 5 10 15 

Leu He Cys Gly Trp 
20 



(2) INFORMATION FOR SEQ ID NO: 616: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 andno acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 616: 

Met His Phe He Ser Ser Gly Asn Val Ser Ala He Arg Ser Ser He 
15 10 15 

Leu Leu Leu Arg Xaa Ser Leu Ser Tyr Leu Gly Asn Cys Leu Arg Val 
20 25 30 

Ser Ala He Phe Val Tyr Phe Leu Leu Phe Leu Leu Leu Ser 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 617: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 617: 

Met Asp Gin Ala Leu Arg Gly Ser Pro Ser Glu Gly Phe Ser Thr Asp 
15 10 15 
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Pro Ser Pro Pro Gin Val Gly Arg. Gin He Pro Ser Phe Pro Pro Trp 
. 20 . 25 30 

Arg Afg Leu Val Leu Pro Lys Ala Ser Gly Cys Phe Leu Glu Arg Glu 
5 35 40 45 

Trp Trp Leu Cys Val Phe Lys Leu Arg Thr Arg Pro Gly Ala Glu Ala 
50 55 60 

10 His Ala Tyr Asn Ser Ser He Leu Gly Gly Arg Gly Lys Gly He Thr 
65 70 75 80 



15 



(2) INFOFMATION FOR SEQ ID NO: 618: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 618: 



25 



Met Leu Pro Ala Leu Ala Ser Cys Cys His Phe Ser Pro Pro Glu Gin 
1 5 10 . 15 



Ala Ala Arg Leu Lys Lys Leu Gin Glu Gin Glu Lys Gin Gin Lys Val 
30 20 25 30 

Glu Phe Arg Lys Arg Met Glu Lys Glu Val Ser Asp Phe He Gin Asp 
35 40 45 

35 Ser Gly Gin He Lys Lys Lys Phe Gin Pro Met Asn Lys He Glu Arg 
50 55 60 

Ser He Leu His Asp Val Val Glu Val Ala Gly Leu Thr Ser Phe Ser 
65 70 75 80 

40 

Phe Gly Glu Asp Asp Asp Cys Arg Tyr Val Met He Phe Lys Lys Glu 
85 90 95 

Phe Ala Pro Ser Asp Glu Glu Leu Asp Ser Tyr Arg Arg Gly Glu Glu 
45 100 105 110 

Trp Asp Pro Gin Lys Ala Glu Glu Lys Arg Asn Xaa Lys Glu Leu Ala 
115 120 . 125 

50 Gin Arg Gin 
130 



55 (2) INFORMATION FOR SEQ ID NO: 619: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino adds 

(B) TYPE: amino acid 
60 (D) TOPOLOGY: linear 
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5 



30 



40 



666 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 619: 

Glu Glu Glu Ala Ala Gin Gin Gly Pro Val Val Val Ser Pro Ala Ser 
1 5 . 10 15 

Asp Tyr Lys Asp Lys Tyr Ser His Leu lie Gly Lys Gly Ala Ala Lys 
20 25 30 



Asp Ala Ala His Met Leu Gin Ala Asn Lys Thr Tyr Gly Cys Xaa Pro 
10 35 40 45 , 

Val Ala Asn Lys Arg Asp Thr Arg Ser lie Glu Glu Ala Met Asn Glu 
50 55 60 

15 lie Arg Ala Lys Lys Arg Leu Arg Gin Ser Gly Glu 
65 70 75 



20 (2) INFORMATIC»T FOR SEQ ID NO: 620: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 
25 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 620: 

Pro Pro Arg Arg Pro Ala Gin Leu Pro Leu Thr Pro Gly Ala Gly Gin 
1 5 10 15 

Gly Ala Gly Arg Asp Lys Ala Ala Ala He Arg Ala His Pro Gly Ala 
20 25 30 



Pro Pro Leu Asn His Leu Leu Pro 
35 35 40 



(2) INFORMATION FOR SEQ ID NO: 621: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 621: 

Ala val Pro Gin Ala Gly Gly Lys Gin Val Phe Asp Leu Ser Pro Leu 
1 5 .10 • 15 . 

50 Glu Leu Gly Tyr Val Arg Gly Met Cys Val Cys Val 
20 25 



55 (2) INFORMATION FOR SEQ ID NO: 622: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 
60 (D) TOPOLOGY: linear 



wo 98/54963 



PCT/US98/11422 



667 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 622: 

Met Leu Pro Ala Leu Ala Ser Cys Cys His Phe Ser Pro Pro Glu Gin 
15 10 15 

Ala Ala Arg Leu Lys Lys Leu Gin Glu Gin Glu Lys Gin Gin Lys Val 
20 25 30 

Glu Phe Arg Lys Arg Met Glu Lys Glu Val Ser Asp Phe He Gin Asp 
10 35 40 45 

Ser Gly Gin He Lys Lys Lys Phe Gin Pro Met Asn Lys He Glu Arg 
50 55 60 

15 Ser He Leu His Asp Val Val Glu Val Ala Gly Leu Thr Ser Phe Ser 
65 70 75 80 

Phe Gly Glu Asp Asp Asp Cys Arg Tyr Val Met He Phe Lys Lys Glu 
85 90 95 

Phe Ala Pro Ser Asp Glu Glu Leu Asp Ser Tyr Arg Arg Gly Glu Glu 
100 105 110 



20 



Trp Asp Pro Gin Lys Ala Glu Glu Lys Arg Asn Xaa Lys Glu Leu Ala 
25 115 120 125 

Gin Arg Gin Glu Glu Glu Ala Ala Gin Gin Gly Pro Val Val Val Ser 
130 135 140 

30 Pro Ala Ser Asp Tyr Lys Asp Lys Tyr Ser His Leu He Gly Lys Gly 
145 150 155 160 



35 



45 



Ala Ala Lys Asp Ala Ala His Met Leu Gin Ala Asn Lys Thr Tyr Gly 
165 170 175 

Cys xaa Pro Val Ala Asn Lys Arg Asp Thr Arg Ser He Glu Glu Ala 
180 185 190 



Met Asn Glu He Arg Ala Lys Lys Arg Leu Arg Gin Ser Gly Glu 
40 195 200 205 



(2) INFORMATION FOR SEQ ID NO: 623: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 623: 

Leu Leu Cys Pro Val Leu Asn Ser Gly Xaa Ser Trp Asn Phe Pro His 
1' 5 10 15 

55 Pro Ser Gin Pro Glu Tyr Ser Phe His Gly Phe His Ser Thr Arg Leu 
20 25 30 

Trp He 



60 
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(2) INFORMATION FOR SEQ ID NO: 624: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 624: 

Pro Ser Thr Pro Trp Phe Leu Phe Leu Leu Gly Leu Thr Cys Pro Phe 
15 10 15 

Ser Thr Ser His Pro Arg Trp Asp Ser He Pro Pro 
15 20 25 



(2) INFORMATION FOR SEQ ID NO: 625: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 625: 

Glu Leu Ser He Ser He Ser Asn Val Ala Leu Ala Asp Glu Gly Glu 
15. 10 15 

30 Tyr Thr Cys Ser He Phe Thr Met Pro Val Arg Thr Ala Lys Ser Leu 
20 25 30 

Val Thr Val Leu Gly He Pro Gin Lys Pro He He Thr Gly Tyr Lys 
35 40 45 

35 

Ser Ser Leu Arg Glu Lys Asp Thr Ala Thr Leu Asn Cys Gin Ser Ser 
50 55 60 

Gly Ser Lys Pro Ala Ala Arg Leu Thr Trp Arg Lys Gly Asp Gin Glu 
40 65 70 75 80 

Leu His Gly Glu Pro Thr Arg He Gin Glu Asp Pro Asn Gly Lys Thr 
85 90 95 

45 Phe Thr Val Ser Ser Ser Val Thr Phe Gin Val Thr Arg Glu Asp Asp 
100 105 110 

Gly Ala Ser He Val Cys Ser Val Asn His Glu Ser Leu Lys Gly Ala 
115 120 125 

Asp Arg Ser Thr Ser Gin Arg He Glu Val Leu Tyr Thr Pro Thr Ala 
130 135 140 

Met He Arg Pro Asp Pro Pro His Pro Arg Glu Gly Gin Lys Leu Leu 
55 145 150 155 160 

Leu His Cys Glu Gly Arg Gly Asn Pro Val Pro Gin Gin Tyr Leu Trp 
165 170 175 

60 Glu Lys Glu Gly Ser Val Pro Pro Leu Lys Met Thr Gin Glu Ser Ala 
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180 185 190 

Leu lie Phe Pro Phe Leu Asn Lys Ser Asp Ser Gly Thr Tyr Gly Cys 
195 200 205 

5 

Thr Ala Thr Ser Asn Met Gly Ser Tyr Lys Ala Tyr Tyr Thr Leu Asn 
210 215 220 

Val Asn Asp 
10 225 



(2) INFORMATION FOR SEQ ID NO: 626: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acicis 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 626: 

Glu Leu Ser He Ser He Ser Asn Val Ala Leu Ala Asp Glu Gly Glu 
15 10 15 

25 Tyr Thr Cys Ser He Phe Thr Met Pro Val Arg Thr Ala Lys Ser Leu 
20 25 30 

Val Thr Val Leu Gly He Pro Gin Lys Pro He He Thr Gly Tyr Lys 
35 40 45 

30 

Ser Ser Leu Arg Glu Lys Asp Thr Ala Thr Leu Asn Cys Gin Ser Ser 
50 55 60 



35 



40 



(2) INFORMATION FOR SEQ ID NO: 627: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

45 (xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 627: 

Cys Gin Ser Ser Gly Ser Lys Pro Ala Ala Arg Leu Thr Trp Arg Lys 
1 5 10 15 

50 Gly Asp Gin Glu Leu His Gly Glu Pro Thr Arg He Gin Glu Asp Pro 
20 25 30 



55 



Asn Gly Lys Thr Phe Thr Val Ser Ser Ser Val Thr Phe Gin Val Thr 
35 40 45 

Arg Glu Asp Asp Gly Ala Ser He Val Cys Ser Val Asn His Glu Ser 
50 55 60 



Leu 
60 65 
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(2) INFOFMATION FOR SEQ ID NO: 628: 

(i) SEQUENCE CHARACTERISTICS: 

• (A) LENGTH: 58 amino acicSs 

(B) TYPE: amino acid 

(D> TOPOLOGY: linear 
txi) SEQUENCE DESCRIPTION: SEQ ID NO: 628: 

His Glu Ser Leu Lys Gly Ala Asp Arg Ser Thr Ser Gin Arg lie Glu 
15 10 15 

Val Leu Tyr Thr Pro Thr Ala Met lie Arg Pro Asp Pro Pro His Pro 
20 25 30 

Arg Glu Gly Gin Lys Leu Leu Leu His Cys Glu Gly Arg Gly Asn Pro 
35 40 45 

Val Pro Gin Gin Tyr Leu Tzp Glu Lys Glu 
50 55 



(2) INFORMATION FOR SEQ ID NO: 629: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTI(»I: SEQ ID NO: 629: 

Trp Glu Lys Glu Gly Ser Val Pro Pro Leu Lys Met Thr Gin Glu Ser 
1 5 10 15 

Ala Leu He Phe Pro Phe Leu Asn Lys Ser Asp .Ser Gly Thr Tyr Gly 
20 " 25 30 

Cys Thr Ala Thr Ser Asn Met Gly Ser Tyr Lys Ala Tyr Tyr Thr Leu 
35 40 45 

Asn Val Asn Asp 
50 



(2). INFORMATION FOR SEQ ID NO: 630: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 630: 

Val Pro Glu Leu Pro Asp Arg Val His Gin Leu His Gin Ala Val Gin 
1 5 10 ' 15 

Gly Cys Ala Leu Gly Arg Pro Gly Phe Pro Gly Gly Pro Thr His Ser 
20 25 30 
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Gly His His Lys Ser His Pro Gly Pro Ala Gly Gly Asp Tyr Asn Arg 
35 40 45 

5 Cys Asp Arg Pro Gly Gin Val His Leu His Asn Pro Arg Gly Thr Gly 
50 55 60 



10 



20 



Arg Arg Gly Gin Leu His Pro Thr Ala Gly Pro Gly Val His Arg Arg 
65 70 75 80 

Ala Cys Pro Ser Gin Gin Leu Pro His Arg Leu Gly Pro Gly Val Pro 
85 90 95 



Cys Pro Ser Pro Ser Leu Thr Pro Val Leu Pro Ser Trp Thr Gin Ser 
15 100 105 110 



Trp Cys Gly Leu Pro Gly Tyr Thr Ser Ser Ser 
115 120 



(2) INFORMATION FOR SEQ ID NO: 631: 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 631: 

30 Val His Gin Leu His Gin Ala Val Gin Gly Cys Ala Leu Gly Arg Pro 
1 5 10 15 

Gly Phe Pro Gly Gly Pro 
20 

35 



(2) INFORMATION FOR SEQ ID NO: 632: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTI(»J : SEQ ID NO: .632: 

45 

Pro Thr His Ser Gly His His Lys Ser His Pro Gly Pro Ala Gly Gly 
1 5 10 15 - 

Asp Tyr Asn Arg Cys Asp Arg Pro Gly Gin Val His Leu His Asn Pro 
50 20 25 30 

Arg Gly Thr Gly Arg Arg Gly. Gin Leu His 
35 40 

55 

(2) INFORMATION FOR SEQ ID NO: 633: 

(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTTH: 55 amino acids 
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(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 633: 

5 Leu His Pro Thr Ala Gly Pro Gly Val His Arg Arg Ala Cys Pro Ser 
1.5 10 15 

Gin Gin Leu Pro His Arg Leu Gly Pro Gly Val Pro Cys Pro Ser Pro 
20 25 30 

10 

Ser Leu Thr Pro Val Leu Pro Ser Trp Thr Gin Ser Trp Cys Gly Leu 
35 40 45 

Pro Gly Tyr Thr Ser Ser Ser 
15 50 55 



20 



35 



50 



(2) INFORMATION FOR SEQ ID NO: 634: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 634: 

Ser Leu Arg Arg Pro Arg Ser Ala Ala Xaa Gin Thr Leu Thr Thr Phe 
1 5 10 15 

30 Leu Ser Ser Val Ser Ser Ala Ser Ser Ser Ala Leu Pro Gly Ser Arg 
20 25 30 



Glu Pro Cys Asp Pro Arg Ala Pro Pro Pro Pro Arg Ser Gly Ser Ala 
35 40 45 

Ala Ser Cys Cys Ser Cys Cys Cys Ser Cys Pro Arg Arg Arg Ala Pro 
50 5S 60 



Leu Arg Ser Pro Arg Gly Ser Lys Arg Arg lie Arg Gin Arg Glu Val 
40 65 70 75 80 

Val Asp Leu Tyr Asn Gly Met Cys Leu Gin Gly Pro Ala Gly Val Pro 
85 90 95 

45 Gly Arg Asp Gly Ser Pro Gly Ala Asn Gly He Pro Gly Thr Pro Gly 
100 105 110 



He Pro Gly Arg Asp Gly Phe Lys Gly Glu Lys Gly Glu Cys Leu Arg 
115 120 125 

Glu Ser Phe Glu Glu Ser Trp Thr Pro Asn Tyr Lys Gin Cys Ser Trp 
130 135 140 



Ser Ser Leu Asn Tyr Gly He Asp Leu Gly Lys He Ala Glu Cys Thr 
55 145 150 155 160 

Phe Thr Lys Met Arg Ser Asn Ser Ala Leu Arg Val Leu Phe Ser Gly 
165 170 175 

60 Ser Leu Arg Leu Lys Cys Arg Asn Ala Cys Cys Gin Arg Trp Tyr Phe 
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180 185 190 

Thr Phe Asn Gly Ala Glu Cys Ser Gly Pro Leu Pro He Glu Ala He 
X95 200 205 

lie Tyr Leu Asp Gin Gly Ser Pro Glu Met Asn Ser Thr He Asn He 
210 215 220 

His Arg Thr Ser Ser Val Glu Gly Leu Cys Glu Gly He Gly Ala Gly 
10 225 230 235 240 

Leu Val Asp Val Ala He Trp Val Gly Thr Cys Ser Asp Tyr Pro Lys ' 
245 250 255 

15 Gly Asp Ala Ser Thr Gly Trp Asn Ser Val Ser Arg He He He Glu 
260 265 270 



20 



Glu Leu Pro Lys 
275 



(2) INFORMATICaJ FOR SEQ ID NO: 635: 



25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 635: 



30 



Ser Leu Arg Arg Pro Arg Ser Ala Ala Xaa Gin Thr Leu Thr Thr Phe 
1 5 . 10 15 



Leu Ser Ser Val Ser Ser Ala Ser Ser Ser Ala Leu Pro Gly Ser Arg 
35 20 25 30 

Glu Pro Cys Asp Pro Arg Ala Pro Pro Pro Pro Arg Ser Gly Ser Ala 
35 40 45 

40 Ala Ser Cys Cys Ser Cys Cys Cys Ser Cys Pro Arg Arg 
50 55 60 



45 (2) INFORMATION FOR SEQ ID NO: 636: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 
50 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTICN: SEQ ID NO: 636: 

Arg Ala Pro Leu Arg Ser Pro Arg Gly Ser Lys Arg Arg He Arg Gin 
15 10 15 

55 

Arg Glu Val Val Asp Leu Tyr Asn Gly Met Cys Leu Gin Gly Pro Ala 
20 25 • 30 



60 



Gly Val Pro 
35 



Gly Arg Asp Gly Ser Pro Gly Ala Asn Gly He Pro Gly 
40 45 
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Thr Pro Gly lie 
50 

5 

(2) INFORMATION FOR SEQ ID NO: 637: 

(i) SEQUENCE CHARACTERISTICS*: 
10 (A) LENGTH: 52 andno acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 637: 

15 Thr Pro Gly He Pro Gly Arg Asp Gly Phe Lys Gly Glu Lys Gly Glu 
15 10 15 

Cys Leu Arg Glu Ser Phe Glu Glu Ser Trp Thr Pro Asn Tyr Lys Gin 
20 25 30 

Cys Ser Trp Ser Ser Leu Asn Tyr Gly He Asp Leu Gly Lys He Ala 
35 40 45 



20 



Glu Cys Thr Phe 
25 50 



30 



(2) INFORMATION FOR SEQ TD NO: 638: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 638: 

Phe Thr Lys Met Arg Ser Asn Ser Ala Leu Arg Val Leu Phe Ser Gly 
1 5 ^' 10 15 

40 Ser Leu Arg Leu Lys Cys Arg Asn Ala Cys Cys Gin Arg Trp Tyr Phe 
20 25 30 



45 



Thr Phe Asn Gly Ala Glu Cys Ser Gly Pro Leu Pro He Glu Ala He 
35 40 45 

He Tyr Leu Asp Gin Gly Ser Pro Glu Met Asn Ser Thr He Asn He 
50 55 60 



His Arg 
50 65 



55 



(2) INFORMATION FOR SEQ ID NO: 639: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 639: 
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Arg Thr Ser Ser Val Glu Gly Leu Cys Glu Gly He Gly Ala Gly Leu 
15 10 15 

5 Val Asp Val Ala He Trp Val Gly Thr Cys Ser Asp Tyr Pro Lys Gly 
20 25 30 



Asp Ala Ser Thr Gly Trp Asn Ser Val Ser Arg He He He Glu Glu 
35 40 45 



10 



15 



Leu Pro Lys 
50 



(2) INFORMATION FOR SEQ ID NO: 640: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 640: 

Thr Lys Lys Glu Asn Cys Arg Pro Ala Ser Leu Met Asn He Asp thr 
25 1 5 10 15 



30 



Lys He Leu Asn Lys He Leu Met Asn Gin 
20 25 



(2) INFORMATION FOR SEQ ID NO: 641: 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 214 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 641: 

40 Met Cys Asn Leu Pro He Lys Val Val Cys Arg Ala Asn Ala Glu Tyr 
1 5 .10 .15 

Met Ser Pro Ser Gly Lys Val Pro Xaa Xaa His Val Gly Asn Gin Val 
20 25 30 



45 



Val Ser Glu Leu Gly Pro He Val Gin Phe Val Lys Ala Lys Gly His 
35 40 45 



Ser* Leu Ser Asp Gly Leu Glu Glu Val Gin Lys Ala Glu Met Lys Ala 
50 50 55 60 

Tyr Met Glu Leu Val Asn Asn Met lieu Leu Thr Ala Glu Leu Tyr Leu 
65 70 75 80 

55 Gin Trp Cys Asp Glu Ala Thr Val Gly Xaa He Thr His Xaa Arg Tyr 

85 • 90 95 . 



60 



Gly Ser Pro Tyr Pro Trp Pro Leu Xaa His He Leu Ala Tyr Gin Lys. 
100 105 110 
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Gin Trp Glu Val Lys Arg Lys Xaa Lys Ala lie Gly Trp Gly Lys Lys 
115 120 125 

Thr Leu Asp Gin Val Leu Glu Asp Val Asp Gin Cys Cys Gin Ala Leu 
5 130 135 140 

Ser Gin Arg Leu Gly Thr Gin Pro Tyr Phe Phe Asn Lys Gin Pro Thr 
145 150 155 160 

10 Glu Leu Asp Ala Leu Val Phe Gly His Leu Tyr Thr He Leu Thr Thr 
165 ' 170 175 

Gin Leu Thr Asn Asp Glu Leu Ser Glu Lys Val Lys Asn Tyr Ser Asn 
180 185 190 

15 

Leu Leu Ala Phe Cys Arg Arg He Glu Gin His Tyr Phe Glu Asp Arg 
195 200, 205 

Gly Lys Gly Arg Leu Ser 

20 210 



25 



40 



(2) INFORMATION FOR SEQ ID NO: 642; 



(i) SEQUENCE CHARACTERISTICS: 

(A) Le«3TH: 44 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 (xi) SEQUENCE DESCRIPTlCaJ : SEQ ID NO: 642: 

Met Cys Asn Leu Pro He Lys Val Val Cys Arg Ala Asn Ala Glu Tyr 
15 10 15 

35 Met Ser Pro Ser Gly Lys Val Pro Xaa Xaa His Val Gly Asn Gin Val 
20 25 30 



Val Ser Glu Leu Gly Pro He Val Gin Phe Val Lys 
35 40 



(2) INFORMATION FOR SEQ ID NO:' 643: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGV: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 643: 

50 

Phe Val Lys Ala Lys Gly His Ser Leu Ser Asp Gly Leu Glu Glu Val 
1 5 . 10 15 

Gin Lys Ala Glu Mfet Lys Ala Tyr Met Glu Leu Val Asn Asn Met Leu 
55 20 .25 30 

Leu Thr Ala Glu Leu Tyr Leu Gin Trp Cys Asp Glu 
35 40 



60 
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(2) INFORMATION FOR SEQ ID NO: 644: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644: 

10 Leu Gin Trp Cys Asp Glu Ala Thr Val Gly Xaa He Thr His Xaa Arg 
1 5 10 15 

Tyr Gly Ser Pro Tyr Pro Trp Pro Leu Xaa His He Leu Ala Tyr Gin 
20 25 30 

Lys Gin Trp Glu Val Lys Arg Lys Xaa Lys Ala He Gly Trp Gly Lys 
35 40 45 



15 



Lys Thr Leu 

20 50 



25 



40 



50 



(2) INFORMATION FOR SEQ ID NO: 645: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 645: 

Asp Gin Val Leu Glu Asp Val Asp Gin Cys Cys Gin Ala Leu Ser Gin 
15 10 15 

35 Arg Leu Gly Thr Gin Pro Tyr Phe Phe Asn Lys Gin Pro Thr Glu Leu 
. 20 25 30 



^ Asp Ala Leu Val Phe Gly His Leu Tyr Thr He 
35 40 



(2) INFORMATION FOR SEQ ID NO: 646: 



45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 
• <B) TYPE: amino acid 
(D) TOPOLOGY: linear 
<xi) SEQUENCE DESCR1PTIC»J: SEQ ID NO: 646: 



Leu Thr Thr Gin Leu Thr Asn Asp Glu Leu Ser Glu Lys Val Lys Asn 
1 5 10 15 



Tyr Ser Asn Leu Leu Ala Phe Cys Arg Arg He Glu Gin His Tyr Phe 
55 20 25 30 



Glu Asp Arg Gly Lys Gly Arg Leu Ser 
35 40 



60 
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(2) INFORMATION FOR SEQ ID NO: 647: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
'(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 647: 

10 Met Xaa Xaa Xaa Asn Ser His He Thr He Fhe Thr Leu Asn Val Asn • 
1 5 10 

Gly Leu Asn Ala Pro Asn Glu Arg His Arg Leu Ala Asn Trp He Gin 
20 25 30 

Ser Gin Asp Gin Val Cys Cys He Gin Glu Thr His Leu Thr Gly Arg 
35 40 45 



15 



Asp Thr His Arg Leu Lys He Lys Gly Trp Arg Lys He Tyr Gin Ala 
20 50 55 60 



25 



50 



55 



Asn Gly Lys Gin Lys Lys 
65 70 



(2) INFORMATION FOR SEQ ID NO: 648: 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 648: 

35 Phe Thr Leu Asn Val Asn Gly Leu Asn Ala Pro Asn Glu Arg His Arg 
15 10 15 

Leu Ala Asn Trp He Gin Ser Gin Asp Gin Val Cys 
20 25 

40 

(2) INFORMATION FOR SEQ ID NO: 649: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 649: 



Thr His Leu Thr Gly Arg Asp Thr His Arg Leu Lys He Lys Gly Trp 
15 10 15 



Arg 



(2) INFORMATION FOR SEQ ID NO: 650: 

60 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids < 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 650: 

Gly Trp Arg Lys He Tyr Gin Ala Asn Gly Lys Gin Lys Lys 
15 10 



(2) INFORMATION FOR SEQ ID NO: 651: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 651: 

He Tyr His Leu His Ser Trp He Phe Phe His Phe Lys Arg Ala Phe 
15 10 15 

Cys Met Cys Phe He Thr Met Lys Val He His Ala His Cys Ser Lys 
20 25 30 

Leu Arg Lys Cys Xaa Asn Ala Gin He Ser Val Phe Cys Thr Thr Leu 
35 40 45 

Thr Ala Ser Tyr Pro Thr 
50 



(2) INFORMATION FOR SEQ ID NO: 652: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 652: 

He Tyr His Leu His Ser Trp He Phe Phe His Phe Lys Arg Ala Phe 
1.5 10 15 

Cys Met Cys Phe He Thr Met 
20 • 



(2) INFORMATION FOR SEQ ID NO: 653: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 653: 



Lys Val He His Ala His Cys Ser Lys Leu Arg Lys Cys Xaa Asn Ala 
15 10 15 



wo 98/54963 



680 



Gin lie Ser Val Phe Cys Thr Thr Leu Thr Ala Ser Tyr Pro Thr 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 654: 

{i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 654: 

Trp Asn Leu Leu Trp Tyr Phe Gin Arg Leu Arg Leu Pro Ser lie Leu 
1 5 10 15 

Pro Gly Leu Val Leu Ala Ser Cys Asp Gly Pro Ser Xaa Ser Gin Ala 
20 25 30 

Pro Ser Pro Trp Leu Thr Pro Asp Pro Ala Ser Val Gin Val Arg Leu 
35 40 45 

Leu Trp Asp Val Leu Thr Pro Asp Pro Asn 
50 55 



(2) INF0RMATIC»7 FOR SEQ ID NO: 655: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 655: 

Gin Arg Gly lie Tyr Arg Glu lie Leu Phe Leu Thr Met Ala Ala Leu 
1 5 10 15 

Gly Lys Asp His Val Asp lie Val Ala Phe Asp Lys Lys Tyr Lys Ser 
20 25 30 

Ala Phe Asn Lys Leu Ala Ser Ser Met Gly Lys Glu Glu Leu Arg His 
35 40 45 

Arg Arg Ala Gin Met Pro 
50 



(2) INF0HMATXC»7 FOR SEQ ID NO: 656: 

(i) SEQU^CE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 656: 

Trp Asn Leu Leu Trp Tyr Phe Gin Arg Leu Arg Leu Pro Ser lie Leu 
15 10 15 
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Pro Gly Leu Val Leu Ala Ser 
20 



(2) INFORMATION FOR SEQ ID NO: 657: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 191 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 657: 

Glu Asp Asp Gly Phe Asn Arg Ser lie His Glu Val lie Leu Lys Asn 
1 5 10 15 

lie Thr Trp Tyr Ser Glu Arg Val Leu Thr Glu lie Ser Leu Gly Ser 
20 25 30 

Leu Leu lie Leu Val Val lie Arg Thr lie Gin Tyr Asn Met Thr Arg 
35 40 45 

Thr Arg Asp Lys Tyr Leu His Thr Asn Cys Leu Ala Ala Leu Ala Asn 

50 55 60 

Met Ser Ala Gin Phe Arg Ser Leu His Gin Tyr Ala Ala Gin Arg lie 
65 70 75 80 

lie Ser Leu Phe Ser Leu Leu Ser Lys Lys His Asn Lys Val Leu Glu 
85 90 95 

Gin Ala Thr^ Gin Ser Leu Arg Gly Ser Leu Ser Ser Asn Asp Val Pro 
100 105 110 

Leu Pro Asp Tyr Ala Gin Asp Leu Asn Val lie Glu Glu Val lie Arg 
115 120 125 

Met Met Leu Glu lie lie Ash Ser Cys Leu Thr Asn Ser Leu His His 
130 135 140 

Asn Pro Asn Leu Val Tyr Ala Leu Leu Tyr Lys Arg Asp Leu Phe Glu 
145 150 155 160 

Gin Phe Arg Thr His Pro Ser Phe Gin Asp He Met Gin Asn He Asp 
165 170 175 

Leu Val He Ser Phe Phe Ser Ser Arg Leu Leu Gin Ala Gly Ser 
180 185 190 



(2) INFORMATION FOR SEQ ID NO: $58: 

(i) SEQUENCE CH/U^CTERISTICS: 

(A) LE2IGTH: 38 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 658: 



Glu Asp Asp Gly Phe Asn Arg Ser He His Glu Val He Leu Lys Asn 



wo 98/54963 



I 

682 



1 5 10 15 

He Thr Trp Tyr Ser Glu Arg Val Leu Thr Glu He Ser Leu Gly Ser 
20 25 30 

Leu Leu He Leu Val Val 
35 



(2) INFOBMATIQN FOR SEQ ID NO: 659: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 659: 

Arg Thr He Gin Tyr Asn Met Thr Arg Thr Arg Asp Lys Tyr Leu His 
1 5 10 15 

Thr Asn Cys Leu Ala Ala Leu Ala Asn Met Ser Ala Gin Phe Arg Ser 
20 25 30 

Leu His Gin Tyr Ala Ala Gin Arg He He Ser Leu Phe Ser Leu Leu 
35 40 45 

Ser Lys Lys His Asn 
50 



(2) INFORMATION FOR SEQ ID NO: 660: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 660: 

Ser Cys Leu Thr Asn Ser Leu His His Asn Pro Asn Leu Val Tyr Ala 
15 10 15 

Leu Leu Tyr Lys Arg Asp Leu Phe Glu Gin Phe Arg Thr His Pro Ser 
20 25 30 

Phe Gin Asp He Met Gin Asn He Asp Leu Val He Ser Phe Phe Ser 
35 . 40 45 

Ser Arg Leu Leu Gin Ala Gly Ser 
50 55 



(2) INFORMATION FOR SEQ ID NO: 661: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 axnino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 661: 

Lys Lys His Asn Lys Val Leu Glu Gin Ala Thr Gin Ser Leu Arg Gly 
15 10 15 

Ser Leu Ser Ser Asn Asp Val Pro Leu Pro Asp Tyr Ala Gin Asp 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 662: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 662:' 



Met Ala Asp He Gin Thr Glu Arg Ala Tyr Gin Lys Gin Pro Thr He 
15 10 15 

Phe Gin Asn Lys Lys Arg Val Leu Leu Gly Glu Thr Gly Lys Glu Lys 
20 25 30 

Leu Pro Arg Val Thr Asn Lys Asn He Gly Leu Gly Phe Lys Asp Thr 
35 40 45 

Pro Arg Arg Leu Leu Arg Gly Thr Tyr He Asp Lys Lys Cys Pro Phe 
50 55 60 

Thr Gly Asn Val Ser He Arg Gly Arg He Leu Ser Gly Val Val Thr 
65 70 75 80 

Gin Asp Glu Asp Ala Glu Asp His Cys His Pro Pro Arg Leu Ser Ala 
85 90 95 

Leu His Pro Gin Val Gin Pro Leu Arg Glu Ala Pro Gin Glu His Val 
100 " ■ 105 110 

Cys Thr Pro Val Pro Leu Leu Gin Gly Arg Pro Asp Arg 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 663: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 663: 

Met Lys Met Gin Arg Thr He Val He Arg Arg Asp Tyr Leu His Tyr 
15 10 . 15 

He Arg Lys Tyr Asn Arg Phe Glu Lys Arg His Lys Asn Met Ser Val ^ 
20 25 30 



His Leu Ser Pro Cys Phe Arg Asp Val Gin He Gly Asp He Val Thr 
35 40 45 
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"=1 Gly 31u Cys irg ?r= Leu Ser Ihr Val Arg Piie Asr. V=l 
=j 55 50 

Val Thr Lys Ala Ala Gly rr^ Lys l>-s Glr. ?he GI^ lys 7ta 
z5 . t: 75 



{2) HZFZrlGCZOa FOR SEQ ID 170: 5£4: 
(i) SSQOHZIC 



(A) LZT/GTH; 30 



(B) TV?H: ^r.-lno 2.cid 

(xi) SSCUSICE ISSCP-PTICJI: SS; :r NO: 654: 

Me- Ala Asp rie Gin rhr Glu Arr Ala Ti.-r Gin lys Gl=. ?rs Tiz: 
15 10 1= 

Phe Gin Asn lys lys Arg Val Ler^ leu Gly GIu Thr Gly Lys 
20 25 2: 



(2) rSSTrMATlON FOR SEQ NO: 555: 

'i^ sz7Jz::c2 c»ja;C::zristi::s: 

(A) IsrSGTH: 58 2=iiro adds 

(B) TV?E: arino acid 
(D) TCPCLZGy: lir.ear 

(Xi) SBQUZ^TCZ I;ESC?-?TI:!M: SE;; H NC: 555: 

lys l^u rrc Arg Val Thr Asr. Lys A^. lie T^y Leu Gly ?r.s Ir/z A^ 

1 5 i: 15 

Thr Pro Arg Arg Leu Leu ATg Gly Ihr T^-r lie A3? lys l^.-s t-.-s 
20 25 3: 

?'r-e Thr Gly Asn Val Ser He Arr Gly A^rg lie leu Ser Gly Val Val 
35 4: 4= 

Thr Gin A-sp Glu Asp Ala Glu Asp His Oys 
50 55 



(2) IZuFCrMATIGN FOR SEQ lU NO: 555: 



(A) 

(5) TYPE: a=dno acid 
(1) rOPClCGy: linear 
(xi) SEQCHNCS nESCRIPTXCK: SE; ID NC: 655: 

Kis Cys His Pro Pro Arg Leu Ser Ala l^a His Pre Glr. Val Gin Pro 
15 10 15 



Leu Arg Glu Ala Pro Gin Glu Kis Val Cys Thr Pre Val Pro Isu 1 
20 25 3Z 
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Gin Gly Arg Pro Asp Arg 
35 

5 

(2) INFORMATION FOR SEQ ID NO: 667: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 36 amino acids 

(B) TYPE; amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 667: 

15 Met Lys Met Gin Arg Thr lie Val lie Arg Arg Asp Tyr Leu His Tyr 
1 5 * . 10 15 

lie Arg Lys Tyr Asn Arg Phe Glu Lys Arg His Lys Asn Met Ser Val 
20 25 30 

20 

His Leu Ser Pro 
35 



25 

(2) INFORMATION FOR SEQ ID NO: 668: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 
30 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 668: 

Cys Phe Arg Asp Val Gin He Gly Asp He Val Thr Val Gly Glu Cys 
35 1 5 10 15 

Arg Pro Leu Ser Lys Thr Val Arg Phe Asn Val Leu Lys Val Thr Lys 
20 " 25 30 

40 Ala Ala Gly Thr Lys Lys Gin Phe Gin Lys Phe 
35 40 



45 (2) INFORMATION FOR SEQ ID NO: 669: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
50 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 669: 



55 



Pro Arg Arg Leu Leu Arg Gly Thr Tyr He Asp Lys Lys CVs Pro Phe 
1 5 10 . 15 

Thr Gly Asn Val Ser He Arg Gly Arg He Leu Ser Gly Val Val Thr 
20 25 30 



Gin 

60 
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(2) IMFORMATIQM FOR SEQ ID MO: 670: 

(i) SEQUENCE CHAKACTERISTICS : 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 670: 



He Phe Tyr Asp Ser Asp Trp Asn 
1 5 

Asp Arg Ala His Arg Leu Gly Gin 
20 

Leu He Cys Lys Gly Thr He Glu 
35 40 



Pro Thr Val Asp Gin Gin Ala Met 
10 15 

Thr Lys Gin Val Thr Val Tyr Arg 
25 30 

Glu Arg He Leu Gin Arg Ala Lys 
45 



Glu Lys Ser Glu He Gin Arg Met Val He Ser Gly 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 671: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 671: 

Thr Arg Met He Asp Leu Leu Glu Glu Tyr Met Val Tyr Arg Lys His 
1 5 10' 15 

Thr Tyr Xaa Arg Leu Asp Gly Ser Ser Lys He Ser Glu Arg Arg Asp 
20 * 25 30 

Met Val Ala Asp Phe Gin Asn Arg Asn Asp He Phe Val Phe Leu Leu 
35 40 45 

Ser Thr Arg Ala Gly Gly Leu Gly He Asn Leu Thr Ala Xaa Asp Thr 
50 55 60 

Val His Phe 
65 



(2) INFORMATION FOR SEQ ID NO: 672: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 672: 



He Phe Tyr Asp Ser Asp Trp Asn Pro Thr Val Asp Gin Gin Ala Met 
X 5 10 15 
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Asp Arg Ala His Arg Ijeu Gly Gin Thr Lys Gin Val Thr Val Tyr Arg 
20 25 30 



10 (2) INFORMATION FOR SEQ ID NO: 673: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 673: 

Val Tyr Arg Leu lie Cys Lys Gly Thr He Glu Glu Arg He Leu Gin 
1 ■ 5 10 15 

20 

Arg Ala Lys Glu Lys Ser Glu He Gin Arg Met Val He Ser Gly 
20 25 30 



25 

(2) INFORMATION FOR SEQ ID NO: 674: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 
30 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 674: 

Thr Arg Met He Asp Leu Leu Glu Glu Tyr Met Val Tyr Arg Lys His 
35 1 .5 10 15 

Thr Tyr Xaa Arg Leu Asp Gly Ser Ser Lys He Ser Glu Afg Arg Asp 
20 25 30 

40 Mfet 



45 (2) INFORMATICai FOR SEQ ID NO: 675: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 
50 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 675: 



55 



Arg Arg Asp Met Val Ala Asp Phe Gin Asn Arg Asn Asp He Phe Val 
15 10 15 

Phe Leu Leu Ser Thr Arg Ala Gly Gly Leu Gly He Asn Leu Thr Ala 
20 25 30 



Xaa Asp Thr Val His Phe 
60 35 
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(2) INFORMATION FOR SEQ ID NO: 676: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TVPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 676: 

He Phe Tyr Asp Ser Asp Trp Asn Pro Thr Val Asp Gin Gin Ala Met 
15 10 15 

Asp Arg Ala His Arg Leu Gly Gin Thr Lys Gin Val Thr Val Tyr Arg 
20 25 30 

Leu He Cys Lys Gly 
35 



(2) INFORMATION FOR SEQ ID NO: 677: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 677: 

He Phe Tyr Asp Ser Asp Trp Asn Pro Thr Val Asp Gin Gin Ala Met 
15 10 15 

Asp Arg Ala His Arg Leu Gly Gin Thr Lys Gin Val Thr Val Tyr Arg 
20 25 30 

Leu He Cys Lys Gly 
35 



(2) INFORMATICXf FOR SEQ 10 NO: 678: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE PESCRIPTION: SEQ ID NO: 678: 

Arg Leu He Cys Lys Gly Thr He Glu Glu Arg He Leu Gin Arg Ala 
15 10 15 

Lys Glu Lys Ser Glu He Gin Arg Met Val He Ser Gly 
20 25 



(2) INFORMATION FOR SEQ ID NO: 679: 
(i) SEQUENCE CHARACTERISTICS': 
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(A) LENGTH: 364 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 679: 

Met Ser Leu His Gly Lys Arg Lys Glu lie Tyr Lys Tyr Glu Ala Pro 
15 10 15 

Trp Thr Val Tyr Ala Met Asn Trp Ser Val Arg Pro Asp Lys Arg Phe 
20 25 , 30 

Arg Leu Ala Leu Gly Ser Phe Val Glu Glu Tyr Asn Asn Lys Val Gin 
35 40 45 

Leu Val Gly Leu Asp Glu Glu Ser Ser Glu Phe lie Cys Arg Asn Thr 
50 55 60 

Phe Asp His Pro Tyr Pro Thr Thr Lys Leu Met Trp lie Pro Asp Thr 
65 70 75 80 

Lys Gly Val Tyr Pro Asp Leu Leu Ala Thr Ser Gly Asp Tyr Leu Arg 
85 • 90 95 

Val Trp Arg Val Gly Glu Thr Glu Thr Arg Leu Glu Cys Leu Leu Asn 
100 105 110 

Asn Asn Lys Asn .Ser Asp Phe Cys Ala Pro Leu Thr Ser Phe Asp Trp 
115 120 125 

Asn Glu VcU. Asp Pro Tyr Leu Leu Gly Thr Ser Ser lie Asp Thr Thr 
130 135 140 

Cys Thr lie Trp Gly Leu Glu Thr Gly Gin Val Leu Gly Arg Val Asn 
145 150 155 160 

Leu Val Ser Gly His Val Lys Thr Gin Leu lie Ala His Asp Lys Glu 
165 170 175 

Val Tyr Asp He Ala Phe Ser Arg Ala Gly Gly Gly Arg Asp Met Phe 
180 185 190 

Ala Ser Val Gly Ala Asp Gly Ser Val Arg Met Phe Asp Leu Arg His 
195 200 205 

Leu Glu His Ser Thr He He Tyr Glu Asp Pro Gin His His Pro Leu 
210 215 220 

Leu Arg Leu Cys Trp Asn Lys Gin Asp Pro Asn Tyr Leu Ala Thr Met 
225 230 235 240 

Ala Met Asp Gly Met Glu Val Val He Leu Asp Val Arg Val Pro Ala 
245 250 255 

His Leu Xaa Pro Gly Thr Thr He Glu His Val Ser Met Ala Leu Leu 
260 265 270 

Gly Pro His He His Pro Ala Thr Ser Ala Leu Gin Arg Met Thr Thr 
275 280 285 



Arg Leu Ser Ser Gly Thr Ser Ser Lys Cys Pro Glu Pro Leu Arg Thr 
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290 295 300 

Leu Ser Trp Pro Thr Gin Leu Xaa Gly Glu lie Asn Asn Val Gin Trp 
305 310 315 320 

Ala Ser Thr Gin Pro Glu Leu Ser Pro Ser Ala Thr Thr Thr Ala Trp 
325 330 335 

Arg Tyr Ser Glu Cys Ser Val Gly Gly Ala Val Pro Thr Arg Gin Gly 
340 345 350 

Leu Leu Tyr Phe Leu Pro Leu Pro His Pro Gin Ser 
355 360 



(2) INFORMATION FOR SEQ ID NO: 680; 

<i) SEQTffENCE CHARACTERISTICS: 

(A) LENGTH: 136 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»J: SEQ ID NO: 680: 

Met Ser Leu His Gly Lys Arg Lys Glu lie Tyr Lys Tyr Glu Ala Pro 
15 10 15 

Trp Thr Val Tyr Ala Met Asn Trp Ser Val Arg Pro Asp Lys Arg Phe 
20 25 30 

Arg Leu Ala Leu Gly Ser Phe Val Glu Glu Tyr Asn Asn Lys Val Gin 
35 . 40 45 

Leu Val Gly Leu Asp Glu Glu Ser Ser Glu Phe lie Cys Arg Asn Thr 
50 55 60 

Phe Asp His Pro Tyr Pro Tbr Thr Lys Leu Mtet Trp lie Pro Asp Thr 
65 70 75 80 

Lys Gly Val Tyr Pro Asp Leu Leu Ala Thr Ser Gly Asp Tyr Leu Arg 
85 90 95 

Val Trp Arg Val Gly Glu Thr Glu Thr Arg Leu Glu Cys Leu Leu Asn 
100 105 110 

Asn Asn Lys Asn Ser Asp Phe Cys Ala Pro Leu Thr Ser Phe Asp Trp 
115 120 125 

Asn Glu Val Asp Pro Tyr Leu Leu 
130 135 



(2) INFORMATION FOR SEQ ID NO: 681: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 amino acids 

(B) TYPE: axnino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 681: 



wo 98/54963 



691 



Ser Phe Asp Trp Asn Glu Val Asp Pro Tyr Leu Leu Gly Thr Ser Ser 
15 10 IS 

He Asp Thr Thr Cys Thr He Trp Gly Leu Glu Thr Gly Gin Val Leu 
20 25 30 

Gly Arg Val Asn Leu Val Ser Gly His Val Lys Thr Gin Leu He Ala 
35 40 45 

His Asp Lys Glu Val Tyr Asp He Ala Phe Ser Arg Ala Gly Gly Gly 
50 55 60 

Arg Asp Met Phe Ala Ser Val Gly Ala Asp Gly Ser Val Arg Met Phe 
65 70 75 80 

Asp Leu Arg His Leu Glu His Ser Thr He He Tyr Glu Asp Pro Gin 
85 90 95 

His His Pro Leu Leu Arg Leu Cys Trp Asn Lys Gin Asp Pro Asn Tyr 
100 105 110 

Leu Ala Thr Met Ala Met Asp Gly Met Glu Val Val He Leu Asp Val 
115 120 125 

Arg Val Pro Ala His Leu Xaa Pro Gly Thr Thr He 
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 682: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 170 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: lineeo: 

(xi)'SEQU^CE ISISCRIPTION: SEQ ID NO: 682: 

Val Gly Ala Asp Gly Ser Val Arg Met Phe Asp Leu Arg His Leu Glu 
15 10 15 

His Ser Thr He He Tyr Glu Asp Pro Gin His His Pro Leu Leu Arg 
20 25 30 

Leu Cys Trp Asn Lys Gin Asp Pro Asn Tyr Leu Ala Thr Met Ala M^t 
35 40 45 

Asp Gly Met Glu Val Val He Leu Asp Val Arg Val Pro Ala His Leu 
50 55 60 

Xaa Pro Gly Thr Thr He Glu His Val Ser Met Ala Leu Leu Gly Pro 
65 70 75 80 

His He His Pro Ala Thr Ser Ala Leu Gin Arg Met Thr Thr Arg Leu 
85 90 95 

Ser Ser Gly Thr Ser Ser Lys Cys Pro Glu Pro Leu Arg Thr Leu Ser 
100 105 HO 



Trp Pro Thr Gin Leu Xaa Gly Glu He Asn Asn Val Gin Trp Ala Ser 
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115 120 125 

Thr Gin Pro Glu Leu Ser Pro Ser Ala Thr Thr Thr Ala Trp Arg Tyr 
130 135 140 

Ser Glu Cys Ser Val Gly Gly Ala Val Pro Thr Arg Gin Gly Leu Leu 
145 150 155 160 

Tyr Phe Leu Pro Leu Pro His Pro Gin Ser 
165 170 



(2) INFORMATiai FOR SEQ ID NO: 683: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 683: 

Leu Tyr Ala Thr Ala Thr Val lie Ser Ser Pro Ser Thr Glu Xaa Leu 
1 5 . 10 15 

Ser Gin Asp Gin Gly Asp Arg Ala Ser Leu Asp Ala Ala Asp Ser Gly 
20 25 30 

Arg Gly Ser Trp Thr Ser Cys Ser Ser Gly Ser His Asp Asn He Gin 
35 40 45 

Thr He Gin His Gin Arg Ser Trp Glu Thr Leu Pro Phe Gly His Thr 
50 55 60 

His Phe Asp Tyr Ser Gly Asp Pro Ala Gly Leu Trp Ala Ser Ser Ser 
65 . 70 75 80 

His Met Asp Gin He Met Phe Ser Asp His Ser Thr Lys Tyr Asn Arg 
85 ' 90 95 

Gin Asn Gin Ser Arg Glu Ser Leu Glu Gin Ala Gin Ser Arg Ala Ser 
100 105 110 

Trp Ala Ser Ser Thr Gly Tyr Trp Gly Glu Asp Ser Glu Gly Asp Thr 
115 120 125 

Gly Thr He Lys Arg Arg Gly Gly Lys Asp Val Ser He Glu Ala Glu 
130 135 140 

Ser Ser Ser Leu Thr Ser Val Thr Thr Glu Glu Thr Lys Pro Val Pro 
145 150 155 160 

Met Pro Ala His He Ala Val Ala Ser Ser Thr Thr Lys Gly Leu He 
165 170 175 

Ala Arg Lys Glu Gly Arg Tyr Arg Glu Pro Pro Pro Thr Pro Pro Gly 
180 185 190 

Tyr He Gly He Pro He Thr Asp Phe Pro Glu Gly His Ser His Pro 
195 200 205 
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Ala Arg Lys Pro Pro Asp Tyr Asn Val Ala Leu Gin Arg Ser Arg Met 
210 215 220 

Val Ala Arg Ser Ser Asp Thr Ala Gly Pro Ser Ser Val Gin Gin Pro 
225 230 235 240 

His Gly His Pro Thr Ser Ser Arg Pro Val Asn Lys Pro Gin Trp His 
245 250 255 

Lys Xaa Asn Glu Ser Asp Pro Arg Leu Ala Pro Tyr Gin Ser Gin Gly 
260 265 270 

Phe Ser Thr Glu Glu Asp Glu Asp Glu Gin Val Ser -Ala Val 
275 280 285 



(2) INFORMATION FOR SEQ ID NO: 684: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 684: 

His Met Asp Gin lie Met Phe Ser Asp His Ser Thr Lys Tyr Asn Arg 
15 10 15 

Gin Asn Gin Ser Arg Glu Ser Leu Glu Gin Ala Gin Ser Arg Ala Ser 
20 25 30 

Trp Ala Ser Ser Thr Gly Tyr Trp Gly Glu 
35 * 40 



(2) INFORMATION FOR SEQ IQ NO: 685: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 685: 

Ser Val Thr Thr Glu Glu Thr Lys Pro Val Pro Met Pro Ala His He 
1 5 10 15 • 

Ala Val Ala Ser Ser Thr Thr Lys Gly Leu He Ala Arg Lys Glu Gly 
20 25 30 

Arg Tyr Arg Glu Pro Pro Pro Thr Pro Pro Gly Tyr He Gly He Pro 
35 40 45 

He Thr Asp 
50 



(2) INFORMATION FOR SEQ ID NO: 686: 
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(A"; ZZ^TTTr.: 57 aniino acids 
(Bi T/PE; aninc acid 
{::) TOPCICG-/: iir.ear 
(xi) S2QUH::CZ rZSCPTpr^cii: SEQ id NO: 686: 

Val Ala Leu Gin Arg Ser Ajrg l-ler Val Ala Arg Ser Ser Asp Thr Ala 
15 10 15 

31y Pro Ser Ser Val Glr. Gin Pro His Gly His Pro Thr Ser Ser Arg 
20 . 25 30 

Pro Val A^r. Lys Pro Gin Trp Kis Lys Xaa Asn Glu Ser Asp Pro Arg 
35 ij 45 

Leu Ala Pro Tyr Gin Ser Gin Gly Phe 
50 55 



(2) I^IFORI'GTICN FOR SSQ ZD NO: 687: 

(i) S22feX3 O-^JJACTEHISTICS: 

(A.) LrNGTH: 41 andno acids 

(B) TYPE: amnc acid 

iD) TOPCICG7: linear 
(xi) SEQUH:TCE DESCrJiPTICM: SEQ ID NO: 687: 

Cys Leu Leu Phe Val Phe Val Ser Leu Gly Met Arg Cys Leu Phe Trp 
15 10 15 

Thr lis Val Tyr A^. Val Leu Tyr Leu Lys His Lys Cys Asn Thr Val 
20 25 30 

Leu Leu Cys Tyr His Leu Cys Ser lie 
35 40 



(2) XKFOHZ^iaN FOR SEQ ID NC: 688: 

(i) SEQL^ICE CK5-3AJCTEP-STICS: 

(A) LENGTH: 67 ainino acids 

(3) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 688: • 

Ala Cys Ser Lys Leu lie Pro A^a Phe Glu Met Val Met Arg Ala Lys 
15 10 15 

Asp Asn Val Tyr His Leu Asp Cys Phe Ala Cys Gin Leu Cys Asn Gin 
20 25 . 30 

Arg Xsia Cys Val Gly Asp Lys Phe Phe Leu Lys Asn Asn Xaa Xaa Leu 
35 40 45 

Cys Gin Thr Asp Tyr Glu Glu Gly Leu Met Lys Glu Gly Tyr Ala Pro 
50 55 60 



Xaa Val Arg 
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65 



(2) INFORMATION FOR SEQ ID NO: 689: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 689: 

Ser Ala Leu Ser Glu Pro Gly Ala Pro Asp Arg Arg Arg Pro Cys Pro 
1 5 10 15 

Glu Ser Val Pro Arg Arg Pro Asp Asp Glu Gin Trp Pro Pro Pro Thr 
20 25 . 30 

Ala Leu Cys Leu Asp Val Ala Pro Leu Pro Pro Ser Ser 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 690: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 690: 

Pro Val Gly Tyr Leu Asp Lys Gin Val Pro Asp Thr Ser Val Gin Glu 
15 10 15 

Thr Asp Arg lie Leu Val Glu Lys Arg Cys Trp Asp lie Ala Leu Gly 
20 25 30 

Pro Leu Lys Gin lie Pro Met Asn Leu Phe He 
35 40 



<2) INFORMATION FOR SEQ ID NO: 691: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 214 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 691: 

Ala His Ala Ser Glu Ser Gly Glu Arg Trp Trp Ala Cys Cys Gly Val 
15 10 . 15 

Arg Phe Gly Leu Arg Ser He Glu Ala He Gly Arg Ser Cys Cys His 
20 25 30 

Asp Gly Pro Gly Gly Leu Val Ala Asn Arg Gly Arg Arg Phe Lys Trp 
35 40 45 

Ala He Glu Leu Ser Gly Pro Gly Gly Gly Ser Arg Gly Arg Ser Asp 
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50 55 60 

Arg Gly Ser Gly Gin Gly Asp Ser Leu Tyr Pro Val Gly Tyr Leu Asp 
65 70 75 80 

Lys Gin Val Pro Asp Thr Ser Val Gin Glu Thr Asp Arg He Leu Val 
85 90 95 

Glu Lys Arg Cys Trp Asp He Ala Leu Gly Pro Leu Lys Gin He Pro 
100 105 110 

Met Asn Leu Phe He Met Tyr Met Ala Gly Asn Thr He Ser He Phe 
115 120 125 

Pro Thr Met Met Val Cys Met Met Ala Trp Arg Pro He Gin Ala Leu 
130 135 140 

Met Ala He Ser Ala Thr Phe Lys Met Leu Glu Ser Ser Ser Gin Lys 
145 150 155 160 

Phe Leu Gin Gly Leu Val Tyr Leu He Gly Asn Leu Met Gly Leu Ala 
165 170 175 

Leu Ala Val Tyr Lys Cys Gin Ser Met Gly Leu Leu Pro Thr His kla 
180 185 190 

Ser Asp Trp Leu Ala Phe He Glu Pro Pro Glu Arg Met Glu Phe Ser 
195 200 205 

Gly Gly Gly Leu Leu Leu 
210 



(2) INFORMATI€»I FOR SEQ ID NO: 692: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 692: - 

Ala Thr Phe Lys Met Leu Glu Ser Ser Ser Gin Lys Phe Leu Gin Gly 
1 5 10 15 

Leu Val Tyr Leu He Gly Asn. Leu Met Gly Leu Ala Leu Ala Val Tyr 
20 25 30 

Lys Cys Gin Ser Met Gly Leu Leu Pro Thr His Ala Ser Asp 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 693: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LQIGTH: 43 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 693: 
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Pro Val Gly Tyr Leu Asp Lys Gin Val Pro Asp Thr Ser Val Gin Glu 
IS 10 15 

Thr Asp Arg He Leu Val Glu Lys Arg Cys Trp Asp He Ala Leu Gly 
20 .25 30 

Pro Leu Lys Gin He Pro Met Asn Leu Phe He 
35 40 



(2) INFORMATION FOR SEQ ID NO: 694: 

|i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

IB) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 694: 

Pro Thr Thr Lys Leu Asp He Met Glu Lys Lys Lys His He Gin He 
1 5 10 15 

Arg Phe Pro Ser Phe Tyr His Lys Leu Val Asp Ser Gly Arg Met Arg 
20 25 30 

Ser Lys Arg Glu Thr Arg Arg Glu Asp Ser Asp Thr Lys His Asn Leu 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 695: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 695: 

Thr Glu His He He Ala Val Met He Thr Glu Leu Arg Gly Lys Asp 
IS 10 15 

He Leu Ser Tyr Leu Glu Lys Asn He Ser Val Gin Met Thr He Ala 
20 25 30 

Val Gly Thr Arg Met Pro Pro Lys Asn Phe Ser Arg Gly Ser Leu Val 
35 40 45 

Phe Val Ser He Ser Phe He Val Leu Met He He Ser Ser Ala Trp 
50 55 60 

Leu He Phe Tyr Phe He Gin Lys He Arg Tyr Thr Asn Ala Arg Asp 
65 * 70 75 80 

Arg Asn Gin Arg Arg Leu Gly Asp Ala Ala Lys Lys Ala He Ser Lys 
85 90 95 
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Leu Thr Thr Arg Thr Val Lys Lys Gly Asp Lys Glu Thr Asp Pro Asp 
100 105 110 

Phe Asp His Cys Ala Val Cys He Glu Ser Tyr Lys Gin Asn Asp Val 
115 120 125 

Val Arg He Leu Pro Cys Lys His Val Phe His Lys Ser Cys Val Asp 
130 135 140 

Pro Trp Leu Ser Glu His Cys Thr Cys Pro Met Cys Lys Leu Asn He 
145 150 155 160 

Leu Lys Ala Leu Gly He Val 
165 



(2) INFORMATION FOR SEQ ID NO: 696: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 696: 

Met Thr His Pro Gly Thr Glu His He He Ala Val Met He Thr Glu 
15 10 15 

Leu Arg Gly Lys Asp He Leu Ser Tyr Leu Glu Lys Asn He Ser Val 
20 25 30 

Gin Met Thr He Ala Val Gly Thr Arg Met Pro Pro Lys Asn Phe Ser 
35 40 45 

Arg Gly Ser Leu Val Phe Val Ser He Ser Phe He Val Leu Met He 
50 55 60 

He Ser Ser Ala Trp Leu He Phe Tyr Phe He Gin Lys He Arg Tyr 
65 70 75 80 

Thr Asn Ala Arg Asp Arg Asn Gin Arg Arg Leu Gly Asp Ala Ala Lys 
85 90 95 

Lys Ala He Ser Lys Leu Thr Thr Arg Thr Val Lys Lys Gly Asp Lys 
100 105 110 

Glu Thr Asp Pro Asp Phe Asp His Cys Ala Val Cys He Glu Ser Tyr 
115 120 125 

Lys Gin Asn Asp Val Val Arg He Leu Pro Cys Lys His Val Phe His 
130 135 140 

Lys Ser Cys Val Asp Pro Trp Leu Ser Glu His Cys Thr Cys Pro Met 
145 150 155 liSO 

Cys Lys Leu Asn He Leu Lys Ala Leu Gly He Val Pro Asn Leu Pro 
165 170 175 

Cys Thr Asp Asn Val Ala Phe Asp Met Glu Arg Leu Thr Arg Thr Gin 
180 185 190 



wo 98/54963 



699 



Ala Val Asn Arg Arg Ser Ala Leu Gly Asp Leu Ala Gly Asp. Asn Ser 
195 200 205 

Leu Gly Leu Glu Pro Leu Arg Thr Ser Gly He Ser Pro Leu Pro Gin 
210 215 220 

Asp Gly Glu Leu Thr Pro Arg Thr Gly Glu He Asn He Ala Val Thr 
225 230 235 240 

Lys Glu Trp Phe He He Ala Ser Phe Gly Leu Leu Ser Ala Leu Thr 
245 250 255 

Leu Cys Tyr Met He He Arg Ala Thr Ala Ser Leu Asn Ala Asn Glu 
260- 265 270 

Val Glu Trp Phe 
275 



(2) INFORMATION FOR SEQ ID NO: 697: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) lYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 697: 

Thr Glu His He He Ala Val Met He Thr Glu Leu Arg Gly Lys Asp 

15 10 . 15 

He Leu Ser Tyr Leu Glu Lys Asn He Ser Val Gin Met Thr He Ala 
20 25 30 

Val Gly Thr Arg Met Pro Pro Lys Asn Phe Ser Arg Gly Ser Leu Val 
35 .40 45 

Phe Val Ser He Ser Phe He Val Leu Met He He Ser Ser Ala Trp 
50 55 60 

Leu He Phe Tyr Phe 
65 



(2) INFORMATION FOR SEQ ID NO: 698: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 antino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 698: 

Ser He Ser Phe He Val Leu Met He He Ser Ser Ala Trp Leu He 
15 10 15 

Phe Tyr Phe He Gin Lys He Arg Tyr Thr Asn Ala Arg Asp Arg Asn 
20 25 30 
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Gin Arg Arg Leu Gly Asp Ala Ala Lys Lys Ala lie Ser Lys Leu Thr 
35 40 45 

Thr Arg Thr Val Lys Lys Gly Asp Lys Glu 
50 55 



(2) INFORMATION FOR SEQ ID NO: 699: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 699: 

Val Lys Lys Gly Asp Lys Glu Thr Asp Pro Asp Phe Asp His Cys Ala 
1 5 10 .15 

Val Cys lie Glu Ser Tyr Lys Gin Asn Asp Val Val Arg lie Leu Pro 
20 25 30 

Cys Lys His Val Phe His Lys Ser Cys Val Asp Pro Trp Leu Ser Glu 
35 40 45 

His Cys Thr Cys Pro Met Cys Lys Leu Asn lie Leu Lys Ala Leu Gly 
50 55 60 

lie Val 
65 



<2) INFORMATION FOR SEQ ID NO: 700: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:^ 106 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 700: 

Met Thr His Pro Gly Thr Glu His lie He Ala Val Met He Thr Glu 
1 5 10 15 

Leu Arg Gly Lys Asp He Leu Ser Tyr Leu Glu Lys Asn He Ser Val 
20 25 30 

Gin Met Thr He Ala Val Gly Thr Arg Met Pro Pro Lys Asn Phe Ser 
35 40 45 

Arg Gly Ser Leu Val Phe Val Ser He Ser Phe He Val Leu Met He 
SO 55 60 

lie Ser Ser Ala Tip Leu He Phe Tyr Phe He Gin Lys He Arg Tyr 
65 70 75 80 

Thr Asn Ala Arg Asp Arg Asn Gin Arg Arg Leu Gly Asp Ala Ala Lys 
85 90 95 

Lys Ala He Ser Lys Leu Thr Thr Arg Thr 
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100 105 



(2) INFORMATION FOR SEQ ID NO: 701: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 701: 

Ala Ala Lys Lys Ala He Ser Lys Leu Thr Thr Arg Thr Val Lys Lys 
1 5 10 . 15 

Gly Asp Lys Glu Thr Asp Pro Asp Phe Asp His Cys Ala Val Cys He 
20 25 30 

Glu Ser Tyr Lys Gin Asn Asp Val Val Arg He Leu Pro Cys Lys His 
35 40 45 

Val Phe His Lys Ser Cys Val Asp Pro Trp Leu Ser Glu His Cys Thr 
50 55 60 

Cys Pro Met Cys Lys Leu Asn He Leu Lys Ala Leu Gly He Val Pro 
65 70 . 75 80 

Asn Leu Pro Cys 



(2) INFORMATION FOR SEQ ID NO: 702: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 702: 

Thr Gin Ala Val Asn Arg Arg Ser Ala Leu Gly Asp Leu Ala Gly Asp 
1. 5 10 15 

Asn Ser Leu Gly Leu Glu Pro Leu Arg Thr Ser Gly He Ser Pro Leu 
20 25 30 

Pro Gin Asp Gly Glu Leu Thr Pro Arg Thr Gly Glu He Asn He Ala 
,35 40 45 

Val Thr Lys Glu Trp Phe He He Ala Ser Phe Gly Leu Leu Ser Ala 
50 55 60 

Leu Thr Leu Cys Tyr Met He He Arg Ala Thr Ala Ser Leu Asn Ala 
65 70 75 80 



Asn Glu Val Glu Trp Phe 
85 
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(A) LSJGTH: 341 aniinc acids 
(3) lYrH: 2rir.3 acid 
r3) TCPCLCGY: linear 
:xi. SiQuSXS 3ESC?-?riC;;: SZQ ZZ ITZz 703: 



rro I«u His Giy Val Ala Asp His Leu 3iy Ci.-s Asp Pro Gin Thr Arg 
IS IZ 15 

?:-e ?he Val Pro Pro Asn lie lys Gin ?r? He Ala Leu Leu Gin Arg 
23 25 30 

Sly A-sr. Cys Thr Pie Lys Glu Lys lie Ser Arg AJla Ala ?he His Asn 
35 40 45 

Ala Val Ala Val Val He Tyr Asn Asn Lys Ser Lys Glu Glu Pro Val 
5j 55 50 

Tex Ke^ Thr His Pro Gly Thr Glu His He He Ala Val Met He Thr 
s5 79 75. 80 

31u Leu ?izg Gly Lys Asp He leu Ser Tyr Leu Glu Lys Asn He Ser 
55 9'Z 95 

Val Glr. Me^ Thr He A-a Val 31y Thr Arg Y.ez Pro Pro Lys Asn Phe 
IOC 105 110 

Ser Arg Gly Ser L«u Val Phe Val Ser He Ser Phe He Val Leu Met 
11= 120 125 

He He Ser Ser Ala Trp Leu He Phe Tyr Phe He Gin Lys He Arg 
13: 135 140 

Zj— Thr Asn Aia Ajrg Asp Ajrg Asn Gin Arg Arg Leu Giy Asp Ala Ala 
145 ISO 153 160 

lys Lys Ala He Ser Lys Leu Thr Thr Arg Thr Val Lys Lys Gly Asp 
155 171 175 

lys Glu Thr Asp Pro Asp Phe Asp His Cys Ala Val Cys He Glu Ser 
180 185 190 

Tyr Lys Gin Asn A^ Val Val Arg He Leu Pro Cys Lys His Val Phe 
13= ZOO 205 

His Lys Ser Cys Val Jisp Pro Trp Leu Ser Glu His Cys Thr Cys Pro 
210 215 220 

Meu Cys Lys Leu Asn He Leu Lys Ala Leu Gly He Val Pro Asn Leu 
225 230 225 240 

Pre Cys Thr Asp A-sn Val Ala Phe Asp Mer Glu Arg Leu Thr Arg Thr 
245 250 255 

Gin Ala Val Asn Ajrg Ajrg Ser Ala Leu Gly Asp Leu Ala Gly Asp Asn 
260 265 270 

Ser Leu Gly Glu Pro Leu Arg Thr Ser Gly He Ser Pro Leu Pro 
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275 280 285 

Gin Asp Gly Glu Leu Thr Pro Arg Thr Gly Glu lie Asn lie Ala Val 
290 295 300 

Thr Lys Glu Trp Phe lie lie Ala Ser Phe Gly Leu Leu Ser Ala Leu 
305 310 315 320 

Thr Leu Cys Tyr Met lie lie Arg Ala Thr Ala Ser Leu Asn Ala Asn 
325 330 335 

Glu. Val Glu Trp Phe 
340 



(2) INFORMATIC»3 FOR SEQ ID NO: 704: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTICM^ : SEQ ID NO: 704: . 

His Gly Val Ala Asp His Leu Gly Cys Asp Pro Gin Thr Arg Phe Phe 
1 5 10 15 

Val Pro Pro Asn lie Lys Gin Trp lie Ala Leu Leu Gin Arg Gly Asn 

20 25 30 

Cys Thr Phe Lys Glu Lys lie Ser Arg Ala Ala Phe His Asn Ala Val 
35 40 45 

Ala Val Val lie Tyr Asn Asn Lys Ser Lys Glu Glu 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 705: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»J: SEQ ID NO: 705 :x 

Met Ser Gly Gin Gly I.eu Ala Gly Phe Phe Ala Ser Val Ala Met He 
15 10 15 

Cys Ala He Ala Ser Gly Ser Glu Leu Ser Glu Ser Ala Phe Gly Tyr 
20 25 30 

Phe He Thr Ala Cys Ala Val He He Leu Thr He He Cys Tyr Leu 
35 40 45 

Gly Leu Pro Arg Leu Glu Phe Tyr Arg Tyr Tyr Gin Gin Leu Lys Leu 
50 55 60 

Glu Gly Pro Gly Glu Gin Glu Thr Lys Leu Asp Leu He Ser Lys Gly 
65 ' 70 75 80 
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Glu Glu Pro Arg Ala Gly Lys Glu Glu Ser Gly Val Ser Val Ser Asn 
85 90 95 

Ser Gin Pro Thr Asn Glu Ser His Ser lie Lys Ala He Leu Lys Asn 
100 105 110 

He Ser Val Leu Ala Phe Ser Val Cys Phe He Phe Thr He Thr He 
115 120 125 

Gly Met Phe Pro Ala Val Thr Val Glu Val Lys Ser Ser He Ala Gly 
130 135 140 

Ser Ser Thr Trp Glu Arg Tyr Phe He Pro Val Ser Cys Phe Leu Thr 
145 150 155 160 

Phe Asn He Phe Asp Trp Leu Gly Arg Ser Leu Thr Ala Val Phe Met 
165 170 175 

Trp Pro Gly Lys Asp Ser Arg Trp Leu Pro Ser Trp Xaa Leu Ala Arg 
180 185 190 

Leu Val Phe Val Pro Leu Leu Leu Leu Cys Asn He Lys Pro Arg Arg 
195 200 205 

Tyr Leu Thr Val Val Phe Glu His Asp Ala Trp Phe He Phe Phe Met 
210 215 220 

Ala Ala Phe Ala Phe Ser Asn Gly Tyr Leu Ala Ser Leu Cys Mfet Cys 
225 230 235 240 

Phe Gly Pro Lys Lys Val Lys Pro Ala Glu Ala Glu Thr Ala Glu Pro 
245 250 255 

Ser Trp Pro Ser Ser Cys Val Trp Val Trp His Trp Gly Leu Phe Ser 
260 265 270 

Pro Ser Cys Ser Gly Gin Leu Cys Asp Lys Gly Trp Thr Glu Gly Leu 
.275 280 285 

Pro Ala Ser Leu Pro Val Cys Leu Leu Pro Leu Pro Ser Ala Arg Gly 
290 295 300 



Asp Pro Glu Trp Ser Gly Gly Phe Phe Phe 
305 310 



(2) INFOPMATXQN FOR SEQ ID NO: 706: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LE2SGTH: 106 amino acids 

(B) T!rPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 706: 

Met Ser Gly Gin Gly Leu Ala Gly Phe Phe Ala Ser Val Ala Met He 
1 5 . 10 15 . 

Cys Ala He Ala Ser Gly Ser Glu Leu Ser Glu Ser Ala Phe Gly Tyr 
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20 



25 



30 



Phe He Thr Ala Cys Ala Val He He Leu Thr He He Cys Tyr Leu 
35 40 45 

Gly Leu Pro Arg Leu Glu Phe Tyr Arg Tyr Tyr Gin Gin Leu Lys Leu 
50 55 60 

Glu Gly Pro Gly Glu Gin Glu Thr Lys Leu Asp Leu He Ser Lys Gly 
65 70 75 80 

Glu Glu Pro Arg Ala Gly Lys Glu Glu Ser Gly Val Ser Val Ser Asn 
85 90 95 



Ser Gin Pro Thr Asn Glu Ser His Ser He 
100 105 



(2) INFORMATION FOR SEQ ID NO: 707: 

{i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acidbs 

(B) TYPE: amino acid 
(D) T0P0LCX5Y: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 707: 

Ser Gly Val Ser Val Ser Asn Ser Gin Pro Thr Asn Glu Ser His Ser 
15 10 15 

He Lys Ala He Leu Lys Asn He Ser Val Leu Ala Phe Ser Val Cys 
20 25 30 

Phe He Phe Thr He Thr He Gly Met Phe Pro Ala Val Thr Val Glu 
35 40 45 

Val Lys Ser Ser He Ala Gly Ser Ser Thr Trp Glu Arg Tyr Phe He 
50 55 60 

Pro Val Ser Cyis Phe Leu Thr Phe Asn He Phe Asp Trp Leu Gly Arg 
65 70 75 80 

Ser 



(2) INFORMATION FOR SEQ ID NO: 708: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acicJs 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 708: 

Thr He Gly Met Phe Pro Ala Val Thr Val Glu Val Lys Ser Ser He 
15 10 15 

Ala Gly Ser Ser Thr Trp Glu Arg Tyr Phe He Pro Val Ser Cys Phe 
20 25 30 
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Leu Thr Phe Asn lie Phe Asp Trp Leu Gly Arg Ser Leu Thr Ala Val 
35 40 45 

Phe Met Trp Pro Gly Lys Asp Ser Arg Trp Leu Pro Ser Trp Xaa Leu 
50 55 60 

Ala Arg Leu Val Phe Val Pro Leu Leu Leu Leu Cys Asn He Lys Pro 
65 70 75 80 

Arg Arg Tyr Leu Thr Val Val Phe Glu His Asp Ala 
85 90 



(2) INFOHMATION FOR SEQ ID NO: 709: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 709: 

Phe Gly Pro Lys Lys Val Lys Pro Ala Glu Ala Glu Thr Ala Glu Pro 
1 5 10 15 

Ser Trp Pro Ser Ser Cys val Trp Val Trp His Trp Gly Leu Phe Ser 
20 25 30 

Pro Ser Cys Ser Gly Gin Leu Cys Asp Lys Gly Trp Thr Glu Gly Leu 
35 40 45 

Pro Ala Ser Leu Pro Val Cys Leu Leu Pro Leu Pro Ser Ala Arg Gly 
50 55, 60 

Asp Pro Glu Trp Ser Gly Gly Phe Phe Phe 
65 70 • 



(2) INFORMATION FOR SEQ ID NO: 710: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 710: 

Asp Asp Asp Gly Phe Glu He Val Pro He Glu Asp Pro Ala Lys His 
1 5 10 15 

Arg He Leu Asp Pro Glu Gly Leu Ala Leu Gly Ala Val He Ala Ser 
20 25 30 

Ser Lys Lys Ala Lys Arg Asp Leu He Asp Asn Ser Phe Asn Arg Tyr 
35 40 45 

Thr Phe Asn Glu Asp Glu Gly Glu Leu Pro Glu Trp Phe Val Gin Glu 
50 55 60 
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Glu Lys Gin His Arg lie Arg Gin Leu Pro Val Gly Lys Lys Glu Val 
65 70 75 80 

Glu His Tyr Arg Lys Arg Tip Arg. Glu lie Asn Ala Arg Pro He Xaa 

85 90 95 

• *• 

Xaa Xaia Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xeia Xaa Xaa Xaa Xaa 
100 105 110 

Leu Glu Gin Thr Arg Lys Lys Ala Glu Ala Val Val Asn Thr Val Asp 
115 120 125 

He Xcia Arg Thr Arg Glu Ser 
130 135 



(2) INFORMATION FOR SEQ ID NO: 711: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TVFE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 711: 

Asp Asp Asp Gly Phe Glu He Val Pro He Glu Asp Pro Ala Lys His 
15 10 15 

Arg He Leu Asp Pro Glu Gly Leu Ala Leu Gly Ala Val He Ala Ser 
20 25 30 

Ser Lys Lys Ala Lys Arg Asp Leu He Asp Asn Ser Phe Asn Arg Tyr 
35 40 45 

Thr Phe 
50 



(2) INFORjMATION FOR SEQ ID NO: '712: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TV1PE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIOI : SEQ ID NO: 712: 

Lys Arg Trp Arg Glu He Asn Ala Arg Pro He Xaa Xaa Xaa Xaa Xaa 
1 5 10 15 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Glu Gin Thr 
20 25 30 

Arg Lys Lys Ala Glu Ala Val Val Asn Thr Val Asp He Xaa Arg Thr 
35 40 45 

Arg Glu Ser 
50 
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(2) IMFORMATIGN FOR SEQ ID NO: 713: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 216 amino acicSs 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 713: 

Met lie Lys Asp Lys Gly Arg Ala Arg Thr Ala Leu Thr Ser Ser Gin 
1 5 10 15 

Pro Ala His Leu Cys Pro Glu Asn Pro Leu Leu His Leu Lys Ala Ala 
20 25 30 

Val Lys Glu Lys Lys Arg Asn Lys Lys Lys Lys Thr lie Gly Ser Pro 
35 . 40 45 

Lys Arg lie Gin Ser Pro Leu Asn Asn Lys Leu Leu Asn Ser Pro Ala 
50 55 60 

Lys Thr Leu Pro Gly Ala Cys Gly Ser Pro Gin Lys Leu He Asp Gly 
65 70 75 80 

Phe Leu Lys His Glu Gly Pro Pro Ala Glu Lys Pro Leu Glu Glu Leu 
85 90 95 

Ser Ala Ser Thr Ser Gly Val Pro Gly Leu Ser Ser Leu Gin Ser Asp 
100 105 110 

Pro Ala Gly Cys Val Arg Pro Pro Ala Pro Asn Leu Ala Gly Ala Val 
115 120 125 

Glu Phe Asn Asp Val Lys Thr Leu Leu Arg Glu Trp He Thr Thr He 
130 135 140 

Ser Asp Pro Met Glu Glu Asp He Leu Gin Val Val Lys Tyr Cys Thr 
145 . 150 * 155 160 

Asp Leu He Glu Glu Lys Asp Leu Glu Lys Leu Asp Leu Val He Lys 
165 170 175 

Tyr Met Lys Arg Leu Met Gin Gin Ser Val Glu Ser Val Trp Asn Met 
180 185 190 

Ala Phe Asp Phe He Leu Asp Asn Val Gin Val Val Leu Gin Gin Thr 
195 . 200 . 205 

Tyr Gly Ser Thr Leu Lys Val Thr 
210 215 



(2) INFORMATION FOR SEQ ID NO: 714: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 52 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 714: 
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Met lie Lys Asp Lys Gly Arg Ala Arg Thr Ala Leu Thr Ser Ser Gin 
1 5 10 15 • 

Pro Ala His Leu Cys Pro Glu Asn Pro Leu Leu His Leu Lys Ala Ala 
20 25 30 

Val Lys Glu Lys Lys Arg Asn Lys Lys Lys Lys Thr lie Gly Ser Pro 
35 40 45 

Lys Arg lie Gin 
50 



(2) INFORMATION FOR SEQ ID NO: 715: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 715: 

Lys Arg lie Gin Ser Pro Leu Asn Asn Lys Leu Leu Asn Ser Pro Ala 
1 5 10 15 ' 

Lys Thr Leu Pro Gly Ala Cys Gly Ser Pro Gin Lys Leu lie Asp Gly 
20 25 30 

Phe Leu Lys His Glu Gly Pro Pro Ala Glu Lys Pro Leu Glu Glu Leu 
35 40 45 

Ser Ala Ser Thr Ser Gly Val Pro Gly Leu Ser Ser Leu Gin Ser Asp " 
50 55 60 

Pro Ala Gly Cys Val Arg Pro Pro Ala Pro Asn Leu Ala Gly Ala Val 
65 70 75 80 

Glu Phe Asn Asp Val Lys Thr Leu Leu Arg Glu Trp lie Thr Thr lie 
85 90 95 

Ser Asp Pro Met 
100 



(2) INFORMATION FOR SEQ 10 NO: 716: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEC^JENCE DESCRIPTION: SEQ ID NO: 716: 

Thr He Ser Asp Pro Met Glu Glu Asp He Leu Gin Val Val Lys Tyr 
1 5 10 15 



Cys Thr Asp Leu He Glu Glu Lys Asp Leu Glu Lys Leu Asp Leu Val 
20 25 30 
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He Lys Tyr Met Lys Arg Leu Met Gin Gin Ser Val Glu Ser Val Trp 
35 40 45 

Asn Met Ala Phe Asp Phe He Leu Asp Asn Val Gin Val Val Leu Gin 
50 55 ' 60 

Gin Thr Tyr Gly Ser Thr Leu Lys Val Thr 
65 70 



(2) INFORMATION FOR SEQ ID NO: 717: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 717: 

Phe Cys His Asp Cys Lys Phe E>ro Glu Ala Ser Pro Ala Met Asn Cys 
15 10 15 

Glu Pro 



(2) INFORMATION FOR SEQ ID NO: 718: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 718: 

Phe Cys His Asp Cys Lys Phe Pro Glu Ala Ser Pro Ala Met Asn Cys 
15 10 15 

Glu Pro 



(2) INFORMATION FOR SEQ ID NO: 719: 

(i) SEQUENCE CHARACTERISTICS: 
V (A) LmnH: 27 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 719: 

Pro Gin Pro Ser Asn Phe Pro Thr Thr Val Arg Asn Leu Pro Tyr Ser 
15 10 15 

Gly Ala Gly Ala Gin Pro Pro Pro Ser Asn Cys 
20 25 



(2) INFORMATION FOR SEQ ID NO: 720: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 720: 

Met Ala Ser Ser Val Pro Ala Gly Gly His Thr Arg Ala Gly Gly He 
15 10 15 

Phe Leu He Gly Lys Leu Asp Leu Glu Ala Ser Leu Phe Lys Ser Phe 
20 25 30 

Gin Trp Leu Pro Phe Val Leu Arg Lys Lys Cys Asri Phe Phe Cys Trp 
35 40 45 

Asp Ser Ser Ala His Ser Leu Pro Leu His Pro Leu Ser Ala Ser Cys 
50 55 60 

Ser Ala Pro Ala Cys His Ala Ser Asp Thr His Leu Leu Tyr Pro Ser 
65 70 75 80 

Thr Arg Ala Leu Cys Pro Ser He Phe Ala Trp Leu Val Ala Pro His 
85 90 95 

Ser Val Phe Arg Thr Asn Ala Pro Gly Pro Thr Pro Ser Ser Gin Ser 
100 . 105 110 

Ser Pro Val Phe Pro Val Phe Pro Val Ser Phe Met Ala Leu He Val 
115. 120 125 

Cys Xaa Leu V2l1 Cys Cys 
130 



(2) INFORMATION FOR SEQ ID NO: 721: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEI^STH: 71 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 721: 

Met Ala Ser Ser Val Pro Ala Gly Gly His Thr Arg Ala Gly Gly He 
1 5 10 15 

Phe Leu He Gly Lys Leu Asp Leu Glu Ala Ser Leu Phe Lys Ser Phe 
20 25 30 

Gin Trp Leu Pro Phe Val Leu Arg Lys Lys Cys Asn Phe Phe Cys Trp 
35 40 45 

Asp Ser Ser Ala His Ser Leu Pro Leu His Pro Leu Ser Ala Ser Cys 
50 55 60 



Ser Ala Pro Ala Cys His Ala 
65 70 
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(2) INFORMATION FOR SEQ ID NO: 722: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 722: 

Phe Ala Trp Leu Val Ala Pro His Ser Val Phe Arg Thr Asn Ala Pro 
1 5 10 15 

Gly Pro Thr Pro Ser Ser Gin Ser Ser Pro Val Phe Pro Val Phe Pro 
20 25 30 

Val Ser Phe Met Ala Leu He Val Cys Xaa Leu Val Cys Cys 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 723: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIOTJ: SEQ ID NO: 723: 

Met Ala Ser Ser Val Pro Ala Gly Gly His Thr Arg Ala Gly Gly He 
1 5 10 .15 

Phe Leu He Gly Lys Leu Asp Leu Glu Ala Ser Leu Phe Lys Ser Phe 
20 25 30 

Gin Trp Leu Pro Phe Val Leu Arg Lys Lys Cys Asn Phe Phe Cys Trp 
35 40 45 

Asp Ser Ser Ala His Ser Leu Pro Leu His Pro Leu Ser Ala Ser Cys 
50 55 60 

Ser Ala Pro Ala Cys His Ala Ser Asp Thr His Leu Leu Tyr Pro Ser 
65 70 75 . 80 

Thr Arg Ala Leu Cys Pro Ser He Phe Ala Trp Leu Val Ala Pro His 
85 90 95 

Ser Val Phe Arg Thr Asn Ala Pro Gly Pro Thr Pro Ser Ser Gin Ser 
100 105 110 

Ser Pro Val Phe Pro Val Phe Pro Val Ser Phe Met Ala Leu He Val 
115 120 125 

Cys Xaa Leu Val Cys Cys 
130 



(2) INFORMATION FOR SEQ ID NO: 724: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 286 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 724: 

5 

Met Ala Met Glu Gly Tyr Trp Arg Phe Leu Ala Leu Leu Gly Ser Ala 
15 10 15 

Leu Leu Val Gly Phe Leu Ser Val He Phe Ala Leu Val Trp Val Leu 
10 .20 25 30 

His Tyr Arg Glu Gly Leu Gly Trp Asp Gly Ser Ala Leu Glu Phe Asn 
35 40 45 

15 Trp His Pro Val Leu Met Val Thr Gly Phe Val Phe He Gin Gly He 
50 55. 60 

Ala He He Val Tyr Arg Leu Pro Trp Thr Trp Lys Cys Ser Lys Leu 
65 70 75 80 

20 

Leu Met Lys Ser He His Ala Gly Leu Asn Ala Val Ala Ala He Leu 
85 90 95 

Ala He lie Ser Val Val Ala Val Phe Glu Asn His Asn Val Asn Asn 
25 100 105 110 

He Ala Asn Met Tyr Ser Leu His Ser Trp Val Gly Leu He Ala Val 
115 120 . 125 

30 He Cys Tyr Leu Leu Gin Leu Leu Ser Gly Phe Ser Val Phe Leu Leu 
130 135 140 

Pro Tip Ala Pro Leu Ser Leu Arg Ala Phe Leu Met Pro He His Val 
145 150 155 160 

35 

Tyr Ser Gly He Val He Phe Gly Thr Val He Ala Thr Ala Leu Met 
165 ■ 170 175 

Gly Leu Thr Glu Lys Leu He Phe Ser Leu Arg Asp Pro Ala Tyr Ser 
40 180 185 190 

Thr Phe Pro Pro Glu Gly Val Phe Val Asn Thr Leu Gly Leu Leu He 

195 200 205 

45 Leu Val Phe Gly Ala Leu He Phe Trp He Val Thr Arg Pro Gin Trp 
210 215 220 

Lys Arg Pro Lys Glu Pro Asn Ser Thr He Leu His Pro Asn Gly Gly 
225 230 235 240 

50 

Thr Glu Gin Gly Ala Arg Gly Ser Met Pro Ala Tyr Ser Gly Asn Asn 
245 250 255 

Met Asp Lys Ser Asp Ser Glu Leu Asn Ser Glu Val Ala Ala Arg Lys 
55 260 265 270 

Arg Asn Leu Ala Leu Asp Glu Ala Gly Gin Arg Ser Thr Met 
275 280 285 

60 
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(2) INFORMATION FOR SEQ ID NO: 725: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acic3s 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 725: 

Pro Gly Arg Ala Gly Pro Ser Pro Gly Leu Ser Leu Gin Leu Pro Ala 
15 10 15 

Glu Pro Gly His Pro Ala Gly Asn Leu Ala Pro Leu Thr Ser Arg Pro 
20 25 30 

Gin Pro Leu Cys Arg lie Pro Ala Val Pro Gly 
35 40 



(2) INFORMATION FOR SEQ ID NO: 726: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 424 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 726: 

Met Lys Leu Leu Gly Glu Cys Ser Ser ser lie Asp Ser Val Lys Arg 
1 5 10 15 

Leu Glu His Lys Leu Lys Glu Glu Glu Glu Ser Leu Pro Gly Phe Val 
20 25 30 

Asn Leu His Ser Thr Glu Thr Gin Thr Ala Gly Val lie Asp Arg Trp 
35 40 45 

Glu Leu Leu Gin Ala Gin Ala Leu Ser Lys Glu Leu Arg Met Lys Gin 
50 55 60 

Asn Leu Gin Lys Trp Gin Gin Phe Asn Ser Asp Leu Asn Ser lie Trp 
65 70 75 80 

Ala Trp Leu Gly Asp Thr Glu Glu Glu Leu Glu Gin Leu Gin Arg Leu 
85 90 95 

Glu Leu Ser Thr Asp He Gin Thr He Glu Leu Gin He Lys Lys Leu 
100 105 110 

Lys Glu Leu Gin Lys Ala Val Asp His Arg Lys Ala He He Leu Ser 
115 120 125 

He Asn Leu Cys Ser Pro Glu Phe Thr Gin Ala Asp Ser Lys Glu Ser 
130 135 140 

Arg Asp Leu Gin Asp Arg Leu Xaa Gin Met Asn Gly Arg Trp Asp Arg 
145 150 155 160 

Val Cys Ser Leu Leu Glu Glu Trp Arg Gly Leu Leu Gin Asp Ala Leii 
165 170 175 
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Met Gin Cys Gin Gly Phe His Glu Met Ser His Gly Leu Leu Leu Met 
180 185 190 

Leu Glu Asn lie Asp Arg Arg Lys Asn.Glu lie Val Pro lie Asp Ser 
195 200 205 

Asn Leu Asp Ala Glu lie Leu Gin Asp His His Lys Gin Leu Met Gin 
210 215 220 

lie Lys His Glu Leu Leu Glu Ser Gin Leu Arg Val Ala Ser Leu Gin 
225 230 235 240 

Asp Met Ser Cys Gin Leu Leu Val Asn Ala Glu Gly Thr Asp Cys Leu 
245 250 255 

Glu Ala Lys Glu Lys Val His Val lie Gly Asn Arg Leu Lys Leu Leu 
260 265 270 

Leu Lys Glu Val Ser Arg His lie Lys Glu Leu Glu Lys Leu Leu Asp 
275 280 285 

Val Ser Ser Ser Gin Gin Asp Leu Ser Ser Trp Ser Ser Ala Asp Glu 
290 295 . 300 

Leu Asp Thr Ser Gly Ser Val Ser Pro Xaa Ser Gly Arg Ser Thr Pro 
305 310 315 320 

Asn Arg Gin Lys Thr Pro Arg Gly Lys Cys Ser Leu Ser Gin Pro Gly 
325 330 335 

Pro Ser Val Ser Ser Pro His Ser Arg Ser Thr Lys Gly Gly Ser Asp 
340 345 350 

Ser Ser Leu Ser Glu Pro Xaa. Pro Gly Arg Ser Gly Arg Gly Phe Leu 
355 360 365 

Phe Arg Val Leu Arg Ala Ala Leu Pro Leu Gin Leu Leu Leu Leu Leu 
370 375 380 

Leu He Gly Leu Ala Cys Leu Val Pro Met Ser Glu Glu Asp Tyr Ser 
385 390 395 400 

Cys Ala Leu Ser Asn Asn Phe Ala Arg Ser Phe His Pro Met Leu Arg 
405 410 415 



Tyr Thr Asn Gly Pro Pro Pro Leu 
420 



(2) INFORMATION FOR SEQ ID NO: 727: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TVPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 727: 

Met Lys Leu Leu Gly Glu Cys Ser Ser Ser He Asp Ser Val Lys Arg 
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15 10 15 

Leu Glu His Lys Leu Lys Glu Glu Glu Glu Ser Leu Pro Gly Phe Val 
20 25 30 

Asn Leu His Ser Thr Glu Thr Gin Thr Ala Gly Val lie Asp Arg Trp 
35 . 40 45 

Glu Leu Leu Gin Ala Gin Ala Leu Ser Lys Glu Leu Arg Met Lys Gin 
50 55 60 



Asn Leu Gin Lys Trp Gin Gin Phe Asn Ser Asp Leu Asn Ser lie Trp 
65 70 75 80 

Ala Trp Leu Gly Asp Thr Glu Glu Glu Leu Glu Gin Leu Gin Arg Leu 
85 90 95 

Glu Leu Ser Thr Asp lie Gin Thr lie Glu Leu Gin lie Lys 
100 105 110 



(2) INFORMATION FOR SEQ ID NO: 728: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 136 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 728: 

Lys Leu Lys Glu Leu Gin Lys Ala Val Asp His Arg Lys Ala lie lie 
1 .5 10 15 

Leu Ser lie Asn Leu Cys Ser Pro Glu Phe Thr Gin Ala Asp Ser Lys 
20 25 30 

Glu Ser Arg Asp Leu Gin Asp Arg Leu Xaa Gin Met Asn Gly Arg Trp 
35 " 40 45 

Asp Arg Vad Cys Ser Leu Leu Glu Glu Trp Arg Gly Leu Leu Gin Asp 
50 55 60 

Ala Leu Met Gin Cys Gin Gly Phe His Glu Met Ser His Gly Leu Leu 
65 70 75 80 

Leu Met Leu Glu Asn lie Asp Arg Arg Lys Asn Glu He Val Pro He 
85 90 95 

Asp Ser Asn Leu Asp Ala Glu He Leu Gin Asp His His Lys Gin Leu 
100 105 110 

Met Gin He Lys His Glu Leu Leu Glu Ser Gin Leu Arg Val Ala Ser 
115 120 125 

Leu Gin Asp M^t Ser Cys Gin Leu 
130 135 



(2) INFORMATIC»J FOR SEQ ID NO: 729: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 729: 

Gin Asp Met Ser Cys Gin Leu Leu Val Asn Ala Glu Gly Thr Asp Cys 
15 10 15 

Leu Glu Ala Lys Glu Lys Val His Val lie Gly Asn Arg Leu Lys Leu 
20 25 30 

Leu Leu Lys Glu Val Ser Arg His lie Lys Glu Leu Glu Lys Leu Leu 
35 40 45 

Asp Val Ser Ser Ser Gin Gin Asp Leu Ser Ser Trp Ser Ser Ala Asp 
50 55 60 

Glu Leu Asp Thr Ser Gly Ser Val Ser Pro Xaa Ser Gly Arg Ser Thr 
65 70 75 80 

Pro Asn Arg Gin Lys Thr Pro Arg Gly Lys Cys Ser Leu Ser Gin Pro 
85 90 95 

Gly Pro Ser Val Ser Ser Pro His Ser v 
100 105 ' 



(2) INFORMATION FOR SEQ ID NO: 730: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

tB) TYPE: amino acid 

CD) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 730: 

Asp Ser Ser Leu Ser Glu Pro Xaa Pro Gly Arg Ser Gly Arg Gly Phe 
1 5 10 15 

Leu Phe Arg Val Leu Arg Ala Ala Leu Pro Leu Gin Leu Leu Leu Leu 
20 25 30 

Leu Leu lie Gly Leu Ala Cys Leu Val Pro Met Ser Glu Glu Asp Tyr 
35 40 45 

Ser Cys Ala Leu Ser Asn Asn Phe Ala Arg Ser Phe His Pro Met Leu 
50 55 60 

Arg Tyr Thr Asn Gly Pro Pro Pro Leu 
65 70 



(2) IMF0RMATIC»9 FOR SEQ ID NO: 731: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLCXSY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 731: 

Met Lys Leu Leu He Cys Gly Asxi Tyr Leu Ala Pro Ser His Ser Glu 
1 5 10 15 

Ser Ser Arg Arg Cys Cys Leu Leu Cys Phe Tyr Pro Leu Cys Leu Glu 
20 25 30 

He Asn Phe Gly Met Lys Val Phe Leu Ser Met Pro Phe Leu Val Leu 
35 . 40 45 

Phe Gin Ser Leu He Gin Glu Asp 
50 55 



(2) INFORMATION FOR SEQ ID NO: 732: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 732: 

Arg He Leu Leu Val Lys Tyr Ser Ala Asn Glu Glu Asn Lys Tyr Asp 
15 10 15 

Tyr Leu Pro Thr Thr Val Asn Val Cys Ser Glu Leu Val Lys Leu Val 
20 25 30 

Phe Cys Val Leu Val Ser Phe Cys Val He Lys Lys Asp His Gin Ser 
35 40 45 

Arg Asn Leu Lys Tyr Ala Ser Trp Lys Glu Phe Ser Asp Phe Met Lys 
50 55 60 

Trp Ser He Pro Ala Phe Leu Tyr Phe Leu Asp Asn Leu He Val Phe 
65 70 75 80 

Tyr Val Leu Ser Tyr Leu Gin Pro Ala Met Ala Val He Phe Ser Asn 
85 90 95 

Phe Ser He He Thr Thr Ala Leu Leu Phe Arg He Val Leu Lys Xaa 
100 105 110 

Arg Leu Asn Trp He Gin Trp Ala Ser Leu Leu Thr Leu Phe Leu Ser 
115 120 125 

He Val Ala Leu Thr Ala Gly Thr Lys Thr Leu Gin His Asn Leu Ala 
130 135 140 

Gly Arg Gly Phe His His Asp Ala Phe Phe Ser Pro Ser Asn Ser Cys 
145 150 155 160 

Leu Leu Phe Arg Asn Glu Cys Pro Arg Lys Asp Asn Cys Thr Ala Lys 
165 170 175 

Glu Trp Thr Phe Pro Glu Ala Lys Trp Asn Thr Thr Ala Arg Val Phe 
180 185 190 
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Ser His He Arg Leu Gly Met Gly His Val Leu He He Val Gin Cys 
195 200 205 

Phe He Ser Ser Met Ala Asn He Tyr . Asn Glu Lys He Leu Lys Glu 
210 215 220 

Gly Asn Gin Leu Thr Glu Xaa He Phe He Gin Asn Ser Lys Leu Tyr 
225 230 235 ' 240 

Phe Phe Gly He Leu Phe Asn Gly Leu Thr Leu Gly Leu Gin Arg Ser 
245 250 255 

Asn Arg Asp Gin He Lys Asn Cys Gly Phe Phe Tyr Gly His Ser 
260 265 270 



(2) INFORMATICMJ FOR SEQ ID NO: 733: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 733: 

Asn Ser Val Pro Asn Leu Gin Thr Leu Ala Val Leu Thr Glu Ala He 
15 10 15 

Gly Pro Glu Pro Ala He Pro Arg Xaa Pro Arg Glu Pro Pro Val Ala 
20 25 30 

Thr Ser Thr Pro Ala Thr Pro Ser Ala Gly Pro Gin Pro Leu Pro Thr 
35 40 45 

Gly Thr Val Leu Val Pro Gly Gly Pro Ala Pro Pro Cys Leu Gly Glu 
50 §5 60 

Ala Trp Ala Leu Leu Leu Pro Pro Cys Arg Pro Ser Leu Thr Ser Cys 
^5 70 75 80 

Phe Trp Ser Pro Arg Pro Ser Pro Trp Lys Glu Thr Gly Val 
85 90 



(2) INFORMATION FOR SEQ ID NO: 734: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 734: 

Ala Leu Gin Leu Ala Phe Tyr Pro Asp Ala Val Glu Glu Trp Leu Glu 
15 10 15 

Glu Asn Val His Pro Ser Leu Gin Arg Leii Gin Xaa Leu Leu Gin Asp 
20 25 30 



wo 98/54963 



720 



Leu Ser Glu Val Ser Ala Pro Pro 
35 40 



(2) INFORMATION FOR SEQ ID NO: 735: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(xi) SEQUE^K:E DESCRIPTION: SEQ ID NO: 735: 

Cys His Pro Pro Ala Leu Ala Gly Thr Leu Leu Arg Thr Pro Glu Gly 
15 10 15 

Arg Ala His Ala Arg Gly Leu Leu Leu Glu Ala Gly Gly Ala 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 736: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 736: 

Gly Ser Ser Ser Thr Arg Ser Trp Phe Ser Thr Ser Ser Pro Gin Arg 
1 5 10 15 

Ser Ma Ser Trp His Ser Gly Ala Pro Ser Cys Arg Ser Trp Arg Leu 
20 25 30 

Pro Cys Ser Trp Leu Ser Thr Arg Met Pro Trp Arg Ser Gly Trp Arg 
35 40 45 

Lys Thr Cys Thr Pro Ala Cys Ser Gly Cys Lys 
50 55 



(2) INFORMATION FOR SEQ ID NO: 737: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 247 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 737: 



Met Arg Pro Asp Trp Lys Ala Gly Ala Gly Pro Gly Gly Pro Pro Gin 
15 10 15 

Lys Pro Ala Pro Ser Ser Gin Arg Lys Pro Pro Ala Arg Pro Ser Ala 
20 25 30 

Ala Ala Ala Ala He Ala Val Ala Ala Ala Glu Glu Glu Arg Arg Leu 
35 40 45 
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Arg Gin Arg Asn Arg Leu Arg Leu Glu Glu Asp Lys Pro Ala Val Glu 
50 . 55 60 

Arg Cys Leu Glu Glu Leu Val Phe Gly Asp Val Glu Asn Asp Glu Asp 
^5 70 75 80 

Ala Leu Leu Arg Arg Leu Arg Gly Pro Arg Val Gin Glu His Glu Asp 
85 . 90 95 

Ser Gly Asp Ser Glu Val Glu Asn Glu Ala Lys Gly Asn Phe Pro Pro 
100 105 110 

Gin Lys Lys Pro Val Trp Val Asp Glu Glu Asp Glu Asp Glu Glu Met 
115 120 125 

Val Asp Met Met Asn Asn Arg Phe Arg Lys Asp Met Met Lys Asn Ala 
130 135 140 

Ser Glu Ser Lys Leu Ser Lys Asp Asn Leu Lys Lys Arg Leu Lys Glu 
150 155 160 

Glu Phe Gin His Ala Met Gly Gly Val Pro Ala Trp Ala Glu Thr Thr 
165 170 175 

Lys Arg Lys Thr Ser Ser Asp Asp Glu Ser Glu Glu Asp Glu Asp Asp 
180 185 190 

Leu Leu Gin Arg Thr Gly Asn Phe lie Ser Thr Ser Thr Ser Leu Pro 
195 200 205 

Arg Gly He Leu Lys Met Lys Asn Cys Gin His Ala Asn Ala Glu Arg 
210 215 220 

Pro Thr Val Ala Arg He Ser He Cys Ala Val Pro Ser Arg Cys Thr 
225 230 235 240 



Asp Cys Asp Gly Cys Trp Asp 
245 



(2) INFORMATION FOR SEQ ID NO: 738: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:. 738: 

Cys Leu Glu Glu Leu Val Phe Gly Asp Val Glu Asn Asp Glu Asp Ala 
1 5 . 10 15 

Leu Leu Arg Arg Leu Arg Gly Pro Arg Val Gin Glu His Glu Asp Ser 
20 25 30 

Gly Asp Ser Glu Val Glu Asn Glu Ala Lys Gly Asn Phe Pro Pro Gin 
35 40 45 

Lys Lys Igro Val Trp Val Asp Glu Glu Asp Glu Asp Glu Glu Met Val 
50 55 60 
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Asp Met Met Asn Asn Arg Phe Arg Lys Asp Met Met Lys Asn Ala Ser 
^5 70 75 80 

Glu Ser Lys Leu Ser Lys Asp Asn Leu Lys Lys Arg Leu Lys Glu Glu 
85 90 95 

Phe Gin His Ala Met Gly Gly Val Pro Ala Trp Ala Glu Thr Thr Lys 
100 105 110 

Arg Lys Thr Ser Ser Asp Asp Glu Ser Glu Glu Asp Glu Asp Asp Leu 
115 120 125 

Leu Gin Arg- Thr Gly Asn Phe He Ser Thr Ser Thr Ser Leu Pro Arg 
13b 135 140 

Gly He Leu Lys Met Lys Asn Cys Gin His Ala Asn Ala Glu Arg Pro 
1^5 150 155 160 

Thr Val Ala Arg He Ser He Cys Ala Val Pro Ser Arg Cys Thr Asp 
165 170 175 

Cys Asp Gly Cys 
180 



\ 

(2) INPORMATION FOR SEQ ID NO: 739: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 739: 

Leu Lys Glu Lys He Val Arg Ser Phe Glu Val Ser Pro Asp Gly Ser 
15 10 15 

Phe Leu Leu He Asn Gly He Ala Gly Tyr Leu His Leu Leu Ala Met 
20 25 30 

Lys Thr Lys Glu Leu He Gly Ser Met Lys He Asn Gly Arg Val Ala 
35 40 45 

Ala Ser Thr Phe Ser Ser Asp Ser Lys Lys Val Tyr Ala Ser Ser Gly 
50 55 60 

Asp Gly Glu Val Tyr Val Trp Asp Val Asn Ser Arg Lys Cys Leu Asn 
65 70 75 80 

Arg Phe Val Asp Glu Gly Ser Leu Tyr Gly Leu Ser He Ala Thr Ser 
85 90 95 

Arg Asn Gly Gin Tyr Val Ala Cys Gly Ser Asn Cys Gly Val Val Asn 
100 105 110 

He Tyr Asn Gin Asp Ser Cys Leu Gin Glu Thr Asn Pro Lys Pro He 
115 120 125 

Lys Ala He Met Asn Leu Val Thr Gly Val Thr Ser Leu Thr Phe Asn 
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130 135 140 

Pro Thr Thr Glu lie Leu Ala He Ala Ser Glu Lys Met Lys Glu Ala 
ISO • 155 '160 

Val Arg Leu Val His Leu Pro Ser Cys Thr Val Phe Ser Asn Phe Pro 
165 170 175 

Val He Lys Asn Lys ASn He Ser His Val His Thr Met Asp Phe Ser 
180 185 190 

Pro Arg Ser Gly Tyr Phe Ala Leu Gly Asn Glu Lys Gly Lys Ala Leu 
195 200 205 

Met Tyr Arg Leu His His Tyr Ser Asp Phe 
210 215 



(2) INFORMATION FOR SEQ ID NO: 740: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY:' linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 740: 

Lys He Asn Gly Arg Val Ala Ala Ser Thr Phe Ser Ser Asp Ser Lys 
1 5 .10 15 

Lys Val Tyr Ala Ser Ser Gly Asp Gly Glu Val Tyr Val Trp Asp Val 
20 25 30 

Asn Ser Arg Lys Cys Leu Asn Arg Phe Val Asp Glu Gly Ser Leu Tyr 
35 40 45 

Gly Leu Ser He Ala Thr S§r Arg Asn Gly Gin Tyr Val Ala Cys Gly 
50 55 60 

Ser Asn Cys Gly Val Val Asn He Tyr Asn Gin Asp Ser Cys Leu Gin 
^5 70 75 80 

Glu Thr Asn Pro Lys Pro He Lys Ala He Met Asn Leu Val Thr Gly 
85 90' 95 

Val Thr Ser Leu Thr Phe Asn Pro Thr Thr Glu He Leu Ala He Ala 
100 105 110 

Ser Glu Lys Met Lys Glu Ala Val Arg Leu Val His Leu Pro Ser Cys 
115 120 125 

Thr val Phe Ser Asn Phe Pro Val He Lys Asn Lys Asn He Ser His 
130 135 140 

Val His Thr Met Asp Phe Ser Pro Arg Ser Gly Tyr Phe Ala Leu Gly 
150 155 160 

Asn Glu Lys Gly Lys Ala Leu 
165 



10 



25 



30 



40 



55 
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(2) INFORMATION FOR SEQ ID NO: 741: 

(i) SEQUENCE CHARACTERISTICS: 

' (A) LENGTH: 246 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 741: 

Met Arg lie Leu Gin Leu He Leu Leu Ala Leu Ala Thr Gly Leu Val 
1 5 _ 10 15 



Gly Gly Glu Thr Arg He He Lys Gly Phe Glu Cys Lys Leu His Ser 
20 25 30 

Gin Pro Txp Gin Ala Ala Leu Phe Glu Lys Thr Arg Leu Leu Cys Gly 
35 40 45 

20 Ala Thr Leu He Ala Pro Arg Trp Leu Leu Thr Ala Ala His Cys Leu 
50 55 60 



Lys Pro Arg Tyr He Val His Leu Gly Gin His Asn Leu Gin Lys Glu 
^5 70 75 80 

Glu Gly Cys Glu Gin Thr Arg Thr Ala Thr Glu Ser Phe Pro His Pro 
85 90 95 

Gly Phe Asn Asn Ser Leu Pro Asn Lys Asp His Arg Asn Asp He Met 
100 105 110 

Leu Val Lys Met Ala Ser Pro Val Ser He Thr Trp Ala Val Arg Pro 
115 120 125 

35 Leu Thr Leu Ser Ser Arg Cys Val Thr Ala Gly Thr Ser Cys Ser Phe 
130 135 140 



Pro Ala Gly Ala Ala Arg Pro Asp Pro Ser Tyr Ala Cys Leu Thr Pro 
145 150 155 160 

Cys Asp Ala Pro Thr Ser Pro Ser Leu Ser Thr Arg Ser Val Arg Thr 
165 170 175 



Pro Thr Pro Ala Thr Ser Gin Thr Pro Trp Cys Val Pro Ala Cys Arg 
45 180 185 190 

Lys Gly Ala Arg Thr Pro Ala Arg Val Thr Pro Gly Ala Leu Trp Ser 
195 200 205 

50 Val Thr Ser Leu Phe Lys Ala Leu Ser Pro Gly Ala Arg He Arg Val 
210 215 220 



Arg Ser Pro Glu Ser Leu Val Ser Thr Arg Lys Ser Ala Asn Met Trp 
225 230 235 240 

Thr Gly Ser Arg Arg Arg 
245 



60 
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. (2) INFORMATION FOR SEQ ID NO: 742: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 742: 

Glu Thr Arg He He Lys Gly Phe Glu Cys Lys Leu His Ser Gin Pro 
15 10 15 

Trp Gin Ala Ala Leu Phe Glu Lys Thr Arg Leu Leu Cys Gly Ala Thr 
20 25 30 

Leu He Ala Pro Arg Trp Leu Leu Thr Ala Ala His Cys Leu Lys Pro 
35 40 45 

Arg Tyr He Val His Leu Gly Gin His Asn Leu Gin Lys Glu Glu Gly 
50 55 60 

Cys Glu Gin Thr Arg Thr Ala Thr Glu Ser Phe Pro His Pro Gly Phe 
^5 70 75 80 

Asn Asn Ser Leu Pro Asn Lys Asp His Arg Asn Asp He Met Leu Val 
85 90 95 

Lys Met Ala Ser Pro Val Ser He Thr Trp Ala Val Arg Pro Leu Thr 
100 105 110 

Leu Ser Ser Arg Cys Val Thr Ala Gly Thr Ser Cys Ser Phe Pro Ala 
115 120 125 

Gly Ala Ala Arg Pro Asp Pro Ser Tyr Ala Cys Leu Thr Pro Cys Asp 
130 135 140 

Ala Pro Thr Ser Pro Ser Leu Ser Thr Arg Ser Val Arg Thr Pro Thr 
145 150 155 160 

Pro Ala Thr Ser Gin Thr Pro Trp Cys Val Pro Ala Cys Arg Lys Gly 
165 170 175 

Ala Arg Thr Pro Ala Arg Val Thr Pro Gly Ala Leu Tro Ser Val Thr 
180 185 190 

Ser Leu Phe Lys Ala Leu Ser Pro Gly Ala Arg He Arg Val Arg Ser 
195 200 205 

Pro Glu Ser Leu Val Ser Thr Arg Lys Ser Ala Asn Met Trp Thr Gly 
210 215 220 

Ser Arg Arg Arg 
225 



(2) INFORMATION FOR SEQ ID NO: 743: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 743: 

Cys Lys Leu His Ser Gin Pro Trp Gin Ala Ala Leu ?he Glu Lvs "^hr 
15 10 is * 

Arg Leu Leu Cys Gly Ala Thr Leu He Ala Pro Arg Trp Leu l^u rhr 
20 25 " 30 " * 

Ala Ala His Cys Leu Lys Pro Arg Tyr He Val His Leu Gly 31r. His 
35 40 45 

Asn Leu Gin Lys Glu Glu Gly Cys Glu Gin Thr Arg Thr Ala rhr S^ u 
50 55 60 

Ser Phe Pro His Pro Gly Phe Asn Asn Ser 
65 70 



(2) INPORMATIC»J FOR SEQ ID. NO: 744: 

(i) SEQUENCE CHAKACTERISTICS : 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 744: 

Val Leu Gin Gly Arg Tyr Phe Ser Pro He Leu Glu Ket Arg Arg leu 
1 .5 10 15 

Arg Pro Glu Gly Xaa Xaa Asn Leu Pro Gly Gly Ser Arg Ala 31n Lys 
20 25 3C 

Glu Pro Arg Gin Asp Leu Thr Leu Val Leu Trp Pro Kis Cys Pro His 
35 40 45 

Phe Ala Met Thr Arg Ser Tyr Val Pro Thr Lys Gin Cys Kez Val 31r. 
50 55 60 

Gly Ser Phe Tyr Cys He Phe He Phe Lys Gly Pro Val Glr. Asr. Trp 
65 70 75 80 

Cys 



(2) INFORMATION FOR SEQ ID NO: 745: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino acids 

(B) TVPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 745: 

Met Pro He He Asp Gin Val Asn Pro Glu Leu His Asp Phe Met Gin 
15 10 IS 

Ser Ala Glu Val Gly Thr He Phe Ala Leu Ser Trp Leu He Thr Trp 
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2: 25 



30 



?hs 31y His Val leu Ser As? ?he Arg His Val Val Arg Leu Tyr Asp 
25 40 45 

?h5 Le- Ala c%-3 His Pre Leu Iter Pro lie Tyr Phe Ala Ala Val 

53 50 

He Val Ls-i Tyr Arg Glu Glr/Slu Val Leu Asp Oys Asp Cys Asp Met 
- '0 75 80 

Aia Ser Val His His Lau leu Ser Gin He Pre Gin Asp Leu Pro Tyr 
55 30 • 95 

.31u riir Leu He Ser Arg l-laa Glu rhr Phe leu Phe Ser Phe Pro His 
IOj 105 2.10 

Pre Asr. Leu Leu Gly Arg Pro Leu Pro Asn Ser Lys Leu Arg Gly Arg 

120 125 

31n Pro Le^ Leu Ser Lys Zhr Leu Ser Trp His Gin Pro Ser Arg Gly 
13C 135 140 

l^u He Trp Cys Cys Gly Ser Gly Xaa Arg Gly Leu Leu Arg Pro Glu 
1=0 155 160 

Asp Arg Thr Lys Asp Val leu Thr Lys Pro Arg Thr Asn Arg Phe Val 
1S5 170 175 

lys leu Ala Val 2fe= Gly leu Thr Val Ala Leu Gly Ala Ala Ala Leu 

135 190 

~- --a leu Glu Trp Ala Pro lys Phe Gin Leu Gin 

155 200 205 



(2) i:ZrCPIC.J:iai ?CR SSQ ro >X5: 745: 

Hi SEQ'JSITCZ GJAPACTERISTICS: 

(A) LENGTH : 70 ariino acids 

(B) TYPE: amino acid 
(D) TOPOLCGV: linear 

(xii SEQUEICE DESCRIPTIC?:: SEQ ID 746: 

Cys Pro Glu Phe Phe He Pro Ala Thr Leu Pro Cys Pro Phe Val Phe 
15 10 15 

Ala Phe Thr Ser Glu Ala Ser Ser Arg Ala Tyr Leu Thr Gin Arg Gly 
20 25 30 

Pro Gly Gly Leu Ala Gin Asn Leu Met Pro Leu Pro Val Gly Phe Trp 
35 40 45 

Mer Gly Ser Leu Pro Pro Pro Trp Cys Trp Arg Lys Trp Val Ser Glu 
50 55 60 
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Ala Cys Ser Cys Phe Cys 
65 70 



(2) INFORMATION FOR SEQ ID NO: 747: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

<B) TYPE: amino acid 

(D) TOPOLOGY: linear 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 747: 

Gly Phe Gly Ser Val Ser Ala Ala Gly Arg Arg Ser Gly Gly Thr Trp 
15 10 15 

Gin Pro Val Gin 
20 



(2) INFORMATION FOR SEQ ID NO: 748: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 748: 

Pro Gly Gly Leu Ala Val Gly Ser Arg Trp Trp Ser Arg Ser Leu Thr 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 749: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»3 : SEQ ID NO: 749: 

Leu Glu Pro Ser Arg Gin Arg Arg Pro Arg Arg Arg Gly Gly Thr Ser 
1 5 10 15 

Arg Pro Glu Thr Asp Gin Arg Ala Lys Cys Trp Arg Gin Leu 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 750: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 750: 
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Val Cys Leu Arg Cys Gin Asn Arg Met Glu Asn 
1 5 10 



(2) I3SIF0RMATIQN FOR SEQ ID NO: 751: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 367 amino acids 

(B) TVPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 751: 

Met Ala Ala Cys Thr Ala Arg Arg Pro Gly Arg Gly Gin Pro Leu Val 
15 10 15 

Val Pro Val Ala Asp Xaa Gly Pro Val Ala Lys Ala Ala Leu Cys Ala 
20 25 30 

Ala Xaa Ala Gly Ala Phe Ser Pro Ala Ser Thr Thr Thr Thr Arg Arg 
35 40 45 

His Leu Ser Ser Arg Asn Arg Pro Glu Gly Lys Val Leu Glu Thr Val 
50 55 60 

Gly Val Phe Glu Val Pro Lys Gin Asn Gly Lys Tyr Glu Thr Gly Gin 
65 70 75 80 

Leu Phe Leu His Ser He Phe Gly Tyr Arg Gly Val Val Leu Phe Pro 
85 90 95 

Trp Gin Ala Arg Leu 3Caa Asp Arg Asp Val Ala Ser Ala Ala Pro Glu 
100 105 110 

Lys Ala Glu Asn Pro Ala Gly His Gly Ser Lys Glu Val Lys Gly Lys 
115 120 125 

Thr His Thr Tyr Tyr Gin Val Leu He Asp Ala Arg Asp Cys Pro His 
130 135 140 

He Ser Gin Arg Ser Gin Thr Glu Ala Val Thr Phe Leu Ala Asn His 
145 150 155 160 

Asp Asp Ser Arg Ala Leu Tyr Ala He Pro Gly Leu Asp Tyr Val Ser 
165 170 175 

His Glu Asp He Leu Pro Tyr Thr Ser Thr Asp Gin Val Pro He Gin 
180 185 190 

His Glu Leu Phe Glu Arg Phe Leu Leu Tyr Asp Gin Thr Lys Ala Pro 
195 200 205 

Pro Phe Val Ala Arg Glu Thr Leu Arg Ala Trp Gin Glu Lys Asn His 
210 215 220 

Pro Trp Leu Glu Leu Ser Asp Val His Arg Glu Thr Thr Glu Asn He 
225 230 235 240 

Arg Val Thr Val He Pro Phe Tyr Mfet Gly Met Arg Glu Ala Gin Asn 



wo 98/54963 PCTAJS98/11422 



730 



245 



250 



255 



Ser His Val Tyr Trp Trp Arg Tyr Cys He Arg Leu Glu Asn Leu Asp 
260 265 270 

5 

ser Asp Val Val Gin Leu Arg Glu Arg His Trp Arg He Phe Ser Leu 
275 280 285 

Ser Gly Thr Leu Glu Thr Val Arg Gly Arg Gly Val Val Gly Arg Glu 
10 290 295 300 

Pro Val Leu Ser Lys Glu Gin Pro Ala Phe Gin Tyr Ser Ser His Val 
305 . 310 315 320 

15 Ser Leu Gin Ala Ser Ser Gly His Met Trp Gly Thr Phe Arg Phe Glu 
325 330 335 



20 



Arg Pro Asp Gly Ser His Phe Asp Val Arg He Pro Pro Phe Ser Leu 
340 345 350 

Glu Ser Asn Lys Asp Glu Lys Thr Pro Pro Ser Gly Leu His Trp 
355 360 365 



25 

(2) INFORMATION FOR SEQ ID NO: 752: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 33 amino acids 
30 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 752: 

Met Ala Ala Cys Thr Ala Arg Arg Pro Gly Arg Gly Gin Pro Leu Val 
35 1 5 10 15 

Val Pro Val Ala Asp Xaa Gly Pro Val Ala Lys Ala Ala Leu Cys Ala 
20 25 30 

40 Ala - 



45 (2) INFORMATION FOR SEQ ID NO: 753: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
50 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 753: 



55 



Met Ala Ala Cys Thr Ala Arg Arg Pro Gly Arg Gly Gin Pro Leu Val 
1^5 XO 15 

Val Pro Val Ala Asp Xaa Gly Pro Val Ala Lys Ala Ala Leu Cys Ala 
20 25 30 



60 



Ala 
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<2) INF0R61ATI0N FOR SEQ ID NO: 754: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LE^IGTH: 33 azoino acids 

(B) TY^PE: . amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 754: 

Met Ala Ala Cys Thr Ala Arg Arg Pro Gly Arg Gly Gin Pro Leu Val 
15 10 15 

Val Pro Val Ala Asp Xaa Gly Pro Val Ala Lys Ala Ala Leu Cys Ala 
20 25 30 

Ala 



(2) INFORMATION FOR SEQ ID NO: 755: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 755: 

Met Ala Ala Cys Thr Ala Arg Arg Pro Gly Arg Gly Gin Pro Leu Val 
15 10 15 

Val Pro Val Ala Asp Xaa Gly Pro Val Ala Lys Ala Ala Leu Cys Ala 
20 25 30 

Ala 



(2) INFORMATION FOR SEQ ID NO: 756: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
(D) TOFOIXXV: linear 

(xi) SEQUENCE DESCRIPTIC^: SEQ ID NO: 756:. 

Mfet Ala Ala Cys Thr Ala Arg Arg Pro Gly Arg Gly Gin Pro Leu Val 
1 5 10 15 

Val Pro Val Ala Asp Xaa Gly Pro Val Ala Lys Ala Ala Leu Cys Ala 
20 25 30 

Ala 
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(2) INFORMATION FOR SEQ ID NO: 757: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LQKaTH: 35 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 757: 

,/ 

Val Leu Glu Thr Val Gly Val Phe Glu Val Pro Lys Gin Asn Gly Lys 
15 10 15 

Tyr Glu Thr Gly Gin Leu Phe Leu His Ser He Phe Gly Tyr Arg Gly 
20 25 30 

Val Val Leu 
35 



(2) INFORMATION FOR SEQ ID NO: 758: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 758: 

Gly Leu Asp Tyr Val Ser His Glu Asp He Leu Pro Tyr Thr Ser Thr 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 759: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 759: 

Asp Val His Arg Glu Thr Thr Glu Asn He Arg Val Thr Val He Pro 
15 10 15 

Phe Tyr Met 



(2) INFORMATION FOR SEQ ID NO: 760: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 760: 

Trp Trp Arg Tyr Cys He Arg Leu Glu Asn Leu Asp Ser Asp Val Val 
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1 



5 



10 



15 



Gin Leu Arg Glu Arg 
20 



(2) INFORMATION FOR SEQ ID NO: 761: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: andno acid 
■ (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 761: 

Pro Ala Phe Gin Tyr Ser Ser His Val Ser Leu Gin -Ala Ser Ser Gly 
15 10 15 

His Met Trp Gly Thr Phe Arg Phe Glu Arg 



(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 762: 

Ser Leu Cys Cys Pro Glu Gly Ala Glu Gly Cys 
15 10 



(2) INFORMATION FOR SEQ ID NO: 763: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 763: 

Gin Leu Lys Lys Thr His Tyr Asp Arg Pro Cys Pro 
15 10 



(2) INFORMATION FOR SEQ ID NO: 764: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 764: 



20 



25 



(2) INFORMATION FOR SEQ ID NO: 762: 



(i) SEQUENCE CHARACTERISTICS: 



Gin Leu Lys Lys Thr His Tyr Asp Arg Pro Cys Pro 
15 10 
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(2) INFORMATION FOR SEQ ID NO: 765: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 170 amino acids 
' (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 765: 

Ala Gin Arg Lys Lys Glu IfeC Val Leu Ser Glu Lys Val Ser Gin Leu 
15 10 15 

Met Glu Trp Thr Asn Lys Arg Pro Val lie Arg Met Asn Gly Asp Lys 
20 25 30 

Phe Arg Arg Leu Val Lys Ala Pro Pro Arg Asn Tyr Ser Val lie Val 
35 40 45 

Met Phe Thr Ala Leu Gin Leu His Arg Gin Cys Val Val Cys Lys Gin 
50 55 60 

Ala Asp Glu Glu Phe Gin lie Leu Ala Asn Ser Trp Arg Tyr Ser Ser 
65 70 75 80 

Ala Phe Thr Asn Arg lie Phe Phe Ala Met Val Asp Phe Asp Glu Gly 
85 90 , 95 

Ser Asp Val Phe Gin Met Leu Asn Met Asn Ser Ala Pro Thr Phe He 
100 105 110 

Asn Phe Pro Ala Lys Gly Lys Pro Lys Arg Gly Asp Thr Tyr Glu Leu 
115 120 125 

Gin Val Arg Gly Phe Ser Ala Glu Gin He Ala Arg Trp He Ala Asp 
130 135 140 

Arg Thr Asp Val Asn He Arg Val He Arg Pro Pro Asn Met Ala Ala 
145 ISO 155 160 

Arg Trp Arg Phe Trp Cys Val Ser Val Thr 
165 170 



(2) INPORMATIW FOR SEQ ID NO: 766: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 766: 

M^t Val Val Ala Leu Leu He Val Cys Asp Val Pro Ser Ala Ser 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 767: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 767: 

Ala Gin Arg Lys Lys Glu Met Val Leu Ser Glu Lys Val Ser Gin Leu 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 768: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 768: 

Met Glu Trp Thr Asn Lys Arg Pro Val He Arg Met Asn Gly Asp Lys 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 769: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 769: 

Arg Arg Leu Val Lys Ala Pro Pro Arg Asn Tyr Ser Val He Val Met 
1.5 10 15 

Phe Thr Ala Leu Gin Leu His Arg Gin Cys Val Val Cys Lys Gin Ala 
20 25 30 

Asp Glu Glu Phe Gin He Leu Ala Asn Ser Trp Arg Tyr Ser Ser Ala 
35 40 45 

Phe Thr Asn Arg He Phe Phe Ala 
50 55 



(2) INFORMATIC»I FOR SEQ ID NO: 770: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
CD) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTICttJ : SEQ ID NO: 770: 
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Met Val Asp Phe Asp Glu Gly Ser Asp Val Phe Gin Met Leu Asn Mtet 
1 5 10 15 

Asn Ser Ala Pro Thr Phe lie Asn Phe Pro Ala Lys Gly Lys Pro 
20 25 30 



(2) INFORMATION. FOR SEQ ID NO: 771: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 771: 

Lys Arg Gly Asp Thr Tyr Glu Leu Gin Val Arg Gly Phe Ser Ala Glu 
1 5 10 15 

Gin He Ala Arg Trp He Ala Asp Arg Thr Asp Val Asn He Arg Val 
20 25 30 

He Arg Pro Pro Asn 
35 



(2) INFOHMATICN FOR SEQ ID NO: 772: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 772: 

Tyr Ala Gly Pro Leu Met Leu Gly Leu Leu Leu Ala Val He Gly Gly 
1 5 10 15 

Leu Val Tyr - Leu Arg Arg Val He Txp Asn Phe Ser Leu He Lys Leu 
20 25 30 

Asp Gly Leu Leu Gin Leu Cys Val Leu Cys Leu Leu 
35 40 



(2) INFORMATION FOR SEQ ID NO: 773: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LS^GTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 773: 

Asp Ala Val Phe Lys Gly Phe Ser Asp Cys Leu Leu Lys Leu Gly Asp 
1 5 10 15 

Ser 
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(2) INFORMATION FOR SEQ ID NO: 774; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 774: 

Cys Gin Glu Gly Ala Lys Asp Met Trp Asp Lys Leu Arg Lys Glu Ser 
1 5 10 15 

Lys Asn Leu Asn 
20 



(2) INFORMATION FOR SEQ ID NO: 775: 

• (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 775: 

Val Leu Leu Val Ser Leu Ser Ala Ala Leu Ala Thr Trp Leu Ser Phe 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 776: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 776: 

Met Gly Leu Lys Leu Asn Gly Arg Tyr He Ser Leu He Leu Ala Val 
1 5 10 15 

Gin He Ala Tyr Leu Val Gin Ala Val Arg Ala Ala Gly Lys Cys Asp 
20 25 30 

Ala Val Phe Lys Gly Phe Ser Asp Cys Leu Leu Lys Leu Gly Asp Ser 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 777: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LE2IGTK: 90 amino acids 



1 
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(B) TOPE: amino acid 
(D) TOPOLOGY: linear 
(xi) .SEQUENCE DESCRIPTION: SEQ ID NO: 777: 

5 Pro Ala Ala Trp Asp Asp Lys Thr Asn He Lys Thr Val Cys Thr Tyr 
15 10 15 

Trp Glu Asp. Phe His Ser Cys Thr Val Thr Ala . Leu Thr Asp Cys Gin 
20 25 30 



10 



Glu Gly Ala Lys Asp Met Trp Asp Lys Leu Arg Lys Glu Ser Lys Asn 
35 40 45 



Leu Asn He Gin Gly Ser Leu Phe Glu Leu Cys Gly Ser Gly Asn Gly 
15 50 55 60 

Ala Ala Gly Ser Leu Leu Pro Ala Phe Pro Val Leu Leu Val Ser L.eu 
65 70 75 80 

20 Ser Ala Ala Leu Ala Thr Trp Leu Ser Phe 

85 90 



25 (2) INFORMATION FOR SEQ ID NO: 778: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 778: 

Met Gly Leu Lys Leu Asn Gly Arg Tyr He Ser Leu He Leu Ala Val 
15 10 15 

Gin He Ala Tyr Leu Val Gin Ala Val Arg Ala Ala Gly Lys Cys Asp 
20 25 30 



35 



Ala Val Phe Lys Gly Phe Ser Asp Cys Leu Leu Lys Leu Gly Asp Ser 
40 35 40 45 

Xaa Xaa Xaa Xeia Xaa Pro Ala Ala Trp Asp Asp Lys Thr Asn He Lys 

50 55 60 . ^ 

45 Thr Val Cys Thr Tyr Trp Glu Asp Phe His Ser Cys Thr Val Thr Ala 
65 70 75 80 

Leu Thr Asp Cys Gin Glu Gly Ala Lys Asp Mfet Trp Asp Lys Leu Arg 
85 . 90 95 

50 

Lys Glu Ser Lys Asn Leu Asn He Gin Gly Ser Leu Phe Glu Leu Cys 
100 105 iio 

Gly Ser Gly Asn Gly Ala Ala Gly Ser Leu Leu Pro Ala Phe Pro Val 
55 115 120 125 

Leu Leu Val Ser Leu Ser Ala Ala Leu Ala Thr Trp Leu Ser Phe 
130 135 140 

60 



i 
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(2) INFORMATION FOR SEQ ID NO: 779: 

(i) SEQUENCE CHARACTERISTICS: 
5 <A) LENGTH: 34 amino acids 

(B) TiTPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 779: 

10 Met Asn Ser Ala Ala Gly Phe Ser His Leu Asp Arg Arg Glu Arg Val 
1 5 10 . 15 



15 



Leu Lys Leu Gly Glu Ser Phe Glu Lys Gin Pro Arg Cys Ala Ser Thr 
20 25 30 

Leu Cys 



20 

(2) INFORMATION FOR SBQ ID NO: 780: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 
25 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 780: 

Thr He Tyr Pro Thr Glu Glu Glu Leu Gin Ala Val Gin Lys He Val 
30 1 5 10 15 

Ser He Thr Glu Arg Ala Leu Lys Leu .Val Ser Asp 
20 25 

35 

(2) INFORMATION FOR SEQ IQ .NO: 781: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 30 amino acids 

<B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 781: 

45 Arg Ala Leu Lys Gly Val Leu Arg Val Gly Val Leu Ala Lys Gly Leu 
15 10 • 15 

Leu Leu Arg Gly Asp Arg Asn Val Asn Leu Val Leu Leu Cys 
20 25 30 

50 



(2) INFORMATION FOR SBQ ID NO: 782: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 782: 

60 



wo 98/54963 



PCT/US98/11422 



740 



Ala Leu Ala Ala Leu Arg His Ala Lys Trp Phe Gin Ala Arg Ala Asn 
15 10 15 

Gly Leu Gin Ser Cys Val He He He Arg He Leu Arg Asp Leu Cys 
5 20 ' 25 30 

Gin Arg Val Pro Thr Trp Ser 

.35 

10 

(2) INFORMATION FOR SEQ ID NO: 783: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 783: 

20 Gly Asp Ala Leu Arg Arg Val Phe Glu Cys He Ser Ser Gly He He 

1 5 . 10 . 15 

Leu 

25 



(2) INFORMATION FOR SEQ ID NO: 784: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 784: 



35 



40 



45 



Leu Ala Phe Arg Gin He His Lys Val Leu Gly Met Asp Pro Leu Pro 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 785: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 342 amino acids 
- (B) TYPE: amino acid 
(D) TOPOLOGY: linear 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 785: 

Thr- He Tyr Pro Thr Glu Glu Glu Leu Gin Ala Val Gin Lys He Val 
1 5 10 • 15 

55 Ser He Thr Glu Arg Ala Leu Lys Leu Val Ser Asp Ser Leu Ser Glu 
20 25 30 



His Glu Lys Asn Lys Asn Lys Glu Gly Asp Asp Lys Lys Glu Gly Gly 
35 40 45 
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Lys Asp Arg Ala Leu Lys Gly Val Leu Arg Val Gly Val Leu Ala Lys 
50 55 60 

Gly Leu Leu Leu Arg Gly Asp Arg Asn Val Asn Leu Val Leu Leu Cys 
5 65 70 75 80 

Ser Glu Lys Pro Ser Lys Thr Leu Leu Ser Arg lie Ala Glu Asn Leu 
85 90 95 

10 Pro Lys Gin Leu Ala Val lie Ser Pro Glu Lys Tyr Asp He Lys Cys 
100 105 110 



15 



Ala Val Ser Glu Ala Ala He He Leu Asn Ser Cys Val Glu Pro Lys 
115 120 125 

Met Gin Val Thr He Thr Leu Thr Ser Pro He He Arg Glu Glu Asn 

130 135 140 • 



20 



25 



30 



35 



Met Arg Glu Gly Asp Val Thr Ser Gly Met Val Lys Asp Pro Pro Asp 
145 150 155 160 

Val Leu Asp Arg Gin Lys Cys Leu Asp Ala Leu Ala Ala Leu Arg His 
165 170 175 

Ala Lys Trp Phe Gin Ala Arg Ala Asn Gly Leu Gin Ser Cys Val He 
180 185 190 

He He Arg He Leu Arg Asp Leu Cys Gin Arg Val Pro Thr Trp Ser 
195 200 205 

Asp Phe Pro Ser Trp Ala Met Glu Leu Leu Val Glu Lys Ala He Ser 
210 215 220 

Ser Ala Ser Ser Pro Gin Ser Pro Gly Asp Ala Leu Arg Arg Val Phe 
225 230 235 240 

Glu Cys He Ser Ser Gly lie. He Leu Lys Gly Ser Pro Gly Leu Leu 
245 * 250 255 



40 



45 



Asp Pro Cys Glu Lys Asp Pro Phe Asp Thr Leu Ala Thr Met Thr Asp 
260 265 270 

Gin Gin Arg Glu Asp He Thr Ser Ser Ala Gin Phe Ala Leu Arg Leu 
275 280 285 

Leu Ala Phe Arg Gin lie His Lys Val Leu Gly Met Asp Pro Leu Pro 
290 295 300 



Gin Met Ser Gin Arg Phe Asn He His Asn Asn Arg Lys Arg Arg Arg 
50 305 310 315 320 

Asp ser Asp Gly Val Asp Gly Phe Glu Ala Glu Gly Lys Lys Asp Lys 
325 330 335 



55 Lys Asp Tyr Asp Asn Phe 
340 



60 



(2) INFORMATICS FOR SEQ ID NO: 786: 
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10 



15 



30 



45 



50 



(i) SEQUENCE CHARACTERISTICS: . 

(A) LE17GTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 786: 

Met Gly Ser Gin His Ser Ala Ala Ala Arg Pro Ser Ser Cys Arg Arg 
1 5 10 15 

Lys Gin Glu Asp Asp Arg Asp Gly 
20 



(2) INFORMATION FOR SEQ ID NO: 787: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 787: 

Leu Leu Ala Glu Arg Glu Gin Glu Glu Ala He Ala Gin Phe Pro Tyr 
25 • 1 5 10 15 

Val Glu Phe Thr Gly Arg Asp Ser He Thr Cys Leu Thr Cys 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 788: 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 34 amino acids 

<B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTIOI: SEQ ID NO: 788: 

40 Gin Gly Thr Gly Tyr He Pro Thr Glu Gin Val Asn Glu Leu Val Ala 
1 5 • 10 15 

Leu He Pro His Ser Asp Gin Arg Leu Arg Pro Gin Arg Thr Lys Gin 
20 25 30 



Tyr Val 



(2) INFORMATION FOR SEQ ID NO: 789: 



' (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 
55 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 789: 



Ala Arg Leu Asn Val Gly Arg Glu Ser Leu Lys Arg Glu Met Leu Lys 
60 1 5 10 15 
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Ser Gin Gly Val Lys Val Ser Glu Ser Pro Met Gly Ala Arg His Ser 
20 25 30 

Ser Trp Pro Glu Gly Ala Ala Phe Cys Lys Lys Val Gin Gly Ala Gin 
35 40 45 

Met Gin Phe Pro Pro Arg Arg 
50 55 



(2) INFORMATION FOR SEQ ID NO: 790: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 790: 

Ala Arg Leu Asn Val Gly Arg Glu Ser Leu Lys Arg Glu Met Leu 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 791: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 791: 

Leu Lys Ser Gin Gly Val Lys Val Ser Glu Ser Pro Met Gly Ala Arg 
15 10 15 

His Ser Ser Trp 
20 



(2) INFORMATION FOR SEQ ID NO: 792: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIO): SEQ ID NO: 792: 

Ala Phe Cys Lys Lys Val Gin Gly Ala Gin Met Gin Phe Pro Pro Arg 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 793: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 793: 

Ala Phe Cys Lys Lys Val Gin Gly Ala Gin Met Gin Phe Pro Pro «rg 
1 5 . 10 15 



(2) INFORMATION FOR SEQ ID NO: 794: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NC: 794: 

val Gin Val Leu Glu Gin Leu Thr Asn Asn Ala Val Ala Glu Ser ^jrg 
1 5 * 10 15 

Phe Asn Asp Ala Ala Tyr Tyr Tyr Trp Met Leu Ser Met Gin Cys Z^m 
20 . 25 30 

Asp lie Ala Gin Asp 
35 



<2) INFORMATIOI FOR SEQ ID NO: 795: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: aiaino acid 
(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 795: 

Pro Ala Gin Lys Asp Thr Met Leu Gly Lys Phe Tyr His ?r:e Gin Arg 
1 5 10 15 

Leu Ala Glu Leu Tyr His Gly Tyr His Ala He His Arg His Thr Glu 
20 25 30 

Asp Pro 



(2) INFORMATION FOR SEQ ID NO: 796: 

(i) SEQUEUCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 796: 

Leu Ala Lys Gin Ser Lys Ala Leu Gly Ala Tyr Arg Leu Ala Arg His 
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10 



15 



Ala Tyx Asp Lys Leu Arg Gly* Leu Tyr lie Pro 
20 25 



(2) INFORMATIW FOR SEQ ID NO: 797: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 797: 

Ala Arg Phe Gin Lys Ser He Glu Leu Gly Thr Leu Thr He Arg Ala 
1 5 10 15 

Lys Pro Phe His Asp Ser Glu Glu Leu Val Pro Leu Cys Tyr Arg Cys 
20 25 30 

Ser Thr Asn Asn 
35 



(2) INFORMATION FOR SEQ ID NO: 798: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 798: 

Pro Leu Leu Asn Asn Leu Gly Asn Val Cys He Asn. Cys Arg Gin Pro 
1 5 10 15 

Phe He Phe Ser Ala Ser Ser Tyr Asp Val Leu His Leu Val Glu Phe 
20 25 30 

Tyr Leu Glu Glu Gly He Thr Asp Glu Glu Ala He Ser Leu He Asp 
35'' 40 45 

Leu Glu Val Leu Arg Pro Lys Arg Asp Asp Arg Gin Leu Glu He Cys 
50 55 60 

Lys Gin Gin Leu Pro Asp Ser Cys Gly 
65 .70 



(2) INFORMATION FOR SEQ ID NO: 799: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 799: 

Met Pro Tyr Ala Gin Trp Leu Ala Glu Asn Asp Arg Phe Glii Glu Ala 
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10 



15 



Gin Lys Ala Phe His Lys Ala Gly Arg Gin Arg Glu Ala 
20 25 



(2) INFORMATION FOR SEQ ID NO: 800: 

(i) SEQUENCE CHAEy^CTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 800: 

Phe Ser Val His Arg Pro Glu Thr Leu Phe Asn He Ser Arg Phe Leu 
1 5 10 15 

Leu His Ser Leu Pro Lys Asp Thr Pro Ser Gly He Ser Lys Val Lys 
20 25 30 

He Leu Phe Thr 
35 



> 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 136m; 



A. The indications made below relate to the microorganism referred to in the description 
on page 161 Jinc N/A 



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet 



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (incittding postal code and country) 

1 080 1 University Boulevard ^ 
Manassas, Virginia 201 10-2209 
United States of America 



Date of deposit March 27, 1 997 


Accession Number 97979 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MM^t (if the mdicaaons are not for aUdesignaud States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 

The indications listed below will be submitted to the international Bureau later {specify the general nature of the indications, e.g.. "Accession 
Number of Deposit") * 



For receiving Office use only . 



Its sheet was receiyedrv^ti^Uiain^]^ application 



Authorized officer 



0 4 JUN ms 



• For International Bureau use only < 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRulcl36tf; 



A. The indications made below relate to the microorganism referred to in the description 
on page 162 Jinc N/A 



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet ;- " 



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 20 1 1 0-2209 
United States of America 



Date of deposit April 4, 1 997 



Accession Number 97974 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE Of the indicaiions are not for all designated Siaies) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later {specify the general nature of the indications, e e "Accession 
Number of Deposit**) 



For receiving Office use only , 



m 



This sheet was- received with theOitemational application 



Authorized officer 



> For International Bureau use only < 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCX Rule 13A/5; 



A. The indications made below relate to the microorganism referred to in the description 
on page 162 .line N/A 



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet r- 



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution {including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 201 10-2209 
United States of America 



Date of deposit May 29, 1997 



Accession Number 209080 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (iftheindiaaiom are notforatt designated States) 



E. SEPARATE FURNISHING OF INDICATIONS fleave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later {specify the general nature of the indications, e,z "Accession 
Number of Deposit*^ 



For receiving Office use only , 



This sheet was ^e^fdw^ihc tntemational application 

^^rr-icrr:V-c:aiist 



Authorized officer 



I For International Bureau use only 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCX Rule ISAw; 



A. The indications made below relate to the microorganism referred to in the description 
on page 164 ,line N/A 



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet 



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 20110-2209 
United States of America 



Date of deposit December 3, 1 997 



Accession Number 2095 1 1 



C. ADDITIONAL INDICATIONS rieave blank if not applicable) This information is continued on an additional sheet 



D, DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E, SEPARATE FURNISHING OF INDICATIONS Heave blank if not applicable) 



The indications listed below will be submitted to the international Bureau later {specif the general namre of the indications, e.g., "Accession 
Number ofDeposif^ 



For receiving Office use only , 



SThis sheet was recej«ed.with the international application 
^.^';jmcn Smith 



Authorized ofTicer 



0 4 JUN rm 



» For International Bureau use only < 



□ 



This sheet was received by the International Buieau on: 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 



(PCT Rule \3bis) 



A. The indications made below relate to the microorganism referred to in the description 
on page 167 .line N/A 



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet : " 



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 20110-2209 
United States of America 



Date of deposit April 4, 1997 



Accession Number 97975 



C. ADDITIONAL INDICATIONS {leave blank if not applicable) This information is continued on an additional sheet 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (ifthe indications art not for all designated Simes) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the international Bureau later {specify the general nature of the indications, e.g., "Accession 
Number of Deposit") 



For receiving OfTice use only , 



This sheet was received* wiUi thennteiiiational application 



Authorized officer 



0 4 JUN 1998 



• For International Bureau use only i 



□ 



This sheet was received by the Intemational Bureau on: 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRulelS^is; 



A. The indications made below relate to the microorganism referred to in the description 
on page 167 ,iine N/A 



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet ^ 

Name of depositary institution 

American Type Culture Collection 



Address of depositary institution {including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 20110-2209 
United States of America 



Date of deposit May 29, 1997 



Accession Number 20908 1 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the uuOcaHom are nm for all designoied States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later {specify the general nature of the indications, e.g., "Accession 
ivumtfer of Deposit j • 



For receiving Office use only , 



0 



This sheet was received with the intemationai application 



Authorized officer 



G4 jUN ' 



I For Intemationai Bureau use only 



□ 



This sheet was received by the Intemationai Bureau on: 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCX Rule 13^5; 



A. The indications made below relate to the microorganism referred to in the description 
on page 171 , Unc N/A 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet 



Name of depositary institution 



American Type Culture Collection 



Address of depositary institution (including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 20 1 1 0-2209 
United States of America 



Date of deposit April 4, 1997 



Accession Number 97976 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet 



a DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if iheindicaUons are noi for all designated Siaies) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



Sr o/KiO^ "^"^ submitted to the International Bureau later {specif the general nature of the indications, e.g.. 



g, "Accession 



For receiving Office use only , 



This sheet was received with the thteriiaiionatlqpplication 



Authorized ofTiccr 

,0 4 m I'^so 



■ For International Bureau use only . 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRulelSAis; 

A. The indications made below relate to the microorganism referred to in the description 

on page 172 . line N/A 

B. IDENTinCATION OF DEPOSIT Further deposits are identified on an additional sheet - 

Name of depositary institution 

American Type Culture Collection 

Address of depositary institution {induding postal code and country) 

10801 University Boulevard 
Manassas, Virginia 201 10-2209 
United States of America 



Date of deposit April 4, 1 997 


Accession Number 97977 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are noifor all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specif the general nature of the indications, e g '•Accession 
Number of Deposit") 



For receiving Office use only , 



is sheet was received Avith tbfjatcfn^tional application 

P:-;:-^! G::scia!i3t 



Authorized officer 



0 4 juN me 



I For international Bureau use only 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 136»; 



A. The indications made below relate to the microorganism referred to in the description 
on page, 172 .line N/A 



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet 



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution {inciuding postal code and country) 

1 080 1 University Boulevard 
Manassas, Virginia 201 10-2209 
United States of America 



Date of deposit May 29, 1 997 



Accession Number 209082 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indicmiom are not for att designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later {specify the general nature of the indications, e.g.. "Accession 
Ntunber of Deposit $ 



For receiving Office use only , 



jry^'S sheet was received with the InierMtnnal appiication 



Authorized officer 



0 4 Sm tggg 



• For International Bureau use only ^ 



□ 



This sheet was received by the International Bureau on: 



AuUiorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRule 13A/sr; 

A. The indications made below relate to the microorganism referred to in the description 

on page 176 jinc N/A . 

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet - 
Name of depositary institution 

American Type Culture Collection 

Address of depositary institution {including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 20110-2209 
United States of America 



Date of deposit April 28, 1997 


Accession Number 209007 


€• ADDITIONAL INDICATIONS (leave blank \f not applicable) This information is continued on an additional sheet 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (IftheindicationM are na/oratt designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 

indications listed below will be submitted to the Imemarional Bureau later {specify the general nature of the indications, e.g., "Accession 
ber of Deposit") 



The 
Number 



For receiving Office use only , 



sheet was received .with the international application 



Authorized ofTicer 

0 4 JUN 1998 



i For International Bureau use only < 



□ 



this sheet was received by the International Bureau on: 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRulel3&/$; 



A. The indications made below relate to the microorganism referred to in the description 
on page 176 .line N/A 



B. IDENTIFICATION OP DEPOSIT Further deposits are identified on an additional sheet ~ 

Name of depositary institution 

American Type Culture Collection 



Address of depositary institution {including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 20110-2209 
United States of America 



Date of deposit May 29, 1997 



Accession Number 209083 



C. ADDITIONAL INDICATIONS {leave blank if not applicable) This information is continued on an additional sheet 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE mHeimaeadoa, are noifor all designaied states) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later {specif the general nature of the indications, e.g, "Accession 
Number of Deposit j 



For receiving Office use only , 



|~| This sheet was r)ueived.with ihean^ational application 

7-.;r::r~T.; •;o3c:alist 
• ••••• : orations 



Authorized ofitcer ./-I 

G 4 JUN 1998' 



■ For International Bureau use only < 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



wo 98/54963 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCX Rule 



A. The indications made below relate to the microorganism referred to in the description 
on page 179 , Hne N/A 



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet 



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution {including postal code and country) 

1 080 1 University Boulevard 
Manassas, Virginia 201 10-2209 
United States of America 



Date of deposit April 28, 1997 



Accession Number 209008 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet ~~ 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE {if the UuHcadons are not for aU designated States) 



E. SEPARATE FURNISHING OF INDICATIONS fleave blank if not applicable) 



The indications listed below will be submitted to the Intematibnal Bureau later (specify the general nature of the indications, e.z.. "Accession 
Number of Deposit") * * 



For receiving Office use only , 



□ 



This sheet was received with the international application 



Authorized ofTicer 



I For International Bureau use only • 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

9 

(PCTRule Ubis) 



A. The indications made below relate to the microorganism referred to in the description 
on page 179 jine N/A 



B. IDENTIFICATION OF DEPOSIT further deposits are identified on an additional sheet 



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution {including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 20110-2209 
United States of America 



Date of deposit May 29, 1997 



Accession Number 209084 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional she^ 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the mi&eaiiom arena for all designaied States) 



E, SEPARATE FURNISHING OF IN DICATIONS (lea^ blank if not applicable) 

N^fofDe^U^^ ^ submitted to the International Bureau later {specify the general nature of the indications, e.g.. ''Accession 



For receiving Office use only , 



□ 



This sheet was received with (he tntanb^onal application 

• -"^csrar/ons 



Authorized ofTicer 

0 4 JUN 1998 



• For International Bureau use only ( 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 
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^ INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCX Rule 13Wj; 



A. The indications made below relate to the microorganian referred to in the description 
on page 180 Jinc N/A 



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet 



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution {including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 201 10-2209 
United States of America 



Date of deposit April 28, 1997 


Accession Number 2090 1 0 


C* ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet * 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE mhemmcaaomarenotforattdesignaiedSmes) 



E. SEPARATE FURNISHING OF INDICATIONS fleave blank if not applicable) 

TTic indications listed below will be submitted to Uie international Bureau later {specify the general nature of the indications, e.g., ''Accession 
Number of Deposit'^ 



MMHMM. For receiving Office use only 

p*^ This sheet was received with the international application 



^■■■^^ For International Bureau use only 

j"~j This sheet was received by tiie International Bureau on: 



Authorized officer 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRulel3to; 



A. The indications made below relate to the microorganism referred to in the description 
on page 180 .line N/A 



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet r- 



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution {including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 20110-2209 
United States of America 



Date of deposit May 29, 1 997 


Accession Number 209085 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE nfthe indications are not/or all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indiMtions listed below will be submitted to the IntemaUonal Bureau later {specify the general nature of the indications, e.g.. "Accession 
Number of Deposit"^ 



For receiving Office use only , 



□ 



This sheet was received withlhe inticnraSS&al application 



Authorized officer 



0 4 JUN 1998 



• For International Bureau use only i 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRulel3A«; 

A. The indications made below relate to the microorganism referred to in the description 

on page 182 .line WA 

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □ 
Name of depositary institution 

American Type Culture Collection 

Address of depositary institution (including postal code and couniry) 

10801 University Boulevard 
Manassas, Virginia 201 10-2209 
United States of America 



Date of deposit April 28, 1997 


Accession Number 209009 


C, ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (ifthe indications are naifor all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 

The indications listed below will be submitted to the Intenoational Bureau later {specif the general nantre of the indications, e.g. "Accession 
Number qf Deposit*^ * 



For receiving Office use only , 



□ 



This sheet was received with the intemationai application 



Authorized officer 



■ For International Bureau use only 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 



(PCTRule \3bis) 



A. The indications made below relate to the microorganism referred to in the description 
on page 186 .line N/A 



R IDENTinCATION OF DEPOSIT Further deposits arc identified on an additional sheet rn 



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country) 

1 080 1 University Boulevard 
Manassas, Virginia 20110-2209 
United States of America 



Date of dqwsit April 28. 1 997 



Accession Number 2090 1 1 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This infonnation is continued on an additional sheet 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (inheintBcaUons are noi/oraadesigaaied States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 

The indications listed below will be submitted to the Intemaiional Bureau later {specif the general nantre of the indications, e.g.. "Accession 
Number of Deposit") 



For receiving Office use only , 



lis sheet was received with the'inteniBtiona] application 

.:. . ... ip3claiist 
vrD-r CT Cceraiions 



Authorized officer 



0 4 M 1998 



I For International Bureau use only 



□ 



This sheet was received by the international Bureau on: 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRulel3fcB; 

A. The indications made below relate to the microoiganism referred to in the description 

on page 174 .line N/A 

B. IDENTIHCATION OF DEPOSIT Further deposits are identified on an additional sheet Q 

Name of depositary institution 

American Type Culture Collection 

Address of depositary institution {including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 20110-2209 
United States of America 



Date of deposit April 7. 1 998 


Accession Number 209746 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet ™ 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the mdications are not for aU designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below wiH be submitted to the International Bureau later {specify the general nature of the indications, e.g.. 'Accession 
Number of Deposit*') 



For receiving Office use only , 



is sheet was received with the international application 



Authorized officer 



0 4 M 1998 



I For International Bureau use only « 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 
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What Is Claimed Is: 

1 . An isolated nucleic acid molecule comprising a polynucleotide having a 
nucleotide sequence at least 95% identical to a sequence selected from the group 
consisting of: 

(a) a polynucleotide fragment of SEQ ID NO:X or a polynucleotide fragment of 
the cDNA sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID 
NO:X; 

(b) a polynucleotide encoding a polypeptide fragment of SEQ ID NO: Y or a 
polypeptide fragment encoded by tiie cDNA sequence included in ATCC Deposit No:Z, 
which is hybridizable to SEQ ID NO:X; 

(c) a polynucleotide encoding a polypeptide domain of SEQ ID NO: Y or a 
polypeptide domain encoded by the cDNA sequence included in ATCC Deposit No:Z, 
which is hybridizable to SEQ ID NO:X; 

(d) a polynucleotide encoding a polypeptide epitope of SEQ ID NO: Y or a 
polypeptide epitope encoded by the cDNA sequence included in ATCC Deposit No:Z, 
which is hybridizable to SEQ ID NO:X; 

(e) a polynucleotide encoding a polypeptide of SEQ ID NO: Y or the cDNA 
sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X, 
having biological activity; 

(f) a polynucleotide which is a variant of SEQ ID NO:X; 

(g) a polynucleotide which is an allelic variant of SEQ ID NO:X; 

(h) a polynucleotide which encodes a species homologue of the SEQ ID NO: Y; 

(i) a polynucleotide capable of hybridizing under stringent conditions to any 
one of the polynucleotides specified in (a)-(h), wherein said polynucleotide does not 
hybridize under stringent conditions to a nucleic acid molecule having a nucleotide 
sequence of only A residues or of only T residues. 

2 . The isolated nucleic acid molecule of claim 1 , wherein the 
polynucleotide fragment comprises a nucleotide sequence encoding a secreted protein. 

3 . The isolated nucleic acid molecule of claim 1 , wherein the 
polynucleotide fragment comprises a nucleotide sequence encoding tiie sequence 
identified as SEQ ID NO: Y or the polypeptide encoded by the cDNA sequence included 
in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X. 
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4 . The isolated nucleic acid molecule of claim 1 , wherein the 
polynucleotide fragment comprises the entire nucleotide sequence of SEQ ID NO:X or 
the cDNA sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID 
NO:X. 

5 . The isolated nucleic acid molecule of claim 2, wherein the nucleotide 
sequence comprises sequential nucleotide deletions from either the C-terminus or the N- 
terminus. 

6 . The isolated nucleic acid molecule of claim 3, wherein the nucleotide 
sequence comprises sequential nucleotide deletions from either the C-terminus or the N- 
terminus. 

7. A recombinant vector comprising the isolated nucleic acid molecule of 
claim 1. 

8. A method of making a recombinant host cell comprising the isolated 
nucleic acid molecule of claim 1 . 

9. A recombinant host cell produced by the method of claim 8. 

10. The recombinant host cell of claim 9 comprising vector sequences. 

11. An isolated polypeptide comprising an amino acid sequence at least 95% 
identical to a sequence selected from the group consisting of: 

(a) a polypeptide fragment of SEQ ID NO: Y or the encoded sequence included 
in ATCC Deposit No:Z; 

(b) a polypeptide fragment of SEQ ID NO: Y or the encoded sequence included 
in ATCC Deposit No:Z, having biological activity; 

(c) a polypeptide domain of SEQ ID NO:Y or the encoded sequence included in 
ATCC Deposit No:Z; 

(d) a polypeptide epitope of SEQ ID NO: Y or the encoded sequence included in 
ATCC Deposit No:Z; 

(e) a secreted form of SEQ ID NO: Y or the encoded sequence included in 
ATCC Deposit No:Z; 

(f) a full length protein of SEQ ID NO: Y or the encoded sequence included in 
ATCC Deposit No:Z; 
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(g) a variant of SEQ ID NO: Y; 

(h) an allelic variant of SEQ ID NO: Y; or 

(i) a species homologue of the SEQ ID NO: Y. 

12. The isolated polypeptide of claim 1 1 , wherein the secreted form or the 
full length protein comprises sequential amino acid deletions from either the C-terminus 
or the N-terminus. 

1 3. An isolated antibody that binds specifically to the isolated polypeptide of 
claim 11. 

14. A recombinant host cell that expresses the isolated polypeptide of claim 

11. 

15. A method of making an isolated polypeptide comprising: 

(a) culturing the recombinant host cell of claim 14 under conditions such that 
said polypeptide is expressed; and 

(b) recovering said polypeptide. 

16. The polypeptide produced by claim 15. 

17. A method for preventing, treating, or ameliorating a medical condition, 
comprising administering to a manmialian subject a therapeutically effective amount of 
the polypeptide of claim 1 1 or the polynucleotide of claim 1 . 

18. A method of diagnosing a pathological condition or a susceptibility to a 
pathological condition in a subject comprising: 

(a) determining the presence or absence of a mutation in the polynucleotide of 
claim 1; and 

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or absence of said mutation. 

19. A method of diagnosing a pathological condition or a susceptibility to a 
pathological condition in a subject comprising: 

(a) determining the presence or amount of expression of the polypeptide of 
claim 1 1 in a biological sample; and 

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or amount of expression of the polypeptide. 
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20. A method for identifying a binding partner to the polypeptide of claim 1 1 
comprising: 

(a) contacting the polypeptide of claim 1 1 witfi a binding partner, and 

(b) determining whether the binding partner effects an activity of the 
polypeptide. 

2 1 . The gene corresponding to the cDNA sequence of SEQ ID NO: Y. 

22. A method of identifying an activity in a biological assay, wherein the 
method comprises: 

(a) expressing SEQ ID NO:X in a cell; 

(b) isolating the supernatant; 

(c) detecting an activity in a biological assay; and 

(d) identifying the protein in the supernatant having tiie activity. 

23. The product produced by tiie method of claim 22. 
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